You might take a look at a couple of projects.
First, to split your PDF up, you could use the Python-based stapler program:
https://github.com/hellerbarde/stapler/tree/master
Then, to convert the PDF to HTML, take a look at pdf2htmlEX:
https://github.com/coolwanglu/pdf2htmlEX
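
In case it helps, here is a rough Python sketch of how the two tools might be chained, one PDF and one HTML file per section. The section names and page ranges in SECTIONS are made up for illustration (you would fill them in from your table of contents), and build_commands and run_all are hypothetical helper names. The command lines follow the usage in each project's README as I remember it ("stapler sel input pages output" and "pdf2htmlEX --dest-dir dir input.pdf"), so please check them against each tool's --help before relying on this.

```python
import shutil
import subprocess
from pathlib import Path

# Hypothetical sections: (slug, first page, last page), 1-based and inclusive.
# Replace these with the real page ranges from the table of contents.
SECTIONS = [
    ("introduction", 1, 4),
    ("tables", 5, 12),
]


def build_commands(source_pdf, sections, out_dir):
    """Return the command lines for every section without running anything."""
    out = Path(out_dir)
    commands = []
    for slug, first, last in sections:
        section_pdf = out / f"{slug}.pdf"
        # Assumed stapler usage: stapler sel <input> <page range> <output>
        commands.append(
            ["stapler", "sel", str(source_pdf), f"{first}-{last}", str(section_pdf)]
        )
        # Assumed pdf2htmlEX usage: pdf2htmlEX --dest-dir <dir> <input.pdf>
        commands.append(["pdf2htmlEX", "--dest-dir", str(out), str(section_pdf)])
    return commands


def run_all(commands):
    """Run each command in order, stopping at the first failure."""
    for cmd in commands:
        if shutil.which(cmd[0]) is None:
            raise RuntimeError(f"{cmd[0]} is not installed or not on PATH")
        subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Dry run: only print the commands so they can be reviewed first.
    for cmd in build_commands("document.pdf", SECTIONS, "site"):
        print(" ".join(cmd))
```

Once the printed commands look right, create the output directory and pass the same list to run_all. That would leave a PDF and an HTML file for each section in one directory, so the site could offer either view.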
On Wed, Apr 29, 2015 at 9:04 AM, Sergio Letuche wrote:
> Dear all,
>
> We have a PDF, taken from a print-ready PDF, that is full of tables. The
> text is split into two columns. How would you suggest we upload this PDF to
> the web? We would like to keep the structure and publish each section from
> the table of contents as its own page, while also keeping the format and, if
> possible, serving the content in both an HTML view and a PDF view, based
> on the preference of the user.
>
> Looking forward to your input.
>
> The document is made with InDesign CS6, and I do not know which format I
> could convert it into.
>
> Best
>
--
Ronald Houk
Assistant Director
Ottumwa Public Library
102 W. Fourth Street
Ottumwa, IA 52501
(641)682-7563x203