If there are particular HTML errors you encountered as a pattern in the
journal website, please feel free to let us know about them on the
Journal listserv, if you're interested.
On 1/4/2011 12:30 PM, Louis St-Amour wrote:
> Given my journal2epub script's experience with the Code4Lib journal
> site, does Anthologize have an option to produce TOC items from post
> headings, modifying the HTML to add IDs where necessary? Does it map
> links to posts with their offline copies, preserving references? Does
> it try to add the largest image it can, or does it include only
> embedded, potentially smaller ones? (In iBooks, unlike Adobe-based
> readers, you can double-tap to zoom in on an automatically resized
> large image.) Are metadata and stylesheets specified manually? And
> finally, does it clean up the HTML to produce strict XHTML 1.1 as
> required? In the journal's case, I had to process HTML three times
> with manual checks to delete invalid attributes before things would
> mostly validate. (Turns out validation is the hardest thing about
> automatically producing EPUB files.) As to my script's use in
> producing official EPUB files, sure, that's why I made it. But if you
> look closely, it makes assumptions about the HTML structure of the
> pages, so it might need modifications if the design or templates
> change.
>
>
> Louis.
>
> Sent from my iPhone
>
> On 2011-01-04, at 12:01 PM, "Hanrath, Scott"<[log in to unmask]> wrote:
>
>> Anthologize lets you be as picky as you like about the content you use
>> with it. Essentially you create multiple Anthologize 'projects', then add
>> the whatever subset of content you need (native local WordPress content or
>> content imported via a feed) to the project. The Anthologize content is
>> added as copies, preserving the originals and allowing for editing
>> specific to your output needs.
>>
>> Eric's right that it *is* manual and a bit tedious, but it's (hopefully)
>> getting less so. You do need to created a 'part' structure within your
>> project to organize your content. But when adding content you can filter
>> by Tag/Category/Date Range/Post Type. And with the last release you can
>> add more than one post at a time.
>>
>> The Anthologize dev team would certainly be interested in the code4lib
>> journal committee's take on the tool and ways it could be improved.
>> (Support for automated project creation and output generation would an
>> interesting feature to see on the roadmap).
>>
>> -- Scott
>>
>> On 1/4/11 10:45 AM, "Eric Lease Morgan"<[log in to unmask]> wrote:
>>
>>> On Jan 4, 2011, at 11:40 AM, Jonathan Rochkind wrote:
>>>
>>>> ...Is there any easy way to get it to, for instance, make an anthology
>>>> of
>>>> all the posts with a certain WordPress tag or category instead?...
>>> Based on my (poor) recollection of playing with the Anthologize plug-in,
>>> the process is a bit manual. Initialize epub. Drag postings to it.
>>> Annotate/tweak titles. Click 'Go'. Get epub file. The process is not
>>> laborious, just a bit tedious. I would definitely recommend the "Journal
>>> Committee" experiment with Anthologize.
>>>
>>> --
>>> Eric Morgan
|