Tom Morris

Results 686 comments of Tom Morris

I've changed this behavior in pull request #18

Code to deal with this, as well as convert Google Code style cross references, is included in this pull request: https://github.com/arthur-debert/google-code-issues-migrator/pull/18

The epub converts with Calibre and validates with Sigil, so perhaps the problem is in the epub to mobi converter.

There's still no EPUB online for that volume, but a locally generated epub validates using epubcheck 1.1 and 4.0.1.

This processes without generating errors now, but because the title and copyright page identification failed for _scandata.xml, the first page is lost because it's tagged as a title page, which...

I'm not 100% sure I follow what you're asking, but the file with the necessary information is available. For this example, if you look at https://archive.org/download/billgalactichero00harr/billgalactichero00harr_scandata.xml or, more generally, https://archive.org/download//_scandata.xml...

It's not easy for me to do remotely, but it's easy to do at scale at IA. It's simply a matter of scanning every _scandata.xml file for the signature string....

@wumpus If IA were to think about regenerating all epubs, I hope they'd consider fixing some of the other bugs first (e.g. there have been a number of reports of...

Well, the original patch wasn't even valid Python, plus it didn't handle a bunch of the additional types besides `Chapter`, but PR #34 has had some limited testing and appears...

OK, color me confused then. The current code _already_ checks addToAccessFormats: https://github.com/internetarchive/epub/blob/master/iabook_to_epub.py#L106 but it also checks for expected page types and I don't see any commits/PRs that change that behavior....