Skip to content
Ian McDougall edited this page Jun 4, 2015 · 6 revisions

ToDo

  • Check each epub for duplicate files. Not sure how common this is.
  • Change the html sed commands to only change filenames, not text.
  • Change the syntax to take arguments, instead of blindly operating on all epub files.
  • Ideally, use epubcheck to validate the result. This is slightly complicated because what I really care about is if I have added any more warnings, ignoring those that existed already.
  • Currently, the script doesn't handle css, db, dfont, epub (for a preview chapter), ncx, opf, otf, plist, ttc, ttf, txt, xml, or xpgt files that are found inside an epub. However, these are all file types that I have observed.
  • I have not observed svg files, but the OPS spec does allow them. If these are encountered, scour might be a good tool to use.
  • I have not experimented with different compression levels for zip.
  • I have not experimented with different file formats than png. A tool like optipng might be useful here, depending on the default settings of imagemagick.
Clone this wiki locally