on making and using ebooks

by now i've put a fair amount of time into hobbyist book digitization and dedrming ebooks. this is my accumulated wisdom: reasons, tools, tricks. updated <2021-10-16 六> and <2025-12-07 日>.

i have used ebooks almost to the exclusion of physical copies for over a decade, for a few reasons:

  1. possession of a physical copy creates a psychological illusion that one has already done the work, a barrier to actually reading (hence tsundoku etc)
  2. it's simply not economical to collect physical copies. if you're doing research, you just need to bite the bullet and get 100% used to ebooks, whatever it takes. the time saved in pulling up whatever texts the chain of mental associations takes you to ultimately creates an altered relationship with texts that previous generations couldn't have imagined.
  3. in the past i was very adhd and relied on text-to-speech while reading to keep my focus from drifting. i am more medicated now, but still do this to an extent.

tools for scanning ebooks

these are tools i've found useful at some point. however, i no longer have a working pipeline, as it's been a long time since i digitized any books and some tools have become broken. i was never completely happy with my pipeline anyway.

improving existing pdfs

many ebooks are put together badly in one way or another: pdfs that are badly scanned, have a spread per page, skewed pages, bad or nonexistent ocr layer, no table of contents. there are a few ways to correct them short of finding and rescanning the book. i often find myself using briss or pdfsandwich to improve somebody else's badly-done scans before reading.

converting between formats

it's also often expedient to convert between various formats (epub to pdf, pdf to mobi, etc.) for different purposes, though the process is very lossy.

dedrming books

tools for reading ebooks

i've ended up trying many ebook readers, and only a few have been particularly worthwhile.

books that i have put together

i've scanned or dedrmed around fifty books, most of which i uploaded to the now-defunct aaarg (RIP), whose contents have now been merged into anna's archive.