I recently got a book in eBook format, but unfortunately, there isn't an audiobook version available. As someone who is visually impaired, I'm looking for a solution where I can take screenshots of the book pages and convert them into plain text. This way, I could feed the text into my text-to-speech software for a continuous listening experience, similar to an audiobook. I've tried using Google Gemini for this, but it wouldn't transcribe the text. Are there any free or low-cost tools that can help me convert these screenshots into text?
1 Answer
It sounds like you need an OCR (Optical Character Recognition) solution. But since you mentioned you have an eBook, it may be worth trying Calibre first: if the file is in a common format like .epub or .mobi, Calibre can convert it directly into plain text, with no screenshots needed. Just a heads up, some books are protected by DRM, which will block the conversion; that is likely if you bought the book from Amazon.
If you do go the screenshot route, you might want to check out gscan2pdf too. It can run OCR on page images and produce a PDF with searchable text, which could work better with your screen reader.
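If you are comfortable running a small script, here is a rough sketch of the screenshot-to-text idea in Python. It assumes you have the free Tesseract OCR engine installed along with the `pytesseract` and `Pillow` packages, and that your screenshots are PNG files named in page order (for example `page1.png`, `page2.png`). The function names, folder name, and output file name are made up for illustration, so adjust them to your setup.

```python
import re
from pathlib import Path


def natural_key(name: str):
    """Sort key so that page2.png comes before page10.png."""
    return [int(part) if part.isdecimal() else part.lower()
            for part in re.split(r"(\d+)", name)]


def join_pages(pages):
    """Join per-page text and repair words hyphenated across line breaks."""
    text = "\n".join(page.strip() for page in pages)
    # Turn "exam-\nple" into "example" so the speech output does not stumble.
    return re.sub(r"(\w)-\n(\w)", r"\1\2", text)


def ocr_folder(folder: str, output: str, lang: str = "eng") -> None:
    """OCR every PNG in `folder` in page order and write one text file."""
    # Imported here so the helpers above work even without the OCR packages.
    from PIL import Image
    import pytesseract

    images = sorted(Path(folder).glob("*.png"),
                    key=lambda p: natural_key(p.name))
    pages = [pytesseract.image_to_string(Image.open(p), lang=lang)
             for p in images]
    Path(output).write_text(join_pages(pages), encoding="utf-8")
```

Calling `ocr_folder("screenshots", "book.txt")` should then leave you with a single text file to feed into your text-to-speech software. OCR accuracy depends heavily on screenshot resolution, so larger text in the reader app usually gives cleaner results.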

Thanks for the tips! I found out that my book is specifically in Amazon's format, and Calibre ran into a DRM error. But I appreciate the info!