[REQUEST] PDF organizer w/ OCR



Hello

 

I recently started going paperless and am scanning all my bills, documents, tax paperwork, etc. as PDF files.

 

What I am looking for is some sort of software to organize them in and make them searchable (i.e. OCR).

 

The only thing I found that comes remotely close is OCRmyPDF (https://registry.hub.docker.com/u/paulstaab/ocrmypdf/), but I am more interested in watching a folder, running OCR on new files, and then dumping them into a searchable index.

 

This is more or less what I am looking to do (http://matthopkins.com/technology/automatically-ocr-scanned-documents/), but running in a Docker container on unRAID. That way I can scan a document to a network folder, and everything else is automated.
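
For illustration, here is a minimal sketch of that watch, OCR, index loop. It is only an assumption of how such a container could work, not an existing unRAID app: the function names (`ocr_with_ocrmypdf`, `open_index`, `process_new_pdfs`, `search`, `watch`) and the folder layout are made up for the example, it assumes the `ocrmypdf` command-line tool is installed in the container, and a plain SQLite table with a `LIKE` query stands in for a real search index.

```python
import sqlite3
import subprocess
import time
from pathlib import Path
from typing import Callable, List


def ocr_with_ocrmypdf(src: Path, dst: Path) -> str:
    """Run the ocrmypdf CLI (assumed to be installed) and return the recognized text."""
    sidecar = dst.with_suffix(".txt")
    # --sidecar asks ocrmypdf to also write the recognized text to a plain text file.
    subprocess.run(
        ["ocrmypdf", "--sidecar", str(sidecar), str(src), str(dst)],
        check=True,
    )
    return sidecar.read_text(encoding="utf-8", errors="replace")


def open_index(db_path: str) -> sqlite3.Connection:
    """Open (or create) the hypothetical index: one row per archived PDF."""
    conn = sqlite3.connect(db_path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS docs (path TEXT PRIMARY KEY, body TEXT)"
    )
    return conn


def process_new_pdfs(
    inbox: Path,
    archive: Path,
    conn: sqlite3.Connection,
    ocr: Callable[[Path, Path], str] = ocr_with_ocrmypdf,
) -> List[str]:
    """OCR every PDF in the inbox, index its text, and remove the original scan."""
    archive.mkdir(parents=True, exist_ok=True)
    processed = []
    for src in sorted(inbox.glob("*.pdf")):
        dst = archive / src.name
        text = ocr(src, dst)
        conn.execute(
            "INSERT OR REPLACE INTO docs (path, body) VALUES (?, ?)",
            (str(dst), text),
        )
        conn.commit()
        src.unlink()  # the searchable copy now lives in the archive folder
        processed.append(src.name)
    return processed


def search(conn: sqlite3.Connection, term: str) -> List[str]:
    """Return archive paths whose recognized text contains the term."""
    rows = conn.execute(
        "SELECT path FROM docs WHERE body LIKE ? ORDER BY path",
        (f"%{term}%",),
    )
    return [row[0] for row in rows]


def watch(inbox: Path, archive: Path, db_path: str, interval: float = 30.0) -> None:
    """Poll the inbox forever; polling is a simple stand-in for real file-system events."""
    conn = open_index(db_path)
    while True:
        process_new_pdfs(inbox, archive, conn)
        time.sleep(interval)
```

Calling `watch(Path("/scans/inbox"), Path("/scans/archive"), "/scans/index.db")` (hypothetical paths) would run the loop. A real setup would also need to skip files the scanner is still writing and handle OCR failures, which this sketch does not do.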

 

I played around with Calibre for a little bit, but it's more geared towards e-books.

 

I don't even know if this is possible, but I figured this was a great place to start and see if anyone has any ideas.

 

Thanks
