Deep Insights Posted May 18, 2021 Share Posted May 18, 2021 I currently have FILENAME_FORMAT set as: {create_year}/{correspondent}/{title}. This works well for me but not for my better half My wife wants to have all the scans under a separate folder (different from the one above that I'm using). She would like to have all scans stored under her name. If I can have all PDFs prefixed with her name (eg: wife_document1.pdf), how can I use/extract the prefix of the filename to have all documents stored under? In summary, My documents get stored at: /usr/src/paperless/media/{create_year}/{correspondent}/{title}. My wife would like to have everything under: /usr/src/paperless/media/wife_name/{create_year}/{correspondent}/{title}.{create_year}/{correspondent}/{title}. How do I accomplish this? Quote Link to comment
Deep Insights Posted May 18, 2021 Share Posted May 18, 2021 On 4/12/2021 at 1:30 PM, CvT said: Got paperless-ng up and running with Tika and Gotenberg, really nice! Does anybody know if you can use parts of the document's filename to tag? Would you mind sharing how you got Tika & Gotenberg installed/configured? Also, did you figure out how to use filename for tagging? Thanks. 2 Quote Link to comment
luk Posted May 20, 2021 Share Posted May 20, 2021 Hi all, how I can do a full backup of Paperless-ng ? I would like to avoid the situation, if I have to replace the hardware/hard disk, that I lose the data I have stored so far. Thank you, Luk Quote Link to comment
mattie112 Posted May 21, 2021 Share Posted May 21, 2021 On 5/20/2021 at 6:03 PM, luk said: Hi all, how I can do a full backup of Paperless-ng ? I would like to avoid the situation, if I have to replace the hardware/hard disk, that I lose the data I have stored so far. Thank you, Luk Just backup the mounted folder in your appdata or use the CA backup plugin 1 Quote Link to comment
luk Posted May 21, 2021 Share Posted May 21, 2021 1 hour ago, mattie112 said: Just backup the mounted folder in your appdata or use the CA backup plugin you mean the "Appdata Backup/Restore v2" plugin? Will be the database also included in backup? Quote Link to comment
mattie112 Posted May 21, 2021 Share Posted May 21, 2021 These folders contain the data: Simply backup those with whatever tool you like and you are good to go 1 Quote Link to comment
luk Posted May 22, 2021 Share Posted May 22, 2021 other question guys: How to backup the whole paperless docker? Quote Link to comment
mattie112 Posted May 22, 2021 Share Posted May 22, 2021 That's not an other question, that is the same question. Either use CA backup or backup the directories listed above with any other tool. Quote Link to comment
luk Posted May 24, 2021 Share Posted May 24, 2021 (edited) On 5/22/2021 at 10:22 AM, mattie112 said: That's not an other question, that is the same question. Either use CA backup or backup the directories listed above with any other tool. Ok, got it finally, thank you Edited May 24, 2021 by luk Quote Link to comment
r3wind Posted May 24, 2021 Share Posted May 24, 2021 Set it up this weekend, and having a small issue. I can manually add files just fine. However, the consumption watcher never hits the directory I'm using. I set the directory to a share on my server so I can export from my scanner, etc Looking at the logs, it seems to be watching a different directory for changes: I'm sure I made a simple mistake but I think I've looked at it so long now I'm overlooking it. Quote Link to comment
Deep Insights Posted May 26, 2021 Share Posted May 26, 2021 How to change LOGLEVEL to INFO? Currently, in the paperless.log, I see DEBUG messages. I'm trying to minimize disk writes. In paperless-ng documentation, I only see two LOG related variables (PAPERLESS_LOGROTATE_MAX_SIZE, PAPERLESS_LOGROTATE_MAX_BACKUPS). There's nothing to change the LOGLEVEL. Any idea how do I change the log level? Thanks. Quote Link to comment
mattie112 Posted May 26, 2021 Share Posted May 26, 2021 I only see INFO messages: What kind of DEBUG messages do you see? Quote Link to comment
Lumpy_BD Posted June 8, 2021 Share Posted June 8, 2021 Hi all, I'd like to add a password to my Redis setup. Is there anyway to get Paperless to authenticate with Redis with a password? It doesn't appear obvious at initial glance. Thanks. Quote Link to comment
mattie112 Posted June 8, 2021 Share Posted June 8, 2021 24 minutes ago, Lumpy_BD said: Hi all, I'd like to add a password to my Redis setup. Is there anyway to get Paperless to authenticate with Redis with a password? It doesn't appear obvious at initial glance. Thanks. https://forums.unraid.net/search/?q=Redis password&quick=1&type=forums_topic&item=100843 Quote Link to comment
nug Posted June 9, 2021 Share Posted June 9, 2021 (edited) My Paperless was working well for a while but my mail fetching doesn't seem to work anymore. When I manually add files by drag and drop or browse files it all works fine. When I run the mail fetcher I get this in the paperless.log This is also shown in the docker log: The mail fetching side of things seems to be working fine: I have 777 permissions on my three mounted Media, Consume and Export directories and as I said - manual uploads work fine. Has anyone had any issues with mail fetching lately? Edited June 9, 2021 by nug redaction of file names, oops. Quote Link to comment
mattie112 Posted June 9, 2021 Share Posted June 9, 2021 4 hours ago, nug said: My Paperless was working well for a while but my mail fetching doesn't seem to work anymore. When I manually add files by drag and drop or browse files it all works fine. When I run the mail fetcher I get this in the paperless.log This is also shown in the docker log: The mail fetching side of things seems to be working fine: I have 777 permissions on my three mounted Media, Consume and Export directories and as I said - manual uploads work fine. Has anyone had any issues with mail fetching lately? I have no issues with the latest version (no update available). Does this happen with all mails? Or only a certain document? Quote Link to comment
nug Posted June 9, 2021 Share Posted June 9, 2021 (edited) 4 hours ago, mattie112 said: I have no issues with the latest version (no update available). Does this happen with all mails? Or only a certain document? Every email and on both accounts that I have fetching. Edited June 9, 2021 by nug Quote Link to comment
strongy Posted July 12, 2021 Share Posted July 12, 2021 (edited) On 3/20/2021 at 3:36 PM, Shad0wWulf said: For everyone using Bitnami Redis with a password the answer to this is to use this connection string: redis://default:[PASSWORD]@[IP]:6379 if like me anyone else has the problem with it saying invalid combination or username disabled try leaving the username blank, thats what made mine work. redis://:[email protected]:6379 Edited July 12, 2021 by strongy Quote Link to comment
GigaGuy Posted July 22, 2021 Share Posted July 22, 2021 Has anyone figured out how to convert emailed receipts to pdf or some format that can be stored in paperless-ng? The emails seem to be HTML. I can pull PDF from my email, but not these receipts that are the body of the email. TIA Quote Link to comment
mattie112 Posted July 24, 2021 Share Posted July 24, 2021 You can "print" your email to PDF and then store it. Perhaps the most simple solution for a few reciepts? Quote Link to comment
Ehtele Posted August 6, 2021 Share Posted August 6, 2021 Does anyone know how to install extra languages, or know any resources they can point me to? The default tesseract package only includes six languages. Quote Link to comment
lgb Posted August 19, 2021 Share Posted August 19, 2021 Hi there folks, here another user with some configuration problems. Here are the steps I've completed so far: 1) Redis docker installation with default configuration. No password has been configured, just plain installation: 2) Paperless-ng installation; folders have been configured and here comes the problem: redis IP config (and maybe the consumption folder). I'm trying to follow the same workflow explained by the OP. This is, taking documents from SMB folder in printer and process them. Here is my docker config. Must say I've tried with localhost, 127.0.0.1, server name and so on, and nothing seems to work. 3) Finally, paperless-ng logs: [2021-08-19 10:44:31,907] [INFO] [paperless.management.consumer] Adding /usr/src/paperless/src/../consume/scan.jpg to the task queue. [2021-08-19 10:44:31,914] [ERROR] [paperless.management.consumer] Error while consuming document Traceback (most recent call last): File "/usr/local/lib/python3.7/site-packages/redis/connection.py", line 559, in connect sock = self._connect() File "/usr/local/lib/python3.7/site-packages/redis/connection.py", line 615, in _connect raise err File "/usr/local/lib/python3.7/site-packages/redis/connection.py", line 603, in _connect sock.connect(socket_address) ConnectionRefusedError: [Errno 111] Connection refused During handling of the above exception, another exception occurred: Traceback (most recent call last): File "/usr/src/paperless/src/documents/management/commands/document_consumer.py", line 76, in _consume task_name=os.path.basename(filepath)[:100]) File "/usr/local/lib/python3.7/site-packages/django_q/tasks.py", line 73, in async_task enqueue_id = broker.enqueue(pack) File "/usr/local/lib/python3.7/site-packages/django_q/brokers/redis_broker.py", line 18, in enqueue return self.connection.rpush(self.list_key, task) File "/usr/local/lib/python3.7/site-packages/redis/client.py", line 2016, in rpush return self.execute_command('RPUSH', name, *values) File "/usr/local/lib/python3.7/site-packages/redis/client.py", line 898, in execute_command conn = self.connection or pool.get_connection(command_name, **options) File "/usr/local/lib/python3.7/site-packages/redis/connection.py", line 1192, in get_connection connection.connect() File "/usr/local/lib/python3.7/site-packages/redis/connection.py", line 563, in connect raise ConnectionError(self._error_message(e)) redis.exceptions.ConnectionError: Error 111 connecting to 127.0.0.1:6379. Connection refused. [2021-08-19 10:44:31,919] [INFO] [paperless.management.consumer] Adding /usr/src/paperless/src/../consume/scan.pdf to the task queue. [2021-08-19 10:44:31,924] [ERROR] [paperless.management.consumer] Error while consuming document Traceback (most recent call last): File "/usr/local/lib/python3.7/site-packages/redis/connection.py", line 559, in connect sock = self._connect() File "/usr/local/lib/python3.7/site-packages/redis/connection.py", line 615, in _connect raise err File "/usr/local/lib/python3.7/site-packages/redis/connection.py", line 603, in _connect sock.connect(socket_address) ConnectionRefusedError: [Errno 111] Connection refused During handling of the above exception, another exception occurred: Traceback (most recent call last): File "/usr/src/paperless/src/documents/management/commands/document_consumer.py", line 76, in _consume task_name=os.path.basename(filepath)[:100]) File "/usr/local/lib/python3.7/site-packages/django_q/tasks.py", line 73, in async_task enqueue_id = broker.enqueue(pack) File "/usr/local/lib/python3.7/site-packages/django_q/brokers/redis_broker.py", line 18, in enqueue return self.connection.rpush(self.list_key, task) File "/usr/local/lib/python3.7/site-packages/redis/client.py", line 2016, in rpush return self.execute_command('RPUSH', name, *values) File "/usr/local/lib/python3.7/site-packages/redis/client.py", line 898, in execute_command conn = self.connection or pool.get_connection(command_name, **options) File "/usr/local/lib/python3.7/site-packages/redis/connection.py", line 1192, in get_connection connection.connect() File "/usr/local/lib/python3.7/site-packages/redis/connection.py", line 563, in connect raise ConnectionError(self._error_message(e)) redis.exceptions.ConnectionError: Error 111 connecting to 127.0.0.1:6379. Connection refused. [2021-08-19 10:44:31,926] [WARNING] [paperless.management.consumer] Not consuming file /usr/src/paperless/src/../consume/.HPIMAGE.VFS: Unknown file extension. [2021-08-19 10:44:31,928] [INFO] [paperless.management.consumer] Using inotify to watch directory for changes: /usr/src/paperless/src/../consume Thank you! Quote Link to comment
T0a Posted August 19, 2021 Author Share Posted August 19, 2021 What does the log tell you when you use your server IP address i.e. `redis://XXX.XXX.XXX.XX:6379`. I think `127.0.0.1` will not work, because it tries to resolve Redis in the paperless-ng container then. Keep me posted. 1 Quote Link to comment
lgb Posted August 19, 2021 Share Posted August 19, 2021 (edited) 39 minutes ago, T0a said: What does the log tell you when you use your server IP address i.e. `redis://XXX.XXX.XXX.XX:6379`. I think `127.0.0.1` will not work, because it tries to resolve Redis in the paperless-ng container then. Keep me posted. Great! Changed IP to 192.168.x.x and seems network issue has been solved. Now a new error appears: [2021-08-19 11:37:06,938] [INFO] [paperless.management.consumer] Adding /usr/src/paperless/src/../consume/scan.jpg to the task queue. [2021-08-19 11:37:06,944] [INFO] [paperless.management.consumer] Adding /usr/src/paperless/src/../consume/scan.pdf to the task queue. [2021-08-19 11:37:06,948] [WARNING] [paperless.management.consumer] Not consuming file /usr/src/paperless/src/../consume/.HPIMAGE.VFS: Unknown file extension. [2021-08-19 11:37:06,950] [INFO] [paperless.management.consumer] Using inotify to watch directory for changes: /usr/src/paperless/src/../consume [2021-08-19 11:37:10,173] [INFO] [paperless.consumer] Consuming scan.pdf [2021-08-19 11:37:15,214] [DEBUG] [paperless.consumer] Detected mime type: application/pdf [2021-08-19 11:37:15,314] [DEBUG] [paperless.consumer] Parser: RasterisedDocumentParser [2021-08-19 11:37:15,323] [DEBUG] [paperless.consumer] Parsing scan.pdf... [2021-08-19 11:37:16,592] [INFO] [paperless.consumer] Consuming scan.jpg [2021-08-19 11:37:20,145] [DEBUG] [paperless.consumer] Detected mime type: image/jpeg [2021-08-19 11:37:20,162] [DEBUG] [paperless.consumer] Parser: RasterisedDocumentParser [2021-08-19 11:37:20,170] [DEBUG] [paperless.consumer] Parsing scan.jpg... [2021-08-19 11:37:20,248] [WARNING] [paperless.parsing.tesseract] Error while getting text from PDF document with pdfminer.six Traceback (most recent call last): File "/usr/src/paperless/src/paperless_tesseract/parsers.py", line 120, in extract_text stripped = post_process_text(pdfminer_extract_text(pdf_file)) File "/usr/local/lib/python3.7/site-packages/pdfminer/high_level.py", line 119, in extract_text caching=caching, File "/usr/local/lib/python3.7/site-packages/pdfminer/pdfpage.py", line 128, in get_pages doc = PDFDocument(parser, password=password, caching=caching) File "/usr/local/lib/python3.7/site-packages/pdfminer/pdfdocument.py", line 572, in __init__ self.read_xref_from(parser, pos, self.xrefs) File "/usr/local/lib/python3.7/site-packages/pdfminer/pdfdocument.py", line 806, in read_xref_from (pos, token) = parser.nexttoken() File "/usr/local/lib/python3.7/site-packages/pdfminer/psparser.py", line 493, in nexttoken self.fillbuf() File "/usr/local/lib/python3.7/site-packages/pdfminer/psparser.py", line 219, in fillbuf self.buf = self.fp.read(self.BUFSIZ) PermissionError: [Errno 13] Permission denied [2021-08-19 11:37:20,763] [DEBUG] [paperless.parsing.tesseract] Calling OCRmyPDF with args: {'input_file': '/usr/src/paperless/src/../consume/scan.pdf', 'output_file': '/tmp/paperless/paperless-rhgt24hs/archive.pdf', 'use_threads': True, 'jobs': 3, 'language': 'spa+eng', 'output_type': 'pdfa', 'progress_bar': False, 'skip_text': True, 'clean': True, 'deskew': True, 'rotate_pages': True, 'rotate_pages_threshold': 12.0, 'sidecar': '/tmp/paperless/paperless-rhgt24hs/sidecar.txt'} Thank you @T0a! Forgot to say I'm using unassigned devices plugin to mount SMB folder Edited August 19, 2021 by lgb Adding some info Quote Link to comment
mattie112 Posted August 19, 2021 Share Posted August 19, 2021 For reference: 127.0.0.1 IN A container refers to the container itself. So that will only work if you have a single container with for example paperless AND redis. 1 Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.