I'm working through this setup as well and would appreciate any help.
I have paperless-ng setup and working.
Following the steps to enable this functionality I have done the following:
Added tika server using docker container "apache/tika"
Using default container port 9998
Added gotenberg using docker container "gotenberg/gotenberg"
Using default container port 3000
Added the following variables to my paperless-ng docker:
PAPERLESS_TIKA_ENABLED: 1
PAPERLESS_TIKA_ENDPOINT: http://tika:9998
PAPERLESS_TIKA_GOTENBERG_ENDPOINT: http://gotenberg:3000
When I try to add a word document to paperless-ng I get the following error:
Error while converting document to PDF: 404 Client Error: Not Found for url: http://gotenberg:3000/convert/office
The paperless-ng log shows:
09:06:18 [Q] INFO Process-1:2 processing [unraid.docx]
[2021-08-24 09:06:19,525] [INFO] [paperless.consumer] Consuming unraid.docx
[2021-08-24 09:06:19,540] [INFO] [paperless.parsing.tika] Sending /tmp/paperless/paperless-upload-_0wpk51m to Tika server
[2021-08-24 09:06:20,368] [INFO] [paperless.parsing.tika] Converting /tmp/paperless/paperless-upload-_0wpk51m to PDF as /tmp/paperless/paperless-cd7vp84i/convert.pdf
[2021-08-24 09:06:20,879] [ERROR] [paperless.consumer] Error while consuming document unraid.docx: Error while converting document to PDF: 404 Client Error: Not Found for url: http://gotenberg:3000/convert/office
Traceback (most recent call last):
File "/usr/src/paperless/src/paperless_tika/parsers.py", line 79, in convert_to_pdf
response.raise_for_status() # ensure we notice bad responses
File "/usr/local/lib/python3.9/site-packages/requests/models.py", line 953, in raise_for_status
raise HTTPError(http_error_msg, response=self)
requests.exceptions.HTTPError: 404 Client Error: Not Found for url: http://gotenberg:3000/convert/office
And I see the following in the gotenberg log:
A Docker-powered stateless API for PDF files.
Version: 7.0.2
-------------------------------------------------------
[SYSTEM] modules: api chromium gc libreoffice logging pdfcpu pdfengines pdftk unoconv unoconv-pdfengine
[SYSTEM] gc: application started
[SYSTEM] api: server listening on port 3000
{"level":"error","ts":1629813980.3731606,"logger":"api","msg":"code=404, message=Not Found","trace":"463a6609-9a4a-46ca-add3-4dad45b98694","remote_ip":"172.18.0.23","host":"gotenberg:3000","uri":"/convert/office","method":"POST","path":"/convert/office","referer":"","user_agent":"python-requests/2.26.0","status":404,"latency":46414,"latency_human":"46.414µs","bytes_in":1081155,"bytes_out":9}
Am I missing a configuration step?