Docspell Hilfe


Hoddl

Recommended Posts

wenn ich es auf close stelle komme ich nicht mehr in die WebUI also die ip wird nicht mehr gefunden...

ich hab es nun auf Invite gesetzt und alles ist gut...

 

Einen Ordner im Ordner z.b. Hauptorder

                                                 Unterordner

                                                       Unterordner

 

geht wohl nicht???

Link to comment


 

6 hours ago, Hoddl said:
 
Einen Ordner im Ordner z.b. Hauptorder
                                                 Unterordner
 
geht wohl nicht???

 


Nein, leider nicht.
Es gibt aber ein issue bzw Feature Request auf Github "Nested folders & Tags" hierzu, dem könntest du dich "anschließen".

Habe mich dem auch angeschlossen, da ich finde es wäre eine zusätzliche und praktische Sortier- und Suchmöglichkeit.

 

Link to comment
1 hour ago, Hoddl said:

ich hab mir das Skript zur "Überwachung eines Ordners" angeschaut... nur ist es mir nicht klar wo und wie das skript laufen muss/soll...

Wenn ich das richtig sehe, ist das von dir genannte Script der Consumedir Container.

Wenn du das Verzeichnis "/mnt/appdata/docspell/docs" angelegt hast musst du noch einen Unterordner erstellen, der den Namen deines "Collective" trägt. Dort legst du die Dokumente ab und die werden dann automatisch importiert.

 

30 minutes ago, Hoddl said:

jetzt stoppt docspell-joex immer

was sagen die Joex Logs?

Link to comment

hier das Log...

 

 

ist etwas groß 🙂

 

 

FehlerWarnungSystemArrayAnmelden


[ioapp-compute-5] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Next job found: Some(FUJ3Yznjz.../docspell-system/make-preview/Low)
[ioapp-compute-5] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Creating context for job FUJ3Yznjz.../docspell-system/make-preview/Low to run JobTask(Ident(make-preview),docspell.joex.scheduler.Task$$anonfun$contramap$4@789d931d,docspell.joex.scheduler.Task$$anonfun$contramap$4@35599e84)
[ioapp-compute-1] [34mINFO [0;39m [36mo.h.b.c.n.NIO1SocketServerGroup[0;39m - Service bound to address /0:0:0:0:0:0:0:0:7878
[ioapp-compute-1] [34mINFO [0;39m [36mo.h.s.b.BlazeServerBuilder[0;39m - http4s v0.21.19 on blaze v0.14.15 started at http://[::]:7878/
[ioapp-compute-5] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Forking job FUJ3Yznjz.../docspell-system/make-preview/Low
[ioapp-compute-5] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Try to acquire permit (0 free)
[ioapp-compute-3] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Starting task now
[ioapp-compute-2] [34mINFO [0;39m [36md.j.s.LogSink[0;39m - >>> 2021-03-05T14:01:12.920345Z Info FUJ3Yznjz.../docspell-system/make-preview/Low: Generating preview image for attachment Ident(7eo43raYQqy-ZzRwenWHSwM-fwZ8HTQ2Dqw-LJgUkvi2mPF)
java.lang.OutOfMemoryError: Java heap space
at java.desktop/java.awt.image.DataBufferByte.<init>(DataBufferByte.java:76)
at java.desktop/java.awt.image.Raster.createInterleavedRaster(Raster.java:266)
at java.desktop/com.sun.imageio.plugins.jpeg.JPEGImageReader.readInternal(JPEGImageReader.java:1228)
at java.desktop/com.sun.imageio.plugins.jpeg.JPEGImageReader.readRaster(JPEGImageReader.java:1541)
at com.twelvemonkeys.imageio.plugins.jpeg.JPEGImageReader.readImageAsRasterAndReplaceColorProfile(JPEGImageReader.java:502)
at com.twelvemonkeys.imageio.plugins.jpeg.JPEGImageReader.read(JPEGImageReader.java:395)
at org.apache.pdfbox.filter.DCTFilter.decode(DCTFilter.java:91)
at org.apache.pdfbox.cos.COSInputStream.create(COSInputStream.java:80)
at org.apache.pdfbox.cos.COSStream.createInputStream(COSStream.java:179)
at org.apache.pdfbox.pdmodel.common.PDStream.createInputStream(PDStream.java:241)
at org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.createInputStream(PDImageXObject.java:793)
at org.apache.pdfbox.pdmodel.graphics.image.SampledImageReader.from8bit(SampledImageReader.java:517)
at org.apache.pdfbox.pdmodel.graphics.image.SampledImageReader.getRGBImage(SampledImageReader.java:226)
at org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.getImage(PDImageXObject.java:479)
at org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.getImage(PDImageXObject.java:460)
at org.apache.pdfbox.rendering.PageDrawer.drawImage(PageDrawer.java:1059)
at org.apache.pdfbox.contentstream.operator.graphics.DrawObject.process(DrawObject.java:67)
at org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:933)
at org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:515)
at org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:489)
at org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:156)
at org.apache.pdfbox.rendering.PageDrawer.drawPage(PageDrawer.java:275)
at org.apache.pdfbox.rendering.PDFRenderer.renderImage(PDFRenderer.java:347)
at org.apache.pdfbox.rendering.PDFRenderer.renderImage(PDFRenderer.java:268)
at org.apache.pdfbox.rendering.PDFRenderer.renderImageWithDPI(PDFRenderer.java:240)
at docspell.extract.pdfbox.PdfboxPreview$.docspell$extract$pdfbox$PdfboxPreview$$getPageImage(PdfboxPreview.scala:46)
at docspell.extract.pdfbox.PdfboxPreview$$anon$1.$anonfun$previewImage$2(PdfboxPreview.scala:29)
at docspell.extract.pdfbox.PdfboxPreview$$anon$1$$Lambda$1975/0x000000010098a040.apply(Unknown Source)
at cats.effect.internals.IORunLoop$.cats$effect$internals$IORunLoop$$loop(IORunLoop.scala:104)
at cats.effect.internals.IORunLoop$.restartCancelable(IORunLoop.scala:51)
at cats.effect.internals.IOBracket$BracketStart.run(IOBracket.scala:100)
at cats.effect.internals.Trampoline.cats$effect$internals$Trampoline$$immediateLoop(Trampoline.scala:67)
[blaze-acceptor-0-0] [34mINFO [0;39m [36mo.h.b.c.ServerChannel[0;39m - Closing NIO1 channel /0:0:0:0:0:0:0:0:7878
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-0
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-1
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-2
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-3
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-4
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-5
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-6
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-7
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-8
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-acceptor-0-0
[shutdownHook1] [34mINFO [0;39m [36md.b.ops.ONode[0;39m - Unregister app joex1
[ioapp-compute-1] [34mINFO [0;39m [36mc.z.h.HikariDataSource[0;39m - HikariPool-1 - Shutdown initiated...
[ioapp-compute-1] [34mINFO [0;39m [36mc.z.h.HikariDataSource[0;39m - HikariPool-1 - Shutdown completed.
[ioapp-compute-1] [34mINFO [0;39m [36mo.h.c.PoolManager[0;39m - Shutting down connection pool: curAllocated=0 idleQueues.size=0 waitQueue.size=0 maxWaitQueueLimit=256 closed=false
Starting unoconv listener
[main] [34mINFO [0;39m [36md.joex.Main[0;39m - Using given config file: /opt/docspell.conf
[main] [34mINFO [0;39m [36md.joex.Main[0;39m -
***> ______ _ _
***> | _ \ | | |
***> | | | |___ ___ ___ _ __ ___| | |
***> | | | / _ \ / __/ __| '_ \ / _ \ | |
***> | |/ / (_) | (__\__ \ |_) | __/ | |
***> |___/ \___/ \___|___/ .__/ \___|_|_|
***> | |
***> |_| v0.20.0 (#4d3a25a8)
***> << JOEX >>
***> Id: joex1
***> Base-Url: http://192.168.178.4:7878
***> Database: jdbc:postgresql://192.168.178.6:5432/docspell
***> Fts: http://192.168.178.2:8983/solr/docspell
***> Config: /opt/docspell.conf
***>
[ioapp-compute-0] [34mINFO [0;39m [36md.s.m.FlywayMigrate[0;39m - Running db migrations...
[ioapp-compute-0] [34mINFO [0;39m [36md.s.m.FlywayMigrate[0;39m - Using migration locations: List(classpath:db/migration/postgresql)
[ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.l.VersionPrinter[0;39m - Flyway Community Edition 7.5.3 by Redgate
[ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.d.b.DatabaseType[0;39m - Database: jdbc:postgresql://192.168.178.6:5432/docspell (PostgreSQL 11.7)
[ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.s.JdbcTableSchemaHistory[0;39m - Repair of failed migration in Schema History table "public"."flyway_schema_history" not necessary. No failed migration detected.
[ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.c.DbRepair[0;39m - Successfully repaired schema history table "public"."flyway_schema_history" (execution time 00:00.067s).
[ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.l.VersionPrinter[0;39m - Flyway Community Edition 7.5.3 by Redgate
[ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.c.DbValidate[0;39m - Successfully validated 29 migrations (execution time 00:00.024s)
[ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.c.DbMigrate[0;39m - Current version of schema "public": 1.20.4
[ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.c.DbMigrate[0;39m - Schema "public" is up to date. No migration necessary.
[ioapp-compute-0] [34mINFO [0;39m [36md.a.n.PipelineCache[0;39m - Clearing StanfordNLP cache after Duration(900000ms) idle time
[ioapp-compute-0] [34mINFO [0;39m [36md.a.n.PipelineCache[0;39m - Creating nlp pipeline cache
[docspell-joex-dbconnect-0] [34mINFO [0;39m [36mc.z.h.HikariDataSource[0;39m - HikariPool-1 - Starting...
[docspell-joex-dbconnect-0] [34mINFO [0;39m [36mc.z.h.HikariDataSource[0;39m - HikariPool-1 - Start completed.
[ioapp-compute-5] [34mINFO [0;39m [36md.j.s.SchedulerImpl[0;39m - Starting scheduler
[ioapp-compute-6] [34mINFO [0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Starting periodic scheduler
[ioapp-compute-4] [34mINFO [0;39m [36md.b.ops.ONode[0;39m - Registering node joex1
[ioapp-compute-5] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Try to acquire permit (1 free)
[ioapp-compute-6] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Going into main loop
[ioapp-compute-6] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Looking for next periodic task
[ioapp-compute-5] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - New permit acquired
[ioapp-compute-3] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Found periodic task 'Docspell house-keeping/Sun *-*-* 00:00:00'
[ioapp-compute-3] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Scheduling next notify for timer Sun *-*-* 00:00:00 -> Some(2021-03-07T00:00)
[ioapp-compute-6] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Waiting for notify
[ioapp-compute-7] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Next job found: Some(FUJ3Yznjz.../docspell-system/make-preview/Low)
[ioapp-compute-7] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Creating context for job FUJ3Yznjz.../docspell-system/make-preview/Low to run JobTask(Ident(make-preview),docspell.joex.scheduler.Task$$anonfun$contramap$4@40fb8022,docspell.joex.scheduler.Task$$anonfun$contramap$4@6f62ed31)
[ioapp-compute-4] [34mINFO [0;39m [36mo.h.b.c.n.NIO1SocketServerGroup[0;39m - Service bound to address /0:0:0:0:0:0:0:0:7878
[ioapp-compute-4] [34mINFO [0;39m [36mo.h.s.b.BlazeServerBuilder[0;39m - http4s v0.21.19 on blaze v0.14.15 started at http://[::]:7878/
[ioapp-compute-7] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Forking job FUJ3Yznjz.../docspell-system/make-preview/Low
[ioapp-compute-7] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Try to acquire permit (0 free)
[ioapp-compute-2] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Starting task now
[ioapp-compute-3] [34mINFO [0;39m [36md.j.s.LogSink[0;39m - >>> 2021-03-05T14:01:34.611363Z Info FUJ3Yznjz.../docspell-system/make-preview/Low: Generating preview image for attachment Ident(7eo43raYQqy-ZzRwenWHSwM-fwZ8HTQ2Dqw-LJgUkvi2mPF)
java.lang.OutOfMemoryError: Java heap space
at java.desktop/java.awt.image.DataBufferByte.<init>(DataBufferByte.java:76)
at java.desktop/java.awt.image.Raster.createInterleavedRaster(Raster.java:266)
at java.desktop/com.sun.imageio.plugins.jpeg.JPEGImageReader.readInternal(JPEGImageReader.java:1228)
at java.desktop/com.sun.imageio.plugins.jpeg.JPEGImageReader.readRaster(JPEGImageReader.java:1541)
at com.twelvemonkeys.imageio.plugins.jpeg.JPEGImageReader.readImageAsRasterAndReplaceColorProfile(JPEGImageReader.java:502)
at com.twelvemonkeys.imageio.plugins.jpeg.JPEGImageReader.read(JPEGImageReader.java:395)
at org.apache.pdfbox.filter.DCTFilter.decode(DCTFilter.java:91)
at org.apache.pdfbox.cos.COSInputStream.create(COSInputStream.java:80)
at org.apache.pdfbox.cos.COSStream.createInputStream(COSStream.java:179)
at org.apache.pdfbox.pdmodel.common.PDStream.createInputStream(PDStream.java:241)
at org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.createInputStream(PDImageXObject.java:793)
at org.apache.pdfbox.pdmodel.graphics.image.SampledImageReader.from8bit(SampledImageReader.java:517)
at org.apache.pdfbox.pdmodel.graphics.image.SampledImageReader.getRGBImage(SampledImageReader.java:226)
at org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.getImage(PDImageXObject.java:479)
at org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.getImage(PDImageXObject.java:460)
at org.apache.pdfbox.rendering.PageDrawer.drawImage(PageDrawer.java:1059)
at org.apache.pdfbox.contentstream.operator.graphics.DrawObject.process(DrawObject.java:67)
at org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:933)
at org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:515)
at org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:489)
at org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:156)
at org.apache.pdfbox.rendering.PageDrawer.drawPage(PageDrawer.java:275)
at org.apache.pdfbox.rendering.PDFRenderer.renderImage(PDFRenderer.java:347)
at org.apache.pdfbox.rendering.PDFRenderer.renderImage(PDFRenderer.java:268)
at org.apache.pdfbox.rendering.PDFRenderer.renderImageWithDPI(PDFRenderer.java:240)
at docspell.extract.pdfbox.PdfboxPreview$.docspell$extract$pdfbox$PdfboxPreview$$getPageImage(PdfboxPreview.scala:46)
at docspell.extract.pdfbox.PdfboxPreview$$anon$1.$anonfun$previewImage$2(PdfboxPreview.scala:29)
at docspell.extract.pdfbox.PdfboxPreview$$anon$1$$Lambda$1970/0x000000010099f840.apply(Unknown Source)
at cats.effect.internals.IORunLoop$.cats$effect$internals$IORunLoop$$loop(IORunLoop.scala:104)
at cats.effect.internals.IORunLoop$.restartCancelable(IORunLoop.scala:51)
at cats.effect.internals.IOBracket$BracketStart.run(IOBracket.scala:100)
at cats.effect.internals.Trampoline.cats$effect$internals$Trampoline$$immediateLoop(Trampoline.scala:67)
[blaze-acceptor-0-0] [34mINFO [0;39m [36mo.h.b.c.ServerChannel[0;39m - Closing NIO1 channel /0:0:0:0:0:0:0:0:7878
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-0
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-1
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-2
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-3
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-4
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-5
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-6
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-7
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-8
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-acceptor-0-0
[shutdownHook1] [34mINFO [0;39m [36md.b.ops.ONode[0;39m - Unregister app joex1
[ioapp-compute-7] [34mINFO [0;39m [36mc.z.h.HikariDataSource[0;39m - HikariPool-1 - Shutdown initiated...
[ioapp-compute-7] [34mINFO [0;39m [36mc.z.h.HikariDataSource[0;39m - HikariPool-1 - Shutdown completed.
[ioapp-compute-7] [34mINFO [0;39m [36mo.h.c.PoolManager[0;39m - Shutting down connection pool: curAllocated=0 idleQueues.size=0 waitQueue.size=0 maxWaitQueueLimit=256 closed=false
Starting unoconv listener
[main] [34mINFO [0;39m [36md.joex.Main[0;39m - Using given config file: /opt/docspell.conf
[main] [34mINFO [0;39m [36md.joex.Main[0;39m -
***> ______ _ _
***> | _ \ | | |
***> | | | |___ ___ ___ _ __ ___| | |
***> | | | / _ \ / __/ __| '_ \ / _ \ | |
***> | |/ / (_) | (__\__ \ |_) | __/ | |
***> |___/ \___/ \___|___/ .__/ \___|_|_|
***> | |
***> |_| v0.20.0 (#4d3a25a8)
***> << JOEX >>
***> Id: joex1
***> Base-Url: http://192.168.178.4:7878
***> Database: jdbc:postgresql://192.168.178.6:5432/docspell
***> Fts: http://192.168.178.2:8983/solr/docspell
***> Config: /opt/docspell.conf
***>
[ioapp-compute-0] [34mINFO [0;39m [36md.s.m.FlywayMigrate[0;39m - Running db migrations...
[ioapp-compute-0] [34mINFO [0;39m [36md.s.m.FlywayMigrate[0;39m - Using migration locations: List(classpath:db/migration/postgresql)
[ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.l.VersionPrinter[0;39m - Flyway Community Edition 7.5.3 by Redgate
[ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.d.b.DatabaseType[0;39m - Database: jdbc:postgresql://192.168.178.6:5432/docspell (PostgreSQL 11.7)
[ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.s.JdbcTableSchemaHistory[0;39m - Repair of failed migration in Schema History table "public"."flyway_schema_history" not necessary. No failed migration detected.
[ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.c.DbRepair[0;39m - Successfully repaired schema history table "public"."flyway_schema_history" (execution time 00:00.070s).
[ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.l.VersionPrinter[0;39m - Flyway Community Edition 7.5.3 by Redgate
[ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.c.DbValidate[0;39m - Successfully validated 29 migrations (execution time 00:00.024s)
[ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.c.DbMigrate[0;39m - Current version of schema "public": 1.20.4
[ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.c.DbMigrate[0;39m - Schema "public" is up to date. No migration necessary.
[ioapp-compute-0] [34mINFO [0;39m [36md.a.n.PipelineCache[0;39m - Clearing StanfordNLP cache after Duration(900000ms) idle time
[ioapp-compute-0] [34mINFO [0;39m [36md.a.n.PipelineCache[0;39m - Creating nlp pipeline cache
[docspell-joex-dbconnect-0] [34mINFO [0;39m [36mc.z.h.HikariDataSource[0;39m - HikariPool-1 - Starting...
[docspell-joex-dbconnect-0] [34mINFO [0;39m [36mc.z.h.HikariDataSource[0;39m - HikariPool-1 - Start completed.
[ioapp-compute-5] [34mINFO [0;39m [36md.j.s.SchedulerImpl[0;39m - Starting scheduler
[ioapp-compute-6] [34mINFO [0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Starting periodic scheduler
[ioapp-compute-4] [34mINFO [0;39m [36md.b.ops.ONode[0;39m - Registering node joex1
[ioapp-compute-5] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Try to acquire permit (1 free)
[ioapp-compute-6] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Going into main loop
[ioapp-compute-6] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Looking for next periodic task
[ioapp-compute-5] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - New permit acquired
[ioapp-compute-1] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Found periodic task 'Docspell house-keeping/Sun *-*-* 00:00:00'
[ioapp-compute-1] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Scheduling next notify for timer Sun *-*-* 00:00:00 -> Some(2021-03-07T00:00)
[ioapp-compute-2] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Waiting for notify
[ioapp-compute-0] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Next job found: Some(FUJ3Yznjz.../docspell-system/make-preview/Low)
[ioapp-compute-0] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Creating context for job FUJ3Yznjz.../docspell-system/make-preview/Low to run JobTask(Ident(make-preview),docspell.joex.scheduler.Task$$anonfun$contramap$4@2d9cbbdd,docspell.joex.scheduler.Task$$anonfun$contramap$4@36799cf5)
[ioapp-compute-4] [34mINFO [0;39m [36mo.h.b.c.n.NIO1SocketServerGroup[0;39m - Service bound to address /0:0:0:0:0:0:0:0:7878
[ioapp-compute-4] [34mINFO [0;39m [36mo.h.s.b.BlazeServerBuilder[0;39m - http4s v0.21.19 on blaze v0.14.15 started at http://[::]:7878/
[ioapp-compute-0] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Forking job FUJ3Yznjz.../docspell-system/make-preview/Low
[ioapp-compute-0] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Try to acquire permit (0 free)
[ioapp-compute-7] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Starting task now
[ioapp-compute-5] [34mINFO [0;39m [36md.j.s.LogSink[0;39m - >>> 2021-03-05T14:02:44.438593Z Info FUJ3Yznjz.../docspell-system/make-preview/Low: Generating preview image for attachment Ident(7eo43raYQqy-ZzRwenWHSwM-fwZ8HTQ2Dqw-LJgUkvi2mPF)
java.lang.OutOfMemoryError: Java heap space
at java.desktop/java.awt.image.DataBufferByte.<init>(DataBufferByte.java:76)
at java.desktop/java.awt.image.Raster.createInterleavedRaster(Raster.java:266)
at java.desktop/com.sun.imageio.plugins.jpeg.JPEGImageReader.readInternal(JPEGImageReader.java:1228)
at java.desktop/com.sun.imageio.plugins.jpeg.JPEGImageReader.readRaster(JPEGImageReader.java:1541)
at com.twelvemonkeys.imageio.plugins.jpeg.JPEGImageReader.readImageAsRasterAndReplaceColorProfile(JPEGImageReader.java:502)
at com.twelvemonkeys.imageio.plugins.jpeg.JPEGImageReader.read(JPEGImageReader.java:395)
at org.apache.pdfbox.filter.DCTFilter.decode(DCTFilter.java:91)
at org.apache.pdfbox.cos.COSInputStream.create(COSInputStream.java:80)
at org.apache.pdfbox.cos.COSStream.createInputStream(COSStream.java:179)
at org.apache.pdfbox.pdmodel.common.PDStream.createInputStream(PDStream.java:241)
at org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.createInputStream(PDImageXObject.java:793)
at org.apache.pdfbox.pdmodel.graphics.image.SampledImageReader.from8bit(SampledImageReader.java:517)
at org.apache.pdfbox.pdmodel.graphics.image.SampledImageReader.getRGBImage(SampledImageReader.java:226)
at org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.getImage(PDImageXObject.java:479)
at org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.getImage(PDImageXObject.java:460)
at org.apache.pdfbox.rendering.PageDrawer.drawImage(PageDrawer.java:1059)
at org.apache.pdfbox.contentstream.operator.graphics.DrawObject.process(DrawObject.java:67)
at org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:933)
at org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:515)
at org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:489)
at org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:156)
at org.apache.pdfbox.rendering.PageDrawer.drawPage(PageDrawer.java:275)
at org.apache.pdfbox.rendering.PDFRenderer.renderImage(PDFRenderer.java:347)
at org.apache.pdfbox.rendering.PDFRenderer.renderImage(PDFRenderer.java:268)
at org.apache.pdfbox.rendering.PDFRenderer.renderImageWithDPI(PDFRenderer.java:240)
at docspell.extract.pdfbox.PdfboxPreview$.docspell$extract$pdfbox$PdfboxPreview$$getPageImage(PdfboxPreview.scala:46)
at docspell.extract.pdfbox.PdfboxPreview$$anon$1.$anonfun$previewImage$2(PdfboxPreview.scala:29)
at docspell.extract.pdfbox.PdfboxPreview$$anon$1$$Lambda$1970/0x000000010099f840.apply(Unknown Source)
at cats.effect.internals.IORunLoop$.cats$effect$internals$IORunLoop$$loop(IORunLoop.scala:104)
at cats.effect.internals.IORunLoop$.restartCancelable(IORunLoop.scala:51)
at cats.effect.internals.IOBracket$BracketStart.run(IOBracket.scala:100)
at cats.effect.internals.Trampoline.cats$effect$internals$Trampoline$$immediateLoop(Trampoline.scala:67)
[blaze-acceptor-0-0] [34mINFO [0;39m [36mo.h.b.c.ServerChannel[0;39m - Closing NIO1 channel /0:0:0:0:0:0:0:0:7878
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-0
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-1
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-2
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-3
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-4
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-5
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-6
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-7
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-8
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-acceptor-0-0
[shutdownHook1] [34mINFO [0;39m [36md.b.ops.ONode[0;39m - Unregister app joex1
[ioapp-compute-0] [34mINFO [0;39m [36mc.z.h.HikariDataSource[0;39m - HikariPool-1 - Shutdown initiated...
[ioapp-compute-0] [34mINFO [0;39m [36mc.z.h.HikariDataSource[0;39m - HikariPool-1 - Shutdown completed.
[ioapp-compute-0] [34mINFO [0;39m [36mo.h.c.PoolManager[0;39m - Shutting down connection pool: curAllocated=0 idleQueues.size=0 waitQueue.size=0 maxWaitQueueLimit=256 closed=false
Starting unoconv listener
[main] [34mINFO [0;39m [36md.joex.Main[0;39m - Using given config file: /opt/docspell.conf
[main] [34mINFO [0;39m [36md.joex.Main[0;39m -
***> ______ _ _
***> | _ \ | | |
***> | | | |___ ___ ___ _ __ ___| | |
***> | | | / _ \ / __/ __| '_ \ / _ \ | |
***> | |/ / (_) | (__\__ \ |_) | __/ | |
***> |___/ \___/ \___|___/ .__/ \___|_|_|
***> | |
***> |_| v0.20.0 (#4d3a25a8)
***> << JOEX >>
***> Id: joex1
***> Base-Url: http://192.168.178.4:7878
***> Database: jdbc:postgresql://192.168.178.6:5432/docspell
***> Fts: http://192.168.178.2:8983/solr/docspell
***> Config: /opt/docspell.conf
***>
[ioapp-compute-0] [34mINFO [0;39m [36md.s.m.FlywayMigrate[0;39m - Running db migrations...
[ioapp-compute-0] [34mINFO [0;39m [36md.s.m.FlywayMigrate[0;39m - Using migration locations: List(classpath:db/migration/postgresql)
[ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.l.VersionPrinter[0;39m - Flyway Community Edition 7.5.3 by Redgate
[ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.d.b.DatabaseType[0;39m - Database: jdbc:postgresql://192.168.178.6:5432/docspell (PostgreSQL 11.7)
[ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.s.JdbcTableSchemaHistory[0;39m - Repair of failed migration in Schema History table "public"."flyway_schema_history" not necessary. No failed migration detected.
[ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.c.DbRepair[0;39m - Successfully repaired schema history table "public"."flyway_schema_history" (execution time 00:00.069s).
[ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.l.VersionPrinter[0;39m - Flyway Community Edition 7.5.3 by Redgate
[ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.c.DbValidate[0;39m - Successfully validated 29 migrations (execution time 00:00.025s)
[ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.c.DbMigrate[0;39m - Current version of schema "public": 1.20.4
[ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.c.DbMigrate[0;39m - Schema "public" is up to date. No migration necessary.
[ioapp-compute-0] [34mINFO [0;39m [36md.a.n.PipelineCache[0;39m - Clearing StanfordNLP cache after Duration(900000ms) idle time
[ioapp-compute-0] [34mINFO [0;39m [36md.a.n.PipelineCache[0;39m - Creating nlp pipeline cache
[docspell-joex-dbconnect-0] [34mINFO [0;39m [36mc.z.h.HikariDataSource[0;39m - HikariPool-1 - Starting...
[docspell-joex-dbconnect-0] [34mINFO [0;39m [36mc.z.h.HikariDataSource[0;39m - HikariPool-1 - Start completed.
[ioapp-compute-5] [34mINFO [0;39m [36md.j.s.SchedulerImpl[0;39m - Starting scheduler
[ioapp-compute-6] [34mINFO [0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Starting periodic scheduler
[ioapp-compute-4] [34mINFO [0;39m [36md.b.ops.ONode[0;39m - Registering node joex1
[ioapp-compute-5] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Try to acquire permit (1 free)
[ioapp-compute-6] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Going into main loop
[ioapp-compute-6] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Looking for next periodic task
[ioapp-compute-5] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - New permit acquired
[ioapp-compute-1] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Found periodic task 'Docspell house-keeping/Sun *-*-* 00:00:00'
[ioapp-compute-1] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Scheduling next notify for timer Sun *-*-* 00:00:00 -> Some(2021-03-07T00:00)
[ioapp-compute-7] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Waiting for notify
[ioapp-compute-3] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Next job found: Some(FUJ3Yznjz.../docspell-system/make-preview/Low)
[ioapp-compute-3] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Creating context for job FUJ3Yznjz.../docspell-system/make-preview/Low to run JobTask(Ident(make-preview),docspell.joex.scheduler.Task$$anonfun$contramap$4@291b2e71,docspell.joex.scheduler.Task$$anonfun$contramap$4@2992d164)
[ioapp-compute-4] [34mINFO [0;39m [36mo.h.b.c.n.NIO1SocketServerGroup[0;39m - Service bound to address /0:0:0:0:0:0:0:0:7878
[ioapp-compute-4] [34mINFO [0;39m [36mo.h.s.b.BlazeServerBuilder[0;39m - http4s v0.21.19 on blaze v0.14.15 started at http://[::]:7878/
[ioapp-compute-3] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Forking job FUJ3Yznjz.../docspell-system/make-preview/Low
[ioapp-compute-3] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Try to acquire permit (0 free)
[ioapp-compute-2] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Starting task now
[ioapp-compute-6] [34mINFO [0;39m [36md.j.s.LogSink[0;39m - >>> 2021-03-05T14:04:53.638169Z Info FUJ3Yznjz.../docspell-system/make-preview/Low: Generating preview image for attachment Ident(7eo43raYQqy-ZzRwenWHSwM-fwZ8HTQ2Dqw-LJgUkvi2mPF)
java.lang.OutOfMemoryError: Java heap space
at java.desktop/java.awt.image.DataBufferByte.<init>(DataBufferByte.java:76)
at java.desktop/java.awt.image.Raster.createInterleavedRaster(Raster.java:266)
at java.desktop/com.sun.imageio.plugins.jpeg.JPEGImageReader.readInternal(JPEGImageReader.java:1228)
at java.desktop/com.sun.imageio.plugins.jpeg.JPEGImageReader.readRaster(JPEGImageReader.java:1541)
at com.twelvemonkeys.imageio.plugins.jpeg.JPEGImageReader.readImageAsRasterAndReplaceColorProfile(JPEGImageReader.java:502)
at com.twelvemonkeys.imageio.plugins.jpeg.JPEGImageReader.read(JPEGImageReader.java:395)
at org.apache.pdfbox.filter.DCTFilter.decode(DCTFilter.java:91)
at org.apache.pdfbox.cos.COSInputStream.create(COSInputStream.java:80)
at org.apache.pdfbox.cos.COSStream.createInputStream(COSStream.java:179)
at org.apache.pdfbox.pdmodel.common.PDStream.createInputStream(PDStream.java:241)
at org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.createInputStream(PDImageXObject.java:793)
at org.apache.pdfbox.pdmodel.graphics.image.SampledImageReader.from8bit(SampledImageReader.java:517)
at org.apache.pdfbox.pdmodel.graphics.image.SampledImageReader.getRGBImage(SampledImageReader.java:226)
at org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.getImage(PDImageXObject.java:479)
at org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.getImage(PDImageXObject.java:460)
at org.apache.pdfbox.rendering.PageDrawer.drawImage(PageDrawer.java:1059)
at org.apache.pdfbox.contentstream.operator.graphics.DrawObject.process(DrawObject.java:67)
at org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:933)
at org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:515)
at org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:489)
at org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:156)
at org.apache.pdfbox.rendering.PageDrawer.drawPage(PageDrawer.java:275)
at org.apache.pdfbox.rendering.PDFRenderer.renderImage(PDFRenderer.java:347)
at org.apache.pdfbox.rendering.PDFRenderer.renderImage(PDFRenderer.java:268)
at org.apache.pdfbox.rendering.PDFRenderer.renderImageWithDPI(PDFRenderer.java:240)
at docspell.extract.pdfbox.PdfboxPreview$.docspell$extract$pdfbox$PdfboxPreview$$getPageImage(PdfboxPreview.scala:46)
at docspell.extract.pdfbox.PdfboxPreview$$anon$1.$anonfun$previewImage$2(PdfboxPreview.scala:29)
at docspell.extract.pdfbox.PdfboxPreview$$anon$1$$Lambda$1968/0x000000010099ac40.apply(Unknown Source)
at cats.effect.internals.IORunLoop$.cats$effect$internals$IORunLoop$$loop(IORunLoop.scala:104)
at cats.effect.internals.IORunLoop$.restartCancelable(IORunLoop.scala:51)
at cats.effect.internals.IOBracket$BracketStart.run(IOBracket.scala:100)
at cats.effect.internals.Trampoline.cats$effect$internals$Trampoline$$immediateLoop(Trampoline.scala:67)
[blaze-acceptor-0-0] [34mINFO [0;39m [36mo.h.b.c.ServerChannel[0;39m - Closing NIO1 channel /0:0:0:0:0:0:0:0:7878
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-0
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-1
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-2
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-3
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-4
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-5
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-6
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-7
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-8
[shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-acceptor-0-0
[shutdownHook1] [34mINFO [0;39m [36md.b.ops.ONode[0;39m - Unregister app joex1
[ioapp-compute-3] [34mINFO [0;39m [36mc.z.h.HikariDataSource[0;39m - HikariPool-1 - Shutdown initiated...
[ioapp-compute-3] [34mINFO [0;39m [36mc.z.h.HikariDataSource[0;39m - HikariPool-1 - Shutdown completed.
[ioapp-compute-3] [34mINFO [0;39m [36mo.h.c.PoolManager[0;39m - Shutting down connection pool: curAllocated=0 idleQueues.size=0 waitQueue.size=0 maxWaitQueueLimit=256 closed=false

Link to comment

hmm, "java.lang.OutOfMemoryError: Java heap space" ist offensichtlich das Problem.....

Vielleicht waren 300 Belege zu viel? Sind die einzelnen Dateien groß?

Wieviele wurden erfolgreich importiert?

Kannst du die (noch nicht importierten) Belege aus dem Importverzeichnis entfernen und schauen ob joex dann nicht mehr beendet wird?

 

Es gibt auf Github ein issue (geschlossen) dem ich aber nicht entnehmen kann wie das Problem gelöst wurde: issue 284

In der Doku steht auch was, aber auch hier kann ich nicht herauslesen wie der Wert gesetzt wird: hier unter Memory usage

 

Du kannst probieren im Joex Container in der "advanced view" in den "extra parameters" einen Wert zu setzen.

In der Doku steht "When using mode=full, a heap setting of at least -Xmx1400M is recommended"

Setze in den "extra parameters" mal folgendes: -e JAVA_OPTS="-Xmx2500m" oder -e JAVA_OPTS="-Xmx2500m -Xms256m" wenn du auch ein Minimum festlegen willst.

Ansonsten könntest du noch ein neues Issue in Github eröffnen.

Link to comment

Auch das Anlegen einer Variable sollte möglich sein (Joex Container => Edit => Add another Path, Port, Variable, Label or Device => ...)

Wenn das funktioniert könnte ich in mein Template auch diese neue Variable definieren. Die wäre dann aber immer da und müsste eventuell von Hand angepasst werden...

(...)
    <Variable>
      <Value>-Xms256m -Xmx2500m</Value>
      <Name>JAVA_OPTS</Name>
      <Mode/>
    </Variable>
  </Environment>
(...)
  <Config Name="JAVA_OPTS" Target="JAVA_OPTS" Default="-Xms256m -Xmx2500m" Mode="" Description="Container Variable: JAVA_OPTS" Type="Variable" Display="always" Required="false" Mask="false">-Xms256m -Xmx2500m</Config>
(...)

 

Link to comment

Hab jetzt 5 files in den ordner kopiert.

docspell legt auch gleich los.

doch seit über 5 minuten versucht joex das pdf zu erfassen... hier mal der process:

 

2021-03-06T0:00:08: ============ Start processing 2017-01-02_Ausgabe_4685_MEDIAMARKT NÜRNBERG GMBH.pdf ============
2021-03-06T0:00:08: Not checking for duplicates
2021-03-06T0:00:08: Creating new item with 1 attachment(s)
2021-03-06T0:00:08: Creating item finished in 42 ms
2021-03-06T0:00:08: Not an archive: application/pdf
2021-03-06T0:00:08: Converting file Some(2017-01-02_Ausgabe_4685_MEDIAMARKT NÜRNBERG GMBH.pdf) (application/pdf) into a PDF
2021-03-06T0:00:08: Storing input to file /tmp/docspell-convert/docspell-ocrmypdf11422737783394336992/infile for running ocrmypdf
2021-03-06T0:00:08: Running external command: ocrmypdf -l deu --skip-text --deskew -j 1 /tmp/docspell-convert/docspell-ocrmypdf11422737783394336992/infile /tmp/docspell-convert/docspell-ocrmypdf11422737783394336992/out.pdf
2021-03-06T0:02:22: Command `ocrmypdf -l deu --skip-text --deskew -j 1 /tmp/docspell-convert/docspell-ocrmypdf11422737783394336992/infile /tmp/docspell-convert/docspell-ocrmypdf11422737783394336992/out.pdf` finished: 0
2021-03-06T0:02:22: ocrmypdf stdout:
2021-03-06T0:02:22: ocrmypdf stderr: 1 /usr/lib/python3.8/site-packages/PIL/Image.py:2832: DecompressionBombWarning: Image size (151037461 pixels) exceeds limit of 128000000 pixels, could be decompression bomb DOS attack. warnings.warn( Postprocessing... /usr/lib/python3.8/site-packages/PIL/Image.py:2832: DecompressionBombWarning: Image size (151037461 pixels) exceeds limit of 128000000 pixels, could be decompression bomb DOS attack. warnings.warn( Optimize ratio: 1.00 savings: -0.0% Image optimization did not improve the file - discarded Output file is a PDF/A-2B (as expected) The output file size is 5.92× larger than the input file. Possible reasons for this include: The argument --deskew was issued, causing transcoding. PDF/A conversion was enabled. (Try `--output-type pdf`.)
2021-03-06T0:02:22: Conversion to pdf successful. Saving file.
2021-03-06T0:02:22: Closing process: `ocrmypdf -l deu --skip-text --deskew -j 1 /tmp/docspell-convert/docspell-ocrmypdf11422737783394336992/infile /tmp/docspell-convert/docspell-ocrmypdf11422737783394336992/out.pdf`
2021-03-06T0:02:22: Starting text extraction for 1 files
2021-03-06T0:02:22: Extracting text for attachment 2017-01-02_Ausgabe_4685_MEDIAMARKT NÜRNBERG GMBH.converted
2021-03-06T0:02:22: Trying to strip text from pdf using pdfbox.
2021-03-06T0:02:22: Stripped text from PDF is small (0). Trying with OCR.
2021-03-06T0:02:22: Running external command: gs -dLastPage=10 -dNOPAUSE -dBATCH -dSAFER -sDEVICE=tiffscaled8 -sOutputFile=%d.tif -
2021-03-06T0:02:42: Command `gs -dLastPage=10 -dNOPAUSE -dBATCH -dSAFER -sDEVICE=tiffscaled8 -sOutputFile=%d.tif -` finished: 0
2021-03-06T0:02:42: Running external command: unpaper /tmp/docspell-extraction/extractpdf8837783680149879878/1.tif /tmp/docspell-extraction/extractpdf8837783680149879878/u-1.tif
2021-03-06T0:02:42: Command `unpaper /tmp/docspell-extraction/extractpdf8837783680149879878/1.tif /tmp/docspell-extraction/extractpdf8837783680149879878/u-1.tif` finished: 1
2021-03-06T0:02:42: Closing process: `unpaper /tmp/docspell-extraction/extractpdf8837783680149879878/1.tif /tmp/docspell-extraction/extractpdf8837783680149879878/u-1.tif`
2021-03-06T0:02:42: Running external command: tesseract 1.tif stdout -l deu
2021-03-06T0:03:48: Command `tesseract 1.tif stdout -l deu` finished: 0
2021-03-06T0:03:48: Closing process: `tesseract 1.tif stdout -l deu`
2021-03-06T0:03:48: Closing process: `gs -dLastPage=10 -dNOPAUSE -dBATCH -dSAFER -sDEVICE=tiffscaled8 -sOutputFile=%d.tif -`
2021-03-06T0:03:48: Using stripped text (not OCR), as it is longer (0 > 0)
2021-03-06T0:03:48: Extracting text for attachment 2017-01-02_Ausgabe_4685_MEDIAMARKT NÜRNBERG GMBH.converted finished in 86194 ms
2021-03-06T0:03:48: Storing extracted texts …
2021-03-06T0:03:48: Extracted text stored.
2021-03-06T0:03:48: Updating SOLR index
2021-03-06T0:03:48: Text extraction finished in 86248 ms.
2021-03-06T0:03:48: Creating preview images for 1 files…

Link to comment
8 hours ago, Hoddl said:

-e JAVA_OPTS="-Xmx2500m -Xms256m"

hat geholfen...

Das ist schon mal gut! Mal schauen ob ich das in das Template einbaue.

 

Das joex log sieht ansich ganz gut aus.

Auf fallend ist:

7 hours ago, Hoddl said:

DecompressionBombWarning: Image size (151037461 pixels) exceeds limit of 128000000 pixels, could be decompression bomb DOS attack. warnings.warn( Optimize ratio: 1.00 savings: -0.0% Image optimization did not improve the file - discarded Output file is a PDF/A-2B (as expected) The output file size is 5.92× larger than the input file.

...aber der Output zu einer PDF/A-2B ist erfolgreich... Wenn auch Sie fast 6 mal so groß ist wie das Original...

 

Auch erfolgreich: die Text Extraktion und die Aktualisierung der Volltextsuche.

8 hours ago, Hoddl said:

Extracted text stored.
2021-03-06T0:03:48: Updating SOLR index
2021-03-06T0:03:48: Text extraction finished in 86248 ms.
2021-03-06T0:03:48: Creating preview images for 1 files…

Allerdings scheint er bei der Erstellung des Vorschaubildes stehen zu bleiben...

 

Ist das Dokument nachher evtl. ohne  Vorschaubild in der webui zu finden?

Link to comment

wieder das gleiche joex steht dann irgendwann... ich werde die Variable jetzt mal einbauen... mal sehen ob es was bringt...

 

(...) <Variable> <Value>-Xms256m -Xmx2500m</Value> <Name>JAVA_OPTS</Name> <Mode/> </Variable> </Environment> (...) <Config Name="JAVA_OPTS" Target="JAVA_OPTS" Default="-Xms256m -Xmx2500m" Mode="" Description="Container Variable: JAVA_OPTS" Type="Variable" Display="always" Required="false" Mask="false">-Xms256m -Xmx2500m</Config> (...)

Link to comment

Name ist der von dir vergebene Name

Schlüssel ist der Variablen Namen

Wert ist der Variablenwert

 

Brauchst du aber nicht, denn ob über extra parameters oder so bleibt sich gleich.

 

Geh bitte in die Prozessliste/Warteschlange und lösche die nicht erfolgreich beendeten tasks (Kreuzchen oben rechts)

Link to comment

ok hab ich gemacht.

 

Nun hab ich mal in den Ordner unter docs nachgeschaut hier hat docspell die Belege einfach drinnen gelassen und evtl. immer wieder neu angefangen?!

Diesen Ordner werde ich erst mal nicht mehr nehmen um docspell mit Daten zu füttern...

Ich komme deswegen da drauf da in der Warteschlange immer wieder Belege von 18 Uhr aufgetaucht sind.

 

Jetzt hab ich mal ganz normal 10 Stück hochgeladen mal schauen ob es abgearbeitet wird.

 

Ich melde mich wieder 🙂

 

 

Link to comment

Joex hat mal wieder gestoppt 😞 

 

irgendwas mag er bei mir nicht 😞

 

ich lade mal nur die roten und die gelben fehler aus dem log hoch sonst wird es wieder meterlang 🙂

 

 

Error: /BBox has zero width or height, which is not allowed.

 

 

[docspell-joex-blocking-9] [31mWARN [0;39m [36md.b.ops.OItem[0;39m - Error updating full-text index: unexpected HTTP status: 500 Server Error

 

 

[ioapp-compute-3] [39mDEBUG[0;39m [36md.s.q.QItem[0;39m - FindByChecksum: Fragment("SELECT DISTINCT i.itemid, i.cid, i.name, i.itemdate, i.source, i.incoming, i.state, i.corrorg, i.corrperson, i.concperson, i.concequipment, i.inreplyto, i.duedate, i.created, i.updated, i.notes, i.folder_id FROM item i INNER JOIN attachment a ON a.itemid = i.itemid INNER JOIN attachment_source s ON s.id = a.attachid INNER JOIN filemeta m1 ON m1.id = a.filemetaid INNER JOIN filemeta m2 ON m2.id = s.file_id LEFT JOIN attachment_archive r ON r.id = a.attachid LEFT JOIN filemeta m3 ON m3.id = r.file_id WHERE (i.cid = ? AND (m1.checksum = ? OR m2.checksum = ? OR m3.checksum = ? ) AND (m1.id is null OR NOT m1.id IN (? )) AND (m2.id is null OR NOT m2.id IN (? )) AND (m3.id is null OR NOT m3.id IN (? )))")

 

 

 

Link to comment
5 hours ago, Hoddl said:

Nun hab ich mal in den Ordner unter docs nachgeschaut hier hat docspell die Belege einfach drinnen gelassen

Standardmäßig verbleiben dort die Dokumente. Es ist wie ein "Fileserver" zu sehen. Dokumente die dort hin gelegt werden können auch umbenannt und in einer Ordnerstruktur organisiert werden. Docspell erkennt bereits hochgeladene Dokumente und wird sie - auch nach "Reorganisierung - nicht nocheinmal importieren.

Siehe https://docspell.org/docs/feed/#scanners-watch-directories

 

Es gibt jedoch auch die Möglichkeit dieses Consumedir Verzeichnis ("docs") aufzuräumen. Dokumente werden dann entweder "archiviert" oder gelöscht, je nach Einstellung. Siehe https://docspell.org/docs/tools/consumedir-cleaner/#introduction

 

5 hours ago, Hoddl said:

Error: /BBox has zero width or height, which is not allowed.

Das ist ein Fehler von Ghostscript bei der Konvertierung von Bildern...(?)

 

Bitte ein issue auf Github eröffnen, da kann ich mir auch keinen Reim draus machen.

 Vorher vielleicht noch mal alle Container in der richtigen Reihenfolge (postfix > solr > joex > restserver > consumedir) mit einigen Sekunden Abstand nach erfolgtem Start (schau ins Log des Containers ob der Start erfolgreich und beendet ist) neu starten.

 

5 hours ago, Hoddl said:

[docspell-joex-blocking-9] [31mWARN [0;39m [36md.b.ops.OItem[0;39m - Error updating full-text index: unexpected HTTP status: 500 Server Error

Das ist ein Fehler bei der Erstellung des Volltextindexes. Schaut so aus als würde Solr nicht (ordentlich) laufen...?

Du kannst mal auf http://deine.solr.ip.adresse/solr/#/ gehen und schauen ob du da was findest, allerdings kenne ich mich da auch nicht aus....

Hast du Solr zwischenzeitlich aktualisiert? Ich habe nämlich eine Aktualisierung mitgemacht, da hatte der Entwickler (bitnami) etwas grundlegendes geändert, da musste ich den Index neu aufbauen. Schau mal in Docspell ob deine Volltextsuche noch funktioniert.

 

Edited by vakilando
added missing infos
Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.