Hoddl Posted March 5, 2021 Author Share Posted March 5, 2021 wenn ich es auf close stelle komme ich nicht mehr in die WebUI also die ip wird nicht mehr gefunden... ich hab es nun auf Invite gesetzt und alles ist gut... Einen Ordner im Ordner z.b. Hauptorder Unterordner Unterordner geht wohl nicht??? Quote Link to comment
vakilando Posted March 5, 2021 Share Posted March 5, 2021 6 hours ago, Hoddl said: Einen Ordner im Ordner z.b. Hauptorder Unterordner geht wohl nicht??? Nein, leider nicht. Es gibt aber ein issue bzw Feature Request auf Github "Nested folders & Tags" hierzu, dem könntest du dich "anschließen". Habe mich dem auch angeschlossen, da ich finde es wäre eine zusätzliche und praktische Sortier- und Suchmöglichkeit. Quote Link to comment
Hoddl Posted March 5, 2021 Author Share Posted March 5, 2021 und dann das ganze in deutsch übersetzen 🙂 Quote Link to comment
Hoddl Posted March 5, 2021 Author Share Posted March 5, 2021 ich hab mir das Skript zur "Überwachung eines Ordners" angeschaut... nur ist es mir nicht klar wo und wie das skript laufen muss/soll... Quote Link to comment
Hoddl Posted March 5, 2021 Author Share Posted March 5, 2021 Hab mir jetzt erst mal mit den Sources geholfen um die Belege auf einmal gleich in die richtigen Ordner einsortieren zu lassen 🙂 Quote Link to comment
Hoddl Posted March 5, 2021 Author Share Posted March 5, 2021 Hab mal fasst 300 Belege hochgeladen... jetzt stoppt docspell-joex immer... 😞 Quote Link to comment
vakilando Posted March 5, 2021 Share Posted March 5, 2021 1 hour ago, Hoddl said: ich hab mir das Skript zur "Überwachung eines Ordners" angeschaut... nur ist es mir nicht klar wo und wie das skript laufen muss/soll... Wenn ich das richtig sehe, ist das von dir genannte Script der Consumedir Container. Wenn du das Verzeichnis "/mnt/appdata/docspell/docs" angelegt hast musst du noch einen Unterordner erstellen, der den Namen deines "Collective" trägt. Dort legst du die Dokumente ab und die werden dann automatisch importiert. 30 minutes ago, Hoddl said: jetzt stoppt docspell-joex immer was sagen die Joex Logs? Quote Link to comment
Hoddl Posted March 5, 2021 Author Share Posted March 5, 2021 hier das Log... ist etwas groß 🙂 FehlerWarnungSystemArrayAnmelden [ioapp-compute-5] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Next job found: Some(FUJ3Yznjz.../docspell-system/make-preview/Low) [ioapp-compute-5] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Creating context for job FUJ3Yznjz.../docspell-system/make-preview/Low to run JobTask(Ident(make-preview),docspell.joex.scheduler.Task$$anonfun$contramap$4@789d931d,docspell.joex.scheduler.Task$$anonfun$contramap$4@35599e84) [ioapp-compute-1] [34mINFO [0;39m [36mo.h.b.c.n.NIO1SocketServerGroup[0;39m - Service bound to address /0:0:0:0:0:0:0:0:7878 [ioapp-compute-1] [34mINFO [0;39m [36mo.h.s.b.BlazeServerBuilder[0;39m - http4s v0.21.19 on blaze v0.14.15 started at http://[::]:7878/ [ioapp-compute-5] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Forking job FUJ3Yznjz.../docspell-system/make-preview/Low [ioapp-compute-5] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Try to acquire permit (0 free) [ioapp-compute-3] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Starting task now [ioapp-compute-2] [34mINFO [0;39m [36md.j.s.LogSink[0;39m - >>> 2021-03-05T14:01:12.920345Z Info FUJ3Yznjz.../docspell-system/make-preview/Low: Generating preview image for attachment Ident(7eo43raYQqy-ZzRwenWHSwM-fwZ8HTQ2Dqw-LJgUkvi2mPF) java.lang.OutOfMemoryError: Java heap space at java.desktop/java.awt.image.DataBufferByte.<init>(DataBufferByte.java:76) at java.desktop/java.awt.image.Raster.createInterleavedRaster(Raster.java:266) at java.desktop/com.sun.imageio.plugins.jpeg.JPEGImageReader.readInternal(JPEGImageReader.java:1228) at java.desktop/com.sun.imageio.plugins.jpeg.JPEGImageReader.readRaster(JPEGImageReader.java:1541) at com.twelvemonkeys.imageio.plugins.jpeg.JPEGImageReader.readImageAsRasterAndReplaceColorProfile(JPEGImageReader.java:502) at com.twelvemonkeys.imageio.plugins.jpeg.JPEGImageReader.read(JPEGImageReader.java:395) at org.apache.pdfbox.filter.DCTFilter.decode(DCTFilter.java:91) at org.apache.pdfbox.cos.COSInputStream.create(COSInputStream.java:80) at org.apache.pdfbox.cos.COSStream.createInputStream(COSStream.java:179) at org.apache.pdfbox.pdmodel.common.PDStream.createInputStream(PDStream.java:241) at org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.createInputStream(PDImageXObject.java:793) at org.apache.pdfbox.pdmodel.graphics.image.SampledImageReader.from8bit(SampledImageReader.java:517) at org.apache.pdfbox.pdmodel.graphics.image.SampledImageReader.getRGBImage(SampledImageReader.java:226) at org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.getImage(PDImageXObject.java:479) at org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.getImage(PDImageXObject.java:460) at org.apache.pdfbox.rendering.PageDrawer.drawImage(PageDrawer.java:1059) at org.apache.pdfbox.contentstream.operator.graphics.DrawObject.process(DrawObject.java:67) at org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:933) at org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:515) at org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:489) at org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:156) at org.apache.pdfbox.rendering.PageDrawer.drawPage(PageDrawer.java:275) at org.apache.pdfbox.rendering.PDFRenderer.renderImage(PDFRenderer.java:347) at org.apache.pdfbox.rendering.PDFRenderer.renderImage(PDFRenderer.java:268) at org.apache.pdfbox.rendering.PDFRenderer.renderImageWithDPI(PDFRenderer.java:240) at docspell.extract.pdfbox.PdfboxPreview$.docspell$extract$pdfbox$PdfboxPreview$$getPageImage(PdfboxPreview.scala:46) at docspell.extract.pdfbox.PdfboxPreview$$anon$1.$anonfun$previewImage$2(PdfboxPreview.scala:29) at docspell.extract.pdfbox.PdfboxPreview$$anon$1$$Lambda$1975/0x000000010098a040.apply(Unknown Source) at cats.effect.internals.IORunLoop$.cats$effect$internals$IORunLoop$$loop(IORunLoop.scala:104) at cats.effect.internals.IORunLoop$.restartCancelable(IORunLoop.scala:51) at cats.effect.internals.IOBracket$BracketStart.run(IOBracket.scala:100) at cats.effect.internals.Trampoline.cats$effect$internals$Trampoline$$immediateLoop(Trampoline.scala:67) [blaze-acceptor-0-0] [34mINFO [0;39m [36mo.h.b.c.ServerChannel[0;39m - Closing NIO1 channel /0:0:0:0:0:0:0:0:7878 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-0 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-1 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-2 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-3 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-4 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-5 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-6 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-7 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-8 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-acceptor-0-0 [shutdownHook1] [34mINFO [0;39m [36md.b.ops.ONode[0;39m - Unregister app joex1 [ioapp-compute-1] [34mINFO [0;39m [36mc.z.h.HikariDataSource[0;39m - HikariPool-1 - Shutdown initiated... [ioapp-compute-1] [34mINFO [0;39m [36mc.z.h.HikariDataSource[0;39m - HikariPool-1 - Shutdown completed. [ioapp-compute-1] [34mINFO [0;39m [36mo.h.c.PoolManager[0;39m - Shutting down connection pool: curAllocated=0 idleQueues.size=0 waitQueue.size=0 maxWaitQueueLimit=256 closed=false Starting unoconv listener [main] [34mINFO [0;39m [36md.joex.Main[0;39m - Using given config file: /opt/docspell.conf [main] [34mINFO [0;39m [36md.joex.Main[0;39m - ***> ______ _ _ ***> | _ \ | | | ***> | | | |___ ___ ___ _ __ ___| | | ***> | | | / _ \ / __/ __| '_ \ / _ \ | | ***> | |/ / (_) | (__\__ \ |_) | __/ | | ***> |___/ \___/ \___|___/ .__/ \___|_|_| ***> | | ***> |_| v0.20.0 (#4d3a25a8) ***> << JOEX >> ***> Id: joex1 ***> Base-Url: http://192.168.178.4:7878 ***> Database: jdbc:postgresql://192.168.178.6:5432/docspell ***> Fts: http://192.168.178.2:8983/solr/docspell ***> Config: /opt/docspell.conf ***> [ioapp-compute-0] [34mINFO [0;39m [36md.s.m.FlywayMigrate[0;39m - Running db migrations... [ioapp-compute-0] [34mINFO [0;39m [36md.s.m.FlywayMigrate[0;39m - Using migration locations: List(classpath:db/migration/postgresql) [ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.l.VersionPrinter[0;39m - Flyway Community Edition 7.5.3 by Redgate [ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.d.b.DatabaseType[0;39m - Database: jdbc:postgresql://192.168.178.6:5432/docspell (PostgreSQL 11.7) [ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.s.JdbcTableSchemaHistory[0;39m - Repair of failed migration in Schema History table "public"."flyway_schema_history" not necessary. No failed migration detected. [ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.c.DbRepair[0;39m - Successfully repaired schema history table "public"."flyway_schema_history" (execution time 00:00.067s). [ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.l.VersionPrinter[0;39m - Flyway Community Edition 7.5.3 by Redgate [ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.c.DbValidate[0;39m - Successfully validated 29 migrations (execution time 00:00.024s) [ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.c.DbMigrate[0;39m - Current version of schema "public": 1.20.4 [ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.c.DbMigrate[0;39m - Schema "public" is up to date. No migration necessary. [ioapp-compute-0] [34mINFO [0;39m [36md.a.n.PipelineCache[0;39m - Clearing StanfordNLP cache after Duration(900000ms) idle time [ioapp-compute-0] [34mINFO [0;39m [36md.a.n.PipelineCache[0;39m - Creating nlp pipeline cache [docspell-joex-dbconnect-0] [34mINFO [0;39m [36mc.z.h.HikariDataSource[0;39m - HikariPool-1 - Starting... [docspell-joex-dbconnect-0] [34mINFO [0;39m [36mc.z.h.HikariDataSource[0;39m - HikariPool-1 - Start completed. [ioapp-compute-5] [34mINFO [0;39m [36md.j.s.SchedulerImpl[0;39m - Starting scheduler [ioapp-compute-6] [34mINFO [0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Starting periodic scheduler [ioapp-compute-4] [34mINFO [0;39m [36md.b.ops.ONode[0;39m - Registering node joex1 [ioapp-compute-5] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Try to acquire permit (1 free) [ioapp-compute-6] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Going into main loop [ioapp-compute-6] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Looking for next periodic task [ioapp-compute-5] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - New permit acquired [ioapp-compute-3] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Found periodic task 'Docspell house-keeping/Sun *-*-* 00:00:00' [ioapp-compute-3] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Scheduling next notify for timer Sun *-*-* 00:00:00 -> Some(2021-03-07T00:00) [ioapp-compute-6] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Waiting for notify [ioapp-compute-7] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Next job found: Some(FUJ3Yznjz.../docspell-system/make-preview/Low) [ioapp-compute-7] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Creating context for job FUJ3Yznjz.../docspell-system/make-preview/Low to run JobTask(Ident(make-preview),docspell.joex.scheduler.Task$$anonfun$contramap$4@40fb8022,docspell.joex.scheduler.Task$$anonfun$contramap$4@6f62ed31) [ioapp-compute-4] [34mINFO [0;39m [36mo.h.b.c.n.NIO1SocketServerGroup[0;39m - Service bound to address /0:0:0:0:0:0:0:0:7878 [ioapp-compute-4] [34mINFO [0;39m [36mo.h.s.b.BlazeServerBuilder[0;39m - http4s v0.21.19 on blaze v0.14.15 started at http://[::]:7878/ [ioapp-compute-7] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Forking job FUJ3Yznjz.../docspell-system/make-preview/Low [ioapp-compute-7] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Try to acquire permit (0 free) [ioapp-compute-2] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Starting task now [ioapp-compute-3] [34mINFO [0;39m [36md.j.s.LogSink[0;39m - >>> 2021-03-05T14:01:34.611363Z Info FUJ3Yznjz.../docspell-system/make-preview/Low: Generating preview image for attachment Ident(7eo43raYQqy-ZzRwenWHSwM-fwZ8HTQ2Dqw-LJgUkvi2mPF) java.lang.OutOfMemoryError: Java heap space at java.desktop/java.awt.image.DataBufferByte.<init>(DataBufferByte.java:76) at java.desktop/java.awt.image.Raster.createInterleavedRaster(Raster.java:266) at java.desktop/com.sun.imageio.plugins.jpeg.JPEGImageReader.readInternal(JPEGImageReader.java:1228) at java.desktop/com.sun.imageio.plugins.jpeg.JPEGImageReader.readRaster(JPEGImageReader.java:1541) at com.twelvemonkeys.imageio.plugins.jpeg.JPEGImageReader.readImageAsRasterAndReplaceColorProfile(JPEGImageReader.java:502) at com.twelvemonkeys.imageio.plugins.jpeg.JPEGImageReader.read(JPEGImageReader.java:395) at org.apache.pdfbox.filter.DCTFilter.decode(DCTFilter.java:91) at org.apache.pdfbox.cos.COSInputStream.create(COSInputStream.java:80) at org.apache.pdfbox.cos.COSStream.createInputStream(COSStream.java:179) at org.apache.pdfbox.pdmodel.common.PDStream.createInputStream(PDStream.java:241) at org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.createInputStream(PDImageXObject.java:793) at org.apache.pdfbox.pdmodel.graphics.image.SampledImageReader.from8bit(SampledImageReader.java:517) at org.apache.pdfbox.pdmodel.graphics.image.SampledImageReader.getRGBImage(SampledImageReader.java:226) at org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.getImage(PDImageXObject.java:479) at org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.getImage(PDImageXObject.java:460) at org.apache.pdfbox.rendering.PageDrawer.drawImage(PageDrawer.java:1059) at org.apache.pdfbox.contentstream.operator.graphics.DrawObject.process(DrawObject.java:67) at org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:933) at org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:515) at org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:489) at org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:156) at org.apache.pdfbox.rendering.PageDrawer.drawPage(PageDrawer.java:275) at org.apache.pdfbox.rendering.PDFRenderer.renderImage(PDFRenderer.java:347) at org.apache.pdfbox.rendering.PDFRenderer.renderImage(PDFRenderer.java:268) at org.apache.pdfbox.rendering.PDFRenderer.renderImageWithDPI(PDFRenderer.java:240) at docspell.extract.pdfbox.PdfboxPreview$.docspell$extract$pdfbox$PdfboxPreview$$getPageImage(PdfboxPreview.scala:46) at docspell.extract.pdfbox.PdfboxPreview$$anon$1.$anonfun$previewImage$2(PdfboxPreview.scala:29) at docspell.extract.pdfbox.PdfboxPreview$$anon$1$$Lambda$1970/0x000000010099f840.apply(Unknown Source) at cats.effect.internals.IORunLoop$.cats$effect$internals$IORunLoop$$loop(IORunLoop.scala:104) at cats.effect.internals.IORunLoop$.restartCancelable(IORunLoop.scala:51) at cats.effect.internals.IOBracket$BracketStart.run(IOBracket.scala:100) at cats.effect.internals.Trampoline.cats$effect$internals$Trampoline$$immediateLoop(Trampoline.scala:67) [blaze-acceptor-0-0] [34mINFO [0;39m [36mo.h.b.c.ServerChannel[0;39m - Closing NIO1 channel /0:0:0:0:0:0:0:0:7878 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-0 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-1 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-2 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-3 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-4 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-5 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-6 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-7 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-8 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-acceptor-0-0 [shutdownHook1] [34mINFO [0;39m [36md.b.ops.ONode[0;39m - Unregister app joex1 [ioapp-compute-7] [34mINFO [0;39m [36mc.z.h.HikariDataSource[0;39m - HikariPool-1 - Shutdown initiated... [ioapp-compute-7] [34mINFO [0;39m [36mc.z.h.HikariDataSource[0;39m - HikariPool-1 - Shutdown completed. [ioapp-compute-7] [34mINFO [0;39m [36mo.h.c.PoolManager[0;39m - Shutting down connection pool: curAllocated=0 idleQueues.size=0 waitQueue.size=0 maxWaitQueueLimit=256 closed=false Starting unoconv listener [main] [34mINFO [0;39m [36md.joex.Main[0;39m - Using given config file: /opt/docspell.conf [main] [34mINFO [0;39m [36md.joex.Main[0;39m - ***> ______ _ _ ***> | _ \ | | | ***> | | | |___ ___ ___ _ __ ___| | | ***> | | | / _ \ / __/ __| '_ \ / _ \ | | ***> | |/ / (_) | (__\__ \ |_) | __/ | | ***> |___/ \___/ \___|___/ .__/ \___|_|_| ***> | | ***> |_| v0.20.0 (#4d3a25a8) ***> << JOEX >> ***> Id: joex1 ***> Base-Url: http://192.168.178.4:7878 ***> Database: jdbc:postgresql://192.168.178.6:5432/docspell ***> Fts: http://192.168.178.2:8983/solr/docspell ***> Config: /opt/docspell.conf ***> [ioapp-compute-0] [34mINFO [0;39m [36md.s.m.FlywayMigrate[0;39m - Running db migrations... [ioapp-compute-0] [34mINFO [0;39m [36md.s.m.FlywayMigrate[0;39m - Using migration locations: List(classpath:db/migration/postgresql) [ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.l.VersionPrinter[0;39m - Flyway Community Edition 7.5.3 by Redgate [ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.d.b.DatabaseType[0;39m - Database: jdbc:postgresql://192.168.178.6:5432/docspell (PostgreSQL 11.7) [ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.s.JdbcTableSchemaHistory[0;39m - Repair of failed migration in Schema History table "public"."flyway_schema_history" not necessary. No failed migration detected. [ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.c.DbRepair[0;39m - Successfully repaired schema history table "public"."flyway_schema_history" (execution time 00:00.070s). [ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.l.VersionPrinter[0;39m - Flyway Community Edition 7.5.3 by Redgate [ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.c.DbValidate[0;39m - Successfully validated 29 migrations (execution time 00:00.024s) [ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.c.DbMigrate[0;39m - Current version of schema "public": 1.20.4 [ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.c.DbMigrate[0;39m - Schema "public" is up to date. No migration necessary. [ioapp-compute-0] [34mINFO [0;39m [36md.a.n.PipelineCache[0;39m - Clearing StanfordNLP cache after Duration(900000ms) idle time [ioapp-compute-0] [34mINFO [0;39m [36md.a.n.PipelineCache[0;39m - Creating nlp pipeline cache [docspell-joex-dbconnect-0] [34mINFO [0;39m [36mc.z.h.HikariDataSource[0;39m - HikariPool-1 - Starting... [docspell-joex-dbconnect-0] [34mINFO [0;39m [36mc.z.h.HikariDataSource[0;39m - HikariPool-1 - Start completed. [ioapp-compute-5] [34mINFO [0;39m [36md.j.s.SchedulerImpl[0;39m - Starting scheduler [ioapp-compute-6] [34mINFO [0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Starting periodic scheduler [ioapp-compute-4] [34mINFO [0;39m [36md.b.ops.ONode[0;39m - Registering node joex1 [ioapp-compute-5] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Try to acquire permit (1 free) [ioapp-compute-6] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Going into main loop [ioapp-compute-6] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Looking for next periodic task [ioapp-compute-5] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - New permit acquired [ioapp-compute-1] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Found periodic task 'Docspell house-keeping/Sun *-*-* 00:00:00' [ioapp-compute-1] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Scheduling next notify for timer Sun *-*-* 00:00:00 -> Some(2021-03-07T00:00) [ioapp-compute-2] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Waiting for notify [ioapp-compute-0] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Next job found: Some(FUJ3Yznjz.../docspell-system/make-preview/Low) [ioapp-compute-0] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Creating context for job FUJ3Yznjz.../docspell-system/make-preview/Low to run JobTask(Ident(make-preview),docspell.joex.scheduler.Task$$anonfun$contramap$4@2d9cbbdd,docspell.joex.scheduler.Task$$anonfun$contramap$4@36799cf5) [ioapp-compute-4] [34mINFO [0;39m [36mo.h.b.c.n.NIO1SocketServerGroup[0;39m - Service bound to address /0:0:0:0:0:0:0:0:7878 [ioapp-compute-4] [34mINFO [0;39m [36mo.h.s.b.BlazeServerBuilder[0;39m - http4s v0.21.19 on blaze v0.14.15 started at http://[::]:7878/ [ioapp-compute-0] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Forking job FUJ3Yznjz.../docspell-system/make-preview/Low [ioapp-compute-0] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Try to acquire permit (0 free) [ioapp-compute-7] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Starting task now [ioapp-compute-5] [34mINFO [0;39m [36md.j.s.LogSink[0;39m - >>> 2021-03-05T14:02:44.438593Z Info FUJ3Yznjz.../docspell-system/make-preview/Low: Generating preview image for attachment Ident(7eo43raYQqy-ZzRwenWHSwM-fwZ8HTQ2Dqw-LJgUkvi2mPF) java.lang.OutOfMemoryError: Java heap space at java.desktop/java.awt.image.DataBufferByte.<init>(DataBufferByte.java:76) at java.desktop/java.awt.image.Raster.createInterleavedRaster(Raster.java:266) at java.desktop/com.sun.imageio.plugins.jpeg.JPEGImageReader.readInternal(JPEGImageReader.java:1228) at java.desktop/com.sun.imageio.plugins.jpeg.JPEGImageReader.readRaster(JPEGImageReader.java:1541) at com.twelvemonkeys.imageio.plugins.jpeg.JPEGImageReader.readImageAsRasterAndReplaceColorProfile(JPEGImageReader.java:502) at com.twelvemonkeys.imageio.plugins.jpeg.JPEGImageReader.read(JPEGImageReader.java:395) at org.apache.pdfbox.filter.DCTFilter.decode(DCTFilter.java:91) at org.apache.pdfbox.cos.COSInputStream.create(COSInputStream.java:80) at org.apache.pdfbox.cos.COSStream.createInputStream(COSStream.java:179) at org.apache.pdfbox.pdmodel.common.PDStream.createInputStream(PDStream.java:241) at org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.createInputStream(PDImageXObject.java:793) at org.apache.pdfbox.pdmodel.graphics.image.SampledImageReader.from8bit(SampledImageReader.java:517) at org.apache.pdfbox.pdmodel.graphics.image.SampledImageReader.getRGBImage(SampledImageReader.java:226) at org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.getImage(PDImageXObject.java:479) at org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.getImage(PDImageXObject.java:460) at org.apache.pdfbox.rendering.PageDrawer.drawImage(PageDrawer.java:1059) at org.apache.pdfbox.contentstream.operator.graphics.DrawObject.process(DrawObject.java:67) at org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:933) at org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:515) at org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:489) at org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:156) at org.apache.pdfbox.rendering.PageDrawer.drawPage(PageDrawer.java:275) at org.apache.pdfbox.rendering.PDFRenderer.renderImage(PDFRenderer.java:347) at org.apache.pdfbox.rendering.PDFRenderer.renderImage(PDFRenderer.java:268) at org.apache.pdfbox.rendering.PDFRenderer.renderImageWithDPI(PDFRenderer.java:240) at docspell.extract.pdfbox.PdfboxPreview$.docspell$extract$pdfbox$PdfboxPreview$$getPageImage(PdfboxPreview.scala:46) at docspell.extract.pdfbox.PdfboxPreview$$anon$1.$anonfun$previewImage$2(PdfboxPreview.scala:29) at docspell.extract.pdfbox.PdfboxPreview$$anon$1$$Lambda$1970/0x000000010099f840.apply(Unknown Source) at cats.effect.internals.IORunLoop$.cats$effect$internals$IORunLoop$$loop(IORunLoop.scala:104) at cats.effect.internals.IORunLoop$.restartCancelable(IORunLoop.scala:51) at cats.effect.internals.IOBracket$BracketStart.run(IOBracket.scala:100) at cats.effect.internals.Trampoline.cats$effect$internals$Trampoline$$immediateLoop(Trampoline.scala:67) [blaze-acceptor-0-0] [34mINFO [0;39m [36mo.h.b.c.ServerChannel[0;39m - Closing NIO1 channel /0:0:0:0:0:0:0:0:7878 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-0 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-1 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-2 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-3 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-4 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-5 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-6 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-7 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-8 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-acceptor-0-0 [shutdownHook1] [34mINFO [0;39m [36md.b.ops.ONode[0;39m - Unregister app joex1 [ioapp-compute-0] [34mINFO [0;39m [36mc.z.h.HikariDataSource[0;39m - HikariPool-1 - Shutdown initiated... [ioapp-compute-0] [34mINFO [0;39m [36mc.z.h.HikariDataSource[0;39m - HikariPool-1 - Shutdown completed. [ioapp-compute-0] [34mINFO [0;39m [36mo.h.c.PoolManager[0;39m - Shutting down connection pool: curAllocated=0 idleQueues.size=0 waitQueue.size=0 maxWaitQueueLimit=256 closed=false Starting unoconv listener [main] [34mINFO [0;39m [36md.joex.Main[0;39m - Using given config file: /opt/docspell.conf [main] [34mINFO [0;39m [36md.joex.Main[0;39m - ***> ______ _ _ ***> | _ \ | | | ***> | | | |___ ___ ___ _ __ ___| | | ***> | | | / _ \ / __/ __| '_ \ / _ \ | | ***> | |/ / (_) | (__\__ \ |_) | __/ | | ***> |___/ \___/ \___|___/ .__/ \___|_|_| ***> | | ***> |_| v0.20.0 (#4d3a25a8) ***> << JOEX >> ***> Id: joex1 ***> Base-Url: http://192.168.178.4:7878 ***> Database: jdbc:postgresql://192.168.178.6:5432/docspell ***> Fts: http://192.168.178.2:8983/solr/docspell ***> Config: /opt/docspell.conf ***> [ioapp-compute-0] [34mINFO [0;39m [36md.s.m.FlywayMigrate[0;39m - Running db migrations... [ioapp-compute-0] [34mINFO [0;39m [36md.s.m.FlywayMigrate[0;39m - Using migration locations: List(classpath:db/migration/postgresql) [ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.l.VersionPrinter[0;39m - Flyway Community Edition 7.5.3 by Redgate [ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.d.b.DatabaseType[0;39m - Database: jdbc:postgresql://192.168.178.6:5432/docspell (PostgreSQL 11.7) [ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.s.JdbcTableSchemaHistory[0;39m - Repair of failed migration in Schema History table "public"."flyway_schema_history" not necessary. No failed migration detected. [ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.c.DbRepair[0;39m - Successfully repaired schema history table "public"."flyway_schema_history" (execution time 00:00.069s). [ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.l.VersionPrinter[0;39m - Flyway Community Edition 7.5.3 by Redgate [ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.c.DbValidate[0;39m - Successfully validated 29 migrations (execution time 00:00.025s) [ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.c.DbMigrate[0;39m - Current version of schema "public": 1.20.4 [ioapp-compute-0] [34mINFO [0;39m [36mo.f.c.i.c.DbMigrate[0;39m - Schema "public" is up to date. No migration necessary. [ioapp-compute-0] [34mINFO [0;39m [36md.a.n.PipelineCache[0;39m - Clearing StanfordNLP cache after Duration(900000ms) idle time [ioapp-compute-0] [34mINFO [0;39m [36md.a.n.PipelineCache[0;39m - Creating nlp pipeline cache [docspell-joex-dbconnect-0] [34mINFO [0;39m [36mc.z.h.HikariDataSource[0;39m - HikariPool-1 - Starting... [docspell-joex-dbconnect-0] [34mINFO [0;39m [36mc.z.h.HikariDataSource[0;39m - HikariPool-1 - Start completed. [ioapp-compute-5] [34mINFO [0;39m [36md.j.s.SchedulerImpl[0;39m - Starting scheduler [ioapp-compute-6] [34mINFO [0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Starting periodic scheduler [ioapp-compute-4] [34mINFO [0;39m [36md.b.ops.ONode[0;39m - Registering node joex1 [ioapp-compute-5] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Try to acquire permit (1 free) [ioapp-compute-6] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Going into main loop [ioapp-compute-6] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Looking for next periodic task [ioapp-compute-5] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - New permit acquired [ioapp-compute-1] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Found periodic task 'Docspell house-keeping/Sun *-*-* 00:00:00' [ioapp-compute-1] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Scheduling next notify for timer Sun *-*-* 00:00:00 -> Some(2021-03-07T00:00) [ioapp-compute-7] [39mDEBUG[0;39m [36md.j.s.PeriodicSchedulerImpl[0;39m - Waiting for notify [ioapp-compute-3] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Next job found: Some(FUJ3Yznjz.../docspell-system/make-preview/Low) [ioapp-compute-3] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Creating context for job FUJ3Yznjz.../docspell-system/make-preview/Low to run JobTask(Ident(make-preview),docspell.joex.scheduler.Task$$anonfun$contramap$4@291b2e71,docspell.joex.scheduler.Task$$anonfun$contramap$4@2992d164) [ioapp-compute-4] [34mINFO [0;39m [36mo.h.b.c.n.NIO1SocketServerGroup[0;39m - Service bound to address /0:0:0:0:0:0:0:0:7878 [ioapp-compute-4] [34mINFO [0;39m [36mo.h.s.b.BlazeServerBuilder[0;39m - http4s v0.21.19 on blaze v0.14.15 started at http://[::]:7878/ [ioapp-compute-3] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Forking job FUJ3Yznjz.../docspell-system/make-preview/Low [ioapp-compute-3] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Try to acquire permit (0 free) [ioapp-compute-2] [39mDEBUG[0;39m [36md.j.s.SchedulerImpl[0;39m - Starting task now [ioapp-compute-6] [34mINFO [0;39m [36md.j.s.LogSink[0;39m - >>> 2021-03-05T14:04:53.638169Z Info FUJ3Yznjz.../docspell-system/make-preview/Low: Generating preview image for attachment Ident(7eo43raYQqy-ZzRwenWHSwM-fwZ8HTQ2Dqw-LJgUkvi2mPF) java.lang.OutOfMemoryError: Java heap space at java.desktop/java.awt.image.DataBufferByte.<init>(DataBufferByte.java:76) at java.desktop/java.awt.image.Raster.createInterleavedRaster(Raster.java:266) at java.desktop/com.sun.imageio.plugins.jpeg.JPEGImageReader.readInternal(JPEGImageReader.java:1228) at java.desktop/com.sun.imageio.plugins.jpeg.JPEGImageReader.readRaster(JPEGImageReader.java:1541) at com.twelvemonkeys.imageio.plugins.jpeg.JPEGImageReader.readImageAsRasterAndReplaceColorProfile(JPEGImageReader.java:502) at com.twelvemonkeys.imageio.plugins.jpeg.JPEGImageReader.read(JPEGImageReader.java:395) at org.apache.pdfbox.filter.DCTFilter.decode(DCTFilter.java:91) at org.apache.pdfbox.cos.COSInputStream.create(COSInputStream.java:80) at org.apache.pdfbox.cos.COSStream.createInputStream(COSStream.java:179) at org.apache.pdfbox.pdmodel.common.PDStream.createInputStream(PDStream.java:241) at org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.createInputStream(PDImageXObject.java:793) at org.apache.pdfbox.pdmodel.graphics.image.SampledImageReader.from8bit(SampledImageReader.java:517) at org.apache.pdfbox.pdmodel.graphics.image.SampledImageReader.getRGBImage(SampledImageReader.java:226) at org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.getImage(PDImageXObject.java:479) at org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.getImage(PDImageXObject.java:460) at org.apache.pdfbox.rendering.PageDrawer.drawImage(PageDrawer.java:1059) at org.apache.pdfbox.contentstream.operator.graphics.DrawObject.process(DrawObject.java:67) at org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:933) at org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:515) at org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:489) at org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:156) at org.apache.pdfbox.rendering.PageDrawer.drawPage(PageDrawer.java:275) at org.apache.pdfbox.rendering.PDFRenderer.renderImage(PDFRenderer.java:347) at org.apache.pdfbox.rendering.PDFRenderer.renderImage(PDFRenderer.java:268) at org.apache.pdfbox.rendering.PDFRenderer.renderImageWithDPI(PDFRenderer.java:240) at docspell.extract.pdfbox.PdfboxPreview$.docspell$extract$pdfbox$PdfboxPreview$$getPageImage(PdfboxPreview.scala:46) at docspell.extract.pdfbox.PdfboxPreview$$anon$1.$anonfun$previewImage$2(PdfboxPreview.scala:29) at docspell.extract.pdfbox.PdfboxPreview$$anon$1$$Lambda$1968/0x000000010099ac40.apply(Unknown Source) at cats.effect.internals.IORunLoop$.cats$effect$internals$IORunLoop$$loop(IORunLoop.scala:104) at cats.effect.internals.IORunLoop$.restartCancelable(IORunLoop.scala:51) at cats.effect.internals.IOBracket$BracketStart.run(IOBracket.scala:100) at cats.effect.internals.Trampoline.cats$effect$internals$Trampoline$$immediateLoop(Trampoline.scala:67) [blaze-acceptor-0-0] [34mINFO [0;39m [36mo.h.b.c.ServerChannel[0;39m - Closing NIO1 channel /0:0:0:0:0:0:0:0:7878 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-0 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-1 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-2 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-3 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-4 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-5 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-6 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-7 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-selector-8 [shutdownHook1] [34mINFO [0;39m [36mo.h.b.c.n.SelectorLoop[0;39m - Shutting down SelectorLoop blaze-acceptor-0-0 [shutdownHook1] [34mINFO [0;39m [36md.b.ops.ONode[0;39m - Unregister app joex1 [ioapp-compute-3] [34mINFO [0;39m [36mc.z.h.HikariDataSource[0;39m - HikariPool-1 - Shutdown initiated... [ioapp-compute-3] [34mINFO [0;39m [36mc.z.h.HikariDataSource[0;39m - HikariPool-1 - Shutdown completed. [ioapp-compute-3] [34mINFO [0;39m [36mo.h.c.PoolManager[0;39m - Shutting down connection pool: curAllocated=0 idleQueues.size=0 waitQueue.size=0 maxWaitQueueLimit=256 closed=false Quote Link to comment
vakilando Posted March 5, 2021 Share Posted March 5, 2021 hmm, "java.lang.OutOfMemoryError: Java heap space" ist offensichtlich das Problem..... Vielleicht waren 300 Belege zu viel? Sind die einzelnen Dateien groß? Wieviele wurden erfolgreich importiert? Kannst du die (noch nicht importierten) Belege aus dem Importverzeichnis entfernen und schauen ob joex dann nicht mehr beendet wird? Es gibt auf Github ein issue (geschlossen) dem ich aber nicht entnehmen kann wie das Problem gelöst wurde: issue 284 In der Doku steht auch was, aber auch hier kann ich nicht herauslesen wie der Wert gesetzt wird: hier unter Memory usage Du kannst probieren im Joex Container in der "advanced view" in den "extra parameters" einen Wert zu setzen. In der Doku steht "When using mode=full, a heap setting of at least -Xmx1400M is recommended" Setze in den "extra parameters" mal folgendes: -e JAVA_OPTS="-Xmx2500m" oder -e JAVA_OPTS="-Xmx2500m -Xms256m" wenn du auch ein Minimum festlegen willst. Ansonsten könntest du noch ein neues Issue in Github eröffnen. Quote Link to comment
vakilando Posted March 5, 2021 Share Posted March 5, 2021 Auch das Anlegen einer Variable sollte möglich sein (Joex Container => Edit => Add another Path, Port, Variable, Label or Device => ...) Wenn das funktioniert könnte ich in mein Template auch diese neue Variable definieren. Die wäre dann aber immer da und müsste eventuell von Hand angepasst werden... (...) <Variable> <Value>-Xms256m -Xmx2500m</Value> <Name>JAVA_OPTS</Name> <Mode/> </Variable> </Environment> (...) <Config Name="JAVA_OPTS" Target="JAVA_OPTS" Default="-Xms256m -Xmx2500m" Mode="" Description="Container Variable: JAVA_OPTS" Type="Variable" Display="always" Required="false" Mask="false">-Xms256m -Xmx2500m</Config> (...) Quote Link to comment
Hoddl Posted March 5, 2021 Author Share Posted March 5, 2021 hab die jetzt mal alle raus gelöscht... wird immer noch gestoppt... Ich teste mal die extra parameter... stoppe dann alle und starte sie dann wieder alle... Quote Link to comment
Hoddl Posted March 5, 2021 Author Share Posted March 5, 2021 -e JAVA_OPTS="-Xmx2500m -Xms256m" hat geholfen... die grösse der files sind immer unter 1mb... erfolgreich wurden nur 2 Stück bearbeitet... ich teste mal dann neu und melde mich dann wieder Danke schon mal Quote Link to comment
Hoddl Posted March 6, 2021 Author Share Posted March 6, 2021 Hab jetzt 5 files in den ordner kopiert. docspell legt auch gleich los. doch seit über 5 minuten versucht joex das pdf zu erfassen... hier mal der process: 2021-03-06T0:00:08: ============ Start processing 2017-01-02_Ausgabe_4685_MEDIAMARKT NÜRNBERG GMBH.pdf ============ 2021-03-06T0:00:08: Not checking for duplicates 2021-03-06T0:00:08: Creating new item with 1 attachment(s) 2021-03-06T0:00:08: Creating item finished in 42 ms 2021-03-06T0:00:08: Not an archive: application/pdf 2021-03-06T0:00:08: Converting file Some(2017-01-02_Ausgabe_4685_MEDIAMARKT NÜRNBERG GMBH.pdf) (application/pdf) into a PDF 2021-03-06T0:00:08: Storing input to file /tmp/docspell-convert/docspell-ocrmypdf11422737783394336992/infile for running ocrmypdf 2021-03-06T0:00:08: Running external command: ocrmypdf -l deu --skip-text --deskew -j 1 /tmp/docspell-convert/docspell-ocrmypdf11422737783394336992/infile /tmp/docspell-convert/docspell-ocrmypdf11422737783394336992/out.pdf 2021-03-06T0:02:22: Command `ocrmypdf -l deu --skip-text --deskew -j 1 /tmp/docspell-convert/docspell-ocrmypdf11422737783394336992/infile /tmp/docspell-convert/docspell-ocrmypdf11422737783394336992/out.pdf` finished: 0 2021-03-06T0:02:22: ocrmypdf stdout: 2021-03-06T0:02:22: ocrmypdf stderr: 1 /usr/lib/python3.8/site-packages/PIL/Image.py:2832: DecompressionBombWarning: Image size (151037461 pixels) exceeds limit of 128000000 pixels, could be decompression bomb DOS attack. warnings.warn( Postprocessing... /usr/lib/python3.8/site-packages/PIL/Image.py:2832: DecompressionBombWarning: Image size (151037461 pixels) exceeds limit of 128000000 pixels, could be decompression bomb DOS attack. warnings.warn( Optimize ratio: 1.00 savings: -0.0% Image optimization did not improve the file - discarded Output file is a PDF/A-2B (as expected) The output file size is 5.92× larger than the input file. Possible reasons for this include: The argument --deskew was issued, causing transcoding. PDF/A conversion was enabled. (Try `--output-type pdf`.) 2021-03-06T0:02:22: Conversion to pdf successful. Saving file. 2021-03-06T0:02:22: Closing process: `ocrmypdf -l deu --skip-text --deskew -j 1 /tmp/docspell-convert/docspell-ocrmypdf11422737783394336992/infile /tmp/docspell-convert/docspell-ocrmypdf11422737783394336992/out.pdf` 2021-03-06T0:02:22: Starting text extraction for 1 files 2021-03-06T0:02:22: Extracting text for attachment 2017-01-02_Ausgabe_4685_MEDIAMARKT NÜRNBERG GMBH.converted 2021-03-06T0:02:22: Trying to strip text from pdf using pdfbox. 2021-03-06T0:02:22: Stripped text from PDF is small (0). Trying with OCR. 2021-03-06T0:02:22: Running external command: gs -dLastPage=10 -dNOPAUSE -dBATCH -dSAFER -sDEVICE=tiffscaled8 -sOutputFile=%d.tif - 2021-03-06T0:02:42: Command `gs -dLastPage=10 -dNOPAUSE -dBATCH -dSAFER -sDEVICE=tiffscaled8 -sOutputFile=%d.tif -` finished: 0 2021-03-06T0:02:42: Running external command: unpaper /tmp/docspell-extraction/extractpdf8837783680149879878/1.tif /tmp/docspell-extraction/extractpdf8837783680149879878/u-1.tif 2021-03-06T0:02:42: Command `unpaper /tmp/docspell-extraction/extractpdf8837783680149879878/1.tif /tmp/docspell-extraction/extractpdf8837783680149879878/u-1.tif` finished: 1 2021-03-06T0:02:42: Closing process: `unpaper /tmp/docspell-extraction/extractpdf8837783680149879878/1.tif /tmp/docspell-extraction/extractpdf8837783680149879878/u-1.tif` 2021-03-06T0:02:42: Running external command: tesseract 1.tif stdout -l deu 2021-03-06T0:03:48: Command `tesseract 1.tif stdout -l deu` finished: 0 2021-03-06T0:03:48: Closing process: `tesseract 1.tif stdout -l deu` 2021-03-06T0:03:48: Closing process: `gs -dLastPage=10 -dNOPAUSE -dBATCH -dSAFER -sDEVICE=tiffscaled8 -sOutputFile=%d.tif -` 2021-03-06T0:03:48: Using stripped text (not OCR), as it is longer (0 > 0) 2021-03-06T0:03:48: Extracting text for attachment 2017-01-02_Ausgabe_4685_MEDIAMARKT NÜRNBERG GMBH.converted finished in 86194 ms 2021-03-06T0:03:48: Storing extracted texts … 2021-03-06T0:03:48: Extracted text stored. 2021-03-06T0:03:48: Updating SOLR index 2021-03-06T0:03:48: Text extraction finished in 86248 ms. 2021-03-06T0:03:48: Creating preview images for 1 files… Quote Link to comment
vakilando Posted March 6, 2021 Share Posted March 6, 2021 8 hours ago, Hoddl said: -e JAVA_OPTS="-Xmx2500m -Xms256m" hat geholfen... Das ist schon mal gut! Mal schauen ob ich das in das Template einbaue. Das joex log sieht ansich ganz gut aus. Auf fallend ist: 7 hours ago, Hoddl said: DecompressionBombWarning: Image size (151037461 pixels) exceeds limit of 128000000 pixels, could be decompression bomb DOS attack. warnings.warn( Optimize ratio: 1.00 savings: -0.0% Image optimization did not improve the file - discarded Output file is a PDF/A-2B (as expected) The output file size is 5.92× larger than the input file. ...aber der Output zu einer PDF/A-2B ist erfolgreich... Wenn auch Sie fast 6 mal so groß ist wie das Original... Auch erfolgreich: die Text Extraktion und die Aktualisierung der Volltextsuche. 8 hours ago, Hoddl said: Extracted text stored. 2021-03-06T0:03:48: Updating SOLR index 2021-03-06T0:03:48: Text extraction finished in 86248 ms. 2021-03-06T0:03:48: Creating preview images for 1 files… Allerdings scheint er bei der Erstellung des Vorschaubildes stehen zu bleiben... Ist das Dokument nachher evtl. ohne Vorschaubild in der webui zu finden? Quote Link to comment
Hoddl Posted March 6, 2021 Author Share Posted March 6, 2021 nein ist nichts zu finden.. so sieht die Warteschlange aus: Quote Link to comment
Hoddl Posted March 6, 2021 Author Share Posted March 6, 2021 und der joex steht wieder 😞 Quote Link to comment
Hoddl Posted March 6, 2021 Author Share Posted March 6, 2021 ich hab joex noch mal neu aufgesetzt und diesmal mit -e JAVA_OPTS="-Xmx2500m" also ohne das Minimum... mal sehen was passiert wenn ich wieder ein paar auf einmal hochlade... Quote Link to comment
Hoddl Posted March 6, 2021 Author Share Posted March 6, 2021 wieder das gleiche joex steht dann irgendwann... ich werde die Variable jetzt mal einbauen... mal sehen ob es was bringt... (...) <Variable> <Value>-Xms256m -Xmx2500m</Value> <Name>JAVA_OPTS</Name> <Mode/> </Variable> </Environment> (...) <Config Name="JAVA_OPTS" Target="JAVA_OPTS" Default="-Xms256m -Xmx2500m" Mode="" Description="Container Variable: JAVA_OPTS" Type="Variable" Display="always" Required="false" Mask="false">-Xms256m -Xmx2500m</Config> (...) Quote Link to comment
Hoddl Posted March 6, 2021 Author Share Posted March 6, 2021 wo soll den die Variable eingetragen werden 🙂 Quote Link to comment
vakilando Posted March 6, 2021 Share Posted March 6, 2021 Name ist der von dir vergebene Name Schlüssel ist der Variablen Namen Wert ist der Variablenwert Brauchst du aber nicht, denn ob über extra parameters oder so bleibt sich gleich. Geh bitte in die Prozessliste/Warteschlange und lösche die nicht erfolgreich beendeten tasks (Kreuzchen oben rechts) Quote Link to comment
Hoddl Posted March 6, 2021 Author Share Posted March 6, 2021 ok hab ich gemacht. Nun hab ich mal in den Ordner unter docs nachgeschaut hier hat docspell die Belege einfach drinnen gelassen und evtl. immer wieder neu angefangen?! Diesen Ordner werde ich erst mal nicht mehr nehmen um docspell mit Daten zu füttern... Ich komme deswegen da drauf da in der Warteschlange immer wieder Belege von 18 Uhr aufgetaucht sind. Jetzt hab ich mal ganz normal 10 Stück hochgeladen mal schauen ob es abgearbeitet wird. Ich melde mich wieder 🙂 Quote Link to comment
Hoddl Posted March 6, 2021 Author Share Posted March 6, 2021 Joex hat mal wieder gestoppt 😞 irgendwas mag er bei mir nicht 😞 ich lade mal nur die roten und die gelben fehler aus dem log hoch sonst wird es wieder meterlang 🙂 Error: /BBox has zero width or height, which is not allowed. [docspell-joex-blocking-9] [31mWARN [0;39m [36md.b.ops.OItem[0;39m - Error updating full-text index: unexpected HTTP status: 500 Server Error [ioapp-compute-3] [39mDEBUG[0;39m [36md.s.q.QItem[0;39m - FindByChecksum: Fragment("SELECT DISTINCT i.itemid, i.cid, i.name, i.itemdate, i.source, i.incoming, i.state, i.corrorg, i.corrperson, i.concperson, i.concequipment, i.inreplyto, i.duedate, i.created, i.updated, i.notes, i.folder_id FROM item i INNER JOIN attachment a ON a.itemid = i.itemid INNER JOIN attachment_source s ON s.id = a.attachid INNER JOIN filemeta m1 ON m1.id = a.filemetaid INNER JOIN filemeta m2 ON m2.id = s.file_id LEFT JOIN attachment_archive r ON r.id = a.attachid LEFT JOIN filemeta m3 ON m3.id = r.file_id WHERE (i.cid = ? AND (m1.checksum = ? OR m2.checksum = ? OR m3.checksum = ? ) AND (m1.id is null OR NOT m1.id IN (? )) AND (m2.id is null OR NOT m2.id IN (? )) AND (m3.id is null OR NOT m3.id IN (? )))") Quote Link to comment
vakilando Posted March 7, 2021 Share Posted March 7, 2021 (edited) 5 hours ago, Hoddl said: Nun hab ich mal in den Ordner unter docs nachgeschaut hier hat docspell die Belege einfach drinnen gelassen Standardmäßig verbleiben dort die Dokumente. Es ist wie ein "Fileserver" zu sehen. Dokumente die dort hin gelegt werden können auch umbenannt und in einer Ordnerstruktur organisiert werden. Docspell erkennt bereits hochgeladene Dokumente und wird sie - auch nach "Reorganisierung - nicht nocheinmal importieren. Siehe https://docspell.org/docs/feed/#scanners-watch-directories Es gibt jedoch auch die Möglichkeit dieses Consumedir Verzeichnis ("docs") aufzuräumen. Dokumente werden dann entweder "archiviert" oder gelöscht, je nach Einstellung. Siehe https://docspell.org/docs/tools/consumedir-cleaner/#introduction 5 hours ago, Hoddl said: Error: /BBox has zero width or height, which is not allowed. Das ist ein Fehler von Ghostscript bei der Konvertierung von Bildern...(?) Bitte ein issue auf Github eröffnen, da kann ich mir auch keinen Reim draus machen. Vorher vielleicht noch mal alle Container in der richtigen Reihenfolge (postfix > solr > joex > restserver > consumedir) mit einigen Sekunden Abstand nach erfolgtem Start (schau ins Log des Containers ob der Start erfolgreich und beendet ist) neu starten. 5 hours ago, Hoddl said: [docspell-joex-blocking-9] [31mWARN [0;39m [36md.b.ops.OItem[0;39m - Error updating full-text index: unexpected HTTP status: 500 Server Error Das ist ein Fehler bei der Erstellung des Volltextindexes. Schaut so aus als würde Solr nicht (ordentlich) laufen...? Du kannst mal auf http://deine.solr.ip.adresse/solr/#/ gehen und schauen ob du da was findest, allerdings kenne ich mich da auch nicht aus.... Hast du Solr zwischenzeitlich aktualisiert? Ich habe nämlich eine Aktualisierung mitgemacht, da hatte der Entwickler (bitnami) etwas grundlegendes geändert, da musste ich den Index neu aufbauen. Schau mal in Docspell ob deine Volltextsuche noch funktioniert. Edited March 7, 2021 by vakilando added missing infos Quote Link to comment
Hoddl Posted March 7, 2021 Author Share Posted March 7, 2021 danke für die hilfe... das werde ich alles morgen mal durchgehen.... Quote Link to comment
Hoddl Posted March 7, 2021 Author Share Posted March 7, 2021 ich dachte ja das es vielleicht mit den selbst gescannten belege Probleme gibt. Das kann ich ausschliesen habe ich heut getestet. Ich hab dann in solr reingeschaut und ich denke das das RAM nicht reicht: Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.