pika Posted January 14, 2023 Share Posted January 14, 2023 Hello! I'm on unraid 6.11.2 and my system went down 3 days ago (no idea why, no power shortage). after new start i saw parity check starting and i did not inquire further (no time atm). today i opened the dashboard and there are were several things missing: all the infos in the marked areas were missing (while i was typing the first lines of this post they came back)... i was not able to download diagnostics, also working again now. i saw these errors in the syslog: Jan 13 20:24:18 DataTower kernel: pcieport 0000:00:01.2: AER: Corrected error received: 0000:00:01.0 Jan 13 20:24:18 DataTower kernel: pcieport 0000:00:01.2: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Receiver ID) Jan 13 20:24:18 DataTower kernel: pcieport 0000:00:01.2: device [1022:15d3] error status/mask=00000040/00006000 Jan 13 20:24:18 DataTower kernel: pcieport 0000:00:01.2: [ 6] BadTLP Jan 13 20:24:18 DataTower kernel: pcieport 0000:00:01.2: AER: Multiple Corrected error received: 0000:00:01.0 Jan 13 20:24:18 DataTower kernel: pcieport 0000:00:01.2: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID) Jan 13 20:24:18 DataTower kernel: pcieport 0000:00:01.2: device [1022:15d3] error status/mask=00001040/00006000 Jan 13 20:24:18 DataTower kernel: pcieport 0000:00:01.2: [ 6] BadTLP Jan 13 20:24:18 DataTower kernel: pcieport 0000:00:01.2: [12] Timeout Jan 13 20:24:18 DataTower kernel: pcieport 0000:00:01.2: AER: Corrected error received: 0000:00:01.0 Jan 13 20:24:18 DataTower kernel: pcieport 0000:00:01.2: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Receiver ID) Jan 13 20:24:18 DataTower kernel: pcieport 0000:00:01.2: device [1022:15d3] error status/mask=00000040/00006000 Jan 13 20:24:18 DataTower kernel: pcieport 0000:00:01.2: [ 6] BadTLP Jan 13 20:24:48 DataTower kernel: pcieport 0000:00:01.2: AER: Corrected error received: 0000:00:01.0 Jan 13 20:24:48 DataTower kernel: pcieport 0000:00:01.2: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Receiver ID) Jan 13 20:24:48 DataTower kernel: pcieport 0000:00:01.2: device [1022:15d3] error status/mask=00000040/00006000 Jan 13 20:24:48 DataTower kernel: pcieport 0000:00:01.2: [ 6] BadTLP Jan 13 20:24:51 DataTower kernel: pcieport 0000:00:01.2: AER: Corrected error received: 0000:00:01.0 Jan 13 20:24:51 DataTower kernel: pcieport 0000:00:01.2: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Receiver ID) Jan 13 20:24:51 DataTower kernel: pcieport 0000:00:01.2: device [1022:15d3] error status/mask=00000040/00006000 Jan 13 20:24:51 DataTower kernel: pcieport 0000:00:01.2: [ 6] BadTLP no idea what this means. looks like everything is working for now but i think there is something wrong with my system... could somebody please take a look at my diagnostics? have a nice day! datatower-diagnostics-20230114-1024.zip Quote Link to comment
JorgeB Posted January 14, 2023 Share Posted January 14, 2023 51 minutes ago, pika said: i saw these errors in the syslog: Try this: https://forums.unraid.net/topic/118286-nvme-drives-throwing-errors-filling-logs-instantly-how-to-resolve/?do=findComment&comment=1165009 Quote Link to comment
JorgeB Posted January 14, 2023 Share Posted January 14, 2023 Nothing else obvious on the logs, but you should update to latest release, that one has known issues. Quote Link to comment
pika Posted January 14, 2023 Author Share Posted January 14, 2023 (edited) 9 hours ago, JorgeB said: Nothing else obvious on the logs, but you should update to latest release, that one has known issues. huh, tried the update. server didn't boot after "restart server now". had to press the power button... that shouldn't happen, right? edit: parity check runs again after manual boot. so i guess there was something wrong during the update? should i provice another diagnostics? Edited January 14, 2023 by pika Quote Link to comment
Brucey7 Posted January 15, 2023 Share Posted January 15, 2023 (edited) I have this problem too, been having it for a while, it appears the flash drive is filling up somehow, it takes a week or so Edited January 15, 2023 by Brucey7 context Quote Link to comment
itimpi Posted January 15, 2023 Share Posted January 15, 2023 6 hours ago, Brucey7 said: I have this problem too, been having it for a while, it appears the flash drive is filling up somehow, it takes a week or so Do you have something like mover logging enabled, or the syslog server with option to mirror to flash. Even so a bit unusual for either of these to cause this sort of problem. I expect it would be relatively obvious what the culprit is by examining the contents of the flash drive when it starts getting full to see what is taking up the space? Quote Link to comment
Brucey7 Posted January 15, 2023 Share Posted January 15, 2023 I don't use mover, syslog server is not mirrored to flash. I haven't tried to look at the log file when it happens, no drives or shares are shown but clients can still access shares, reboot and shutdown tabs don't work. I will try and look at the log file next time it happens. Quote Link to comment
itimpi Posted January 15, 2023 Share Posted January 15, 2023 55 minutes ago, Brucey7 said: will try and look at the log file next time it happens It might not be the log file(s), but it should be easy enough to identify which files are taking up all the space on the flash drive. Quote Link to comment
Brucey7 Posted January 18, 2023 Share Posted January 18, 2023 Attached is the diagnostics file, something strange is going on tower2-diagnostics-20230118-0738.zip Quote Link to comment
JorgeB Posted January 18, 2023 Share Posted January 18, 2023 Lots of nginx errors: Jan 17 09:10:49 Tower2 nginx: 2023/01/17 09:10:49 [crit] 3371#3371: accept4() failed (24: Too many open files) Jan 17 09:10:52 Tower2 nginx: 2023/01/17 09:10:52 [error] 3371#3371: OUTPUT:can't create output chain, file in buffer won't open Jan 17 09:10:53 Tower2 nginx: 2023/01/17 09:10:53 [error] 3371#3371: OUTPUT:can't create output chain, file in buffer won't open Jan 17 09:10:54 Tower2 nginx: 2023/01/17 09:10:54 [error] 3371#3371: OUTPUT:can't create output chain, file in buffer won't open Jan 17 09:10:56 Tower2 nginx: 2023/01/17 09:10:56 [error] 3371#3371: OUTPUT:can't create output chain, file in buffer won't open Jan 17 09:10:56 Tower2 nginx: 2023/01/17 09:10:56 [crit] 3371#3371: accept4() failed (24: Too many open files) Jan 17 09:10:57 Tower2 nginx: 2023/01/17 09:10:57 [crit] 3371#3371: accept4() failed (24: Too many open files) Try booting in safe mode Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.