easoncyh Posted September 7, 2024 Posted September 7, 2024 Dear everyone, I would like seek help on server crashes and segfault errors regarding smartctl and CPU. I've installed Unraid from scratch and it is up and running. I've started preclear task on my newly installed harddisks this morning. Then, I notice that I cannot access the WebUI this evening. In the end, I have to power off the server by pressing the power button and then power on the server again to resume the Unraid server. I check the previous syslog and found the segfault errors. I would like to know if those errors are related to bad RAM, CPU or harddisk. Sep 7 09:51:32 Tower sshd-session[18361]: Disconnected from user root 192.168.68.56 port 63310 Sep 7 09:51:32 Tower sshd-session[18013]: pam_unix(sshd:session): session closed for user root Sep 7 09:51:32 Tower elogind-daemon[1083]: Removed session c1. Sep 7 10:07:26 Tower avahi-dnsconfd[5063]: read(): EOF Sep 7 11:28:47 Tower kernel: smartctl[22541]: segfault at 0 ip 0000000000000000 sp 00007ffd7686acc8 error 14 in smartctl[400000+4000] likely on CPU 1 (core 1, socket 0) Sep 7 11:28:47 Tower kernel: Code: Unable to access opcode bytes at 0xffffffffffffffd6. Sep 7 12:50:53 Tower kernel: smartctl[7450]: segfault at 0 ip 0000000000000000 sp 00007ffcea150ce8 error 14 in smartctl[400000+4000] likely on CPU 2 (core 0, socket 0) Sep 7 12:50:53 Tower kernel: Code: Unable to access opcode bytes at 0xffffffffffffffd6. Sep 7 13:27:52 Tower kernel: smartctl[15243]: segfault at 0 ip 0000000000000000 sp 00007fff172f0ce8 error 14 in smartctl[400000+4000] likely on CPU 2 (core 0, socket 0) Sep 7 13:27:52 Tower kernel: Code: Unable to access opcode bytes at 0xffffffffffffffd6. Sep 7 13:30:25 Tower kernel: smartctl_type[15770]: segfault at 0 ip 00000000008c0949 sp 00007ffd59e76330 error 4 in php[600000+3b3000] likely on CPU 1 (core 1, socket 0) Sep 7 13:30:25 Tower kernel: Code: 00 00 00 00 e9 1e ff ff ff e8 93 20 fe ff e9 29 fe ff ff 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 00 55 53 48 83 ec 08 48 8b 1f <80> 3b 02 0f 84 8e 00 00 00 48 8b 7b 08 f6 47 04 40 74 14 48 83 7b Sep 7 14:57:05 Tower kernel: smartctl[1618]: segfault at 0 ip 0000000000000000 sp 00007ffe33f39bb8 error 14 in smartctl[400000+4000] likely on CPU 2 (core 0, socket 0) Sep 7 14:57:05 Tower kernel: Code: Unable to access opcode bytes at 0xffffffffffffffd6. Sep 7 15:11:47 Tower kernel: smartctl_type[4727]: segfault at 4 ip 00000000008c0956 sp 00007ffc01948ad0 error 4 in php[600000+3b3000] likely on CPU 3 (core 1, socket 0) Sep 7 15:11:47 Tower kernel: Code: ff e9 29 fe ff ff 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 00 55 53 48 83 ec 08 48 8b 1f 80 3b 02 0f 84 8e 00 00 00 48 8b 7b 08 <f6> 47 04 40 74 14 48 83 7b 10 00 74 20 f6 43 07 02 74 67 48 83 c4 Sep 7 18:02:34 Tower kernel: smartctl_type[8341]: segfault at 0 ip 00000000008c0949 sp 00007ffebc95f750 error 4 in php[600000+3b3000] likely on CPU 3 (core 1, socket 0) Sep 7 18:02:34 Tower kernel: Code: 00 00 00 00 e9 1e ff ff ff e8 93 20 fe ff e9 29 fe ff ff 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 00 55 53 48 83 ec 08 48 8b 1f <80> 3b 02 0f 84 8e 00 00 00 48 8b 7b 08 f6 47 04 40 74 14 48 83 7b Sep 7 18:18:47 Tower kernel: smartctl_type[11746]: segfault at 0 ip 0000000000000000 sp 00007ffc8f0dca58 error 14 in php[400000+3b000] likely on CPU 0 (core 0, socket 0) Sep 7 18:18:47 Tower kernel: Code: Unable to access opcode bytes at 0xffffffffffffffd6. Sep 7 18:32:58 Tower kernel: smartctl[14753]: segfault at 0 ip 0000000000000000 sp 00007fff6a500bd8 error 14 in smartctl[400000+4000] likely on CPU 3 (core 1, socket 0) Sep 7 18:32:58 Tower kernel: Code: Unable to access opcode bytes at 0xffffffffffffffd6. Sep 7 19:01:52 Tower kernel: smartctl_type[20822]: segfault at 4 ip 00000000008c0956 sp 00007ffe0fe01fe0 error 4 in php[600000+3b3000] likely on CPU 0 (core 0, socket 0) Sep 7 19:01:52 Tower kernel: Code: ff e9 29 fe ff ff 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 00 55 53 48 83 ec 08 48 8b 1f 80 3b 02 0f 84 8e 00 00 00 48 8b 7b 08 <f6> 47 04 40 74 14 48 83 7b 10 00 74 20 f6 43 07 02 74 67 48 83 c4 Sep 7 19:03:53 Tower kernel: smartctl_type[21256]: segfault at 4 ip 000000000094a603 sp 00007fffe4deef78 error 4 in php[600000+3b3000] likely on CPU 3 (core 1, socket 0) Sep 7 19:03:53 Tower kernel: Code: 80 00 05 00 00 00 00 00 00 c3 66 66 2e 0f 1f 84 00 00 00 00 00 66 90 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 48 8b 3f <f6> 47 04 80 74 07 e9 52 a7 cb ff 66 90 e9 ab 83 f5 ff 66 66 2e 0f Sep 7 21:22:56 Tower webGUI: Successful login user root from 192.168.68.56 Sep 7 21:22:56 Tower webGUI: Invalid .page format: webGui/SMBWorkGroup.page Sep 7 21:22:56 Tower webGUI: Invalid .page format: webGui/TrimSettings.page Sep 7 21:23:04 Tower webGUI: Invalid .page format: webGui/SMBWorkGroup.page Sep 7 21:23:04 Tower webGUI: Invalid .page format: webGui/TrimSettings.page Additionally, I've attached diagnostics file in this post. Thank you for your help and support. tower-diagnostics-20240908-0047.zip Quote
easoncyh Posted September 8, 2024 Author Posted September 8, 2024 I will do it soon and report the result when completed. Thanks. Quote
easoncyh Posted September 8, 2024 Author Posted September 8, 2024 The memtest has run for 13 hours and no error is found. From previous discussions, the segfault error is hardware related. I ordered another a single 16GB RAM for my server. Finger crossed. Quote
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.