
Server crashes and segfault errors regarding smartctl and CPU


easoncyh


Posted

Dear everyone,

 

I would like to ask for help with server crashes and segfault errors involving smartctl and the CPU.  I installed Unraid from scratch and it is up and running.  This morning I started a preclear task on my newly installed hard disks.  This evening I noticed that I could not access the WebUI.  In the end, I had to power off the server by pressing the power button and then power it back on to get the Unraid server running again.

 

I checked the previous syslog and found the segfault errors below.  I would like to know whether these errors point to bad RAM, a bad CPU, or a bad hard disk.

 

Sep  7 09:51:32 Tower sshd-session[18361]: Disconnected from user root 192.168.68.56 port 63310
Sep  7 09:51:32 Tower sshd-session[18013]: pam_unix(sshd:session): session closed for user root
Sep  7 09:51:32 Tower elogind-daemon[1083]: Removed session c1.
Sep  7 10:07:26 Tower avahi-dnsconfd[5063]: read(): EOF
Sep  7 11:28:47 Tower kernel: smartctl[22541]: segfault at 0 ip 0000000000000000 sp 00007ffd7686acc8 error 14 in smartctl[400000+4000] likely on CPU 1 (core 1, socket 0)
Sep  7 11:28:47 Tower kernel: Code: Unable to access opcode bytes at 0xffffffffffffffd6.
Sep  7 12:50:53 Tower kernel: smartctl[7450]: segfault at 0 ip 0000000000000000 sp 00007ffcea150ce8 error 14 in smartctl[400000+4000] likely on CPU 2 (core 0, socket 0)
Sep  7 12:50:53 Tower kernel: Code: Unable to access opcode bytes at 0xffffffffffffffd6.
Sep  7 13:27:52 Tower kernel: smartctl[15243]: segfault at 0 ip 0000000000000000 sp 00007fff172f0ce8 error 14 in smartctl[400000+4000] likely on CPU 2 (core 0, socket 0)
Sep  7 13:27:52 Tower kernel: Code: Unable to access opcode bytes at 0xffffffffffffffd6.
Sep  7 13:30:25 Tower kernel: smartctl_type[15770]: segfault at 0 ip 00000000008c0949 sp 00007ffd59e76330 error 4 in php[600000+3b3000] likely on CPU 1 (core 1, socket 0)
Sep  7 13:30:25 Tower kernel: Code: 00 00 00 00 e9 1e ff ff ff e8 93 20 fe ff e9 29 fe ff ff 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 00 55 53 48 83 ec 08 48 8b 1f <80> 3b 02 0f 84 8e 00 00 00 48 8b 7b 08 f6 47 04 40 74 14 48 83 7b
Sep  7 14:57:05 Tower kernel: smartctl[1618]: segfault at 0 ip 0000000000000000 sp 00007ffe33f39bb8 error 14 in smartctl[400000+4000] likely on CPU 2 (core 0, socket 0)
Sep  7 14:57:05 Tower kernel: Code: Unable to access opcode bytes at 0xffffffffffffffd6.
Sep  7 15:11:47 Tower kernel: smartctl_type[4727]: segfault at 4 ip 00000000008c0956 sp 00007ffc01948ad0 error 4 in php[600000+3b3000] likely on CPU 3 (core 1, socket 0)
Sep  7 15:11:47 Tower kernel: Code: ff e9 29 fe ff ff 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 00 55 53 48 83 ec 08 48 8b 1f 80 3b 02 0f 84 8e 00 00 00 48 8b 7b 08 <f6> 47 04 40 74 14 48 83 7b 10 00 74 20 f6 43 07 02 74 67 48 83 c4
Sep  7 18:02:34 Tower kernel: smartctl_type[8341]: segfault at 0 ip 00000000008c0949 sp 00007ffebc95f750 error 4 in php[600000+3b3000] likely on CPU 3 (core 1, socket 0)
Sep  7 18:02:34 Tower kernel: Code: 00 00 00 00 e9 1e ff ff ff e8 93 20 fe ff e9 29 fe ff ff 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 00 55 53 48 83 ec 08 48 8b 1f <80> 3b 02 0f 84 8e 00 00 00 48 8b 7b 08 f6 47 04 40 74 14 48 83 7b
Sep  7 18:18:47 Tower kernel: smartctl_type[11746]: segfault at 0 ip 0000000000000000 sp 00007ffc8f0dca58 error 14 in php[400000+3b000] likely on CPU 0 (core 0, socket 0)
Sep  7 18:18:47 Tower kernel: Code: Unable to access opcode bytes at 0xffffffffffffffd6.
Sep  7 18:32:58 Tower kernel: smartctl[14753]: segfault at 0 ip 0000000000000000 sp 00007fff6a500bd8 error 14 in smartctl[400000+4000] likely on CPU 3 (core 1, socket 0)
Sep  7 18:32:58 Tower kernel: Code: Unable to access opcode bytes at 0xffffffffffffffd6.
Sep  7 19:01:52 Tower kernel: smartctl_type[20822]: segfault at 4 ip 00000000008c0956 sp 00007ffe0fe01fe0 error 4 in php[600000+3b3000] likely on CPU 0 (core 0, socket 0)
Sep  7 19:01:52 Tower kernel: Code: ff e9 29 fe ff ff 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 00 55 53 48 83 ec 08 48 8b 1f 80 3b 02 0f 84 8e 00 00 00 48 8b 7b 08 <f6> 47 04 40 74 14 48 83 7b 10 00 74 20 f6 43 07 02 74 67 48 83 c4
Sep  7 19:03:53 Tower kernel: smartctl_type[21256]: segfault at 4 ip 000000000094a603 sp 00007fffe4deef78 error 4 in php[600000+3b3000] likely on CPU 3 (core 1, socket 0)
Sep  7 19:03:53 Tower kernel: Code: 80 00 05 00 00 00 00 00 00 c3 66 66 2e 0f 1f 84 00 00 00 00 00 66 90 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 48 8b 3f <f6> 47 04 80 74 07 e9 52 a7 cb ff 66 90 e9 ab 83 f5 ff 66 66 2e 0f
Sep  7 21:22:56 Tower webGUI: Successful login user root from 192.168.68.56
Sep  7 21:22:56 Tower webGUI: Invalid .page format: webGui/SMBWorkGroup.page
Sep  7 21:22:56 Tower webGUI: Invalid .page format: webGui/TrimSettings.page
Sep  7 21:23:04 Tower webGUI: Invalid .page format: webGui/SMBWorkGroup.page
Sep  7 21:23:04 Tower webGUI: Invalid .page format: webGui/TrimSettings.page
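
To make the pattern in the excerpt above easier to see, here is a minimal Python sketch that tallies the segfault lines by CPU, by faulting binary, and by error code. It is only an illustration: the function names (`decode_error`, `parse_segfaults`, `summarize`) are made up for this post, the regular expression is written against the line format shown above, and the flag decoding assumes the standard x86 page-fault error code bits, with the kernel printing the code in hexadecimal. Under that assumption, `error 14` would be a user-mode instruction fetch from an unmapped page (consistent with `ip 0000000000000000`), and `error 4` a user-mode read of an unmapped page.

```python
import re
from collections import Counter

# Matches kernel segfault lines in the format shown in the syslog excerpt.
SEGFAULT_RE = re.compile(
    r"kernel: (?P<comm>\S+)\[(?P<pid>\d+)\]: segfault at (?P<addr>[0-9a-f]+) "
    r"ip (?P<ip>[0-9a-f]+) sp (?P<sp>[0-9a-f]+) error (?P<err>[0-9a-f]+) "
    r"in (?P<binary>[^\[\s]+)\[[0-9a-f]+\+[0-9a-f]+\]"
    r"(?: likely on CPU (?P<cpu>\d+))?"
)


def decode_error(code_hex):
    """Decode an x86 page-fault error code (assumed to be printed in hex)."""
    code = int(code_hex, 16)
    return {
        "protection_violation": bool(code & 0x01),  # False means page not present
        "write": bool(code & 0x02),                 # False means read access
        "user_mode": bool(code & 0x04),
        "instruction_fetch": bool(code & 0x10),
    }


def parse_segfaults(lines):
    """Return one dict per segfault line; all other lines are ignored."""
    events = []
    for line in lines:
        match = SEGFAULT_RE.search(line)
        if match:
            event = match.groupdict()
            event["flags"] = decode_error(event["err"])
            events.append(event)
    return events


def summarize(events):
    """Count segfaults per CPU, per faulting binary, and per error code."""
    return {
        "by_cpu": Counter(event["cpu"] for event in events),
        "by_binary": Counter(event["binary"] for event in events),
        "by_error": Counter(event["err"] for event in events),
    }
```

To try it, read the saved syslog into a list of lines (for example with `open(path).read().splitlines()`, where `path` is wherever the syslog was saved) and pass that list to `parse_segfaults`, then to `summarize`. In the excerpt above the faults are spread across CPUs 0 to 3 and across both `smartctl` and `php`, so a tally like this mainly shows that the problem is not tied to one core or one program; it cannot by itself tell bad RAM from a bad CPU.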

 

Additionally, I've attached the diagnostics file to this post.  Thank you for your help and support.

tower-diagnostics-20240908-0047.zip

Posted

Memtest ran for 13 hours and found no errors.  According to previous discussions, segfault errors like these are hardware related, so I have ordered another single 16GB RAM module for my server.  Fingers crossed.

Screenshot 2024-09-08 at 9.47.43 PM.png
