gcseed Posted March 9, 2021 Share Posted March 9, 2021 (edited) Hi guys, I had this problem before and I thought it was because the controller card overheats.. I have tried resit the controller card, changing out new cables etc, and it was stable for about 200 days until today... I was transferring some larger files from disk to disk, then it started freezing again. I then force reboot the server and now it keeps on freezing every time I run a parity check. Here is the syslog: My controller card is LSI Logic SAS 9207-8i with a 40mm fan blowing directly onto the heatsink Mar 8 18:47:00 Mancave root: Fix Common Problems Version 2021.03.07 Mar 8 18:47:00 Mancave root: Fix Common Problems: Error: parity (ST10000VN0004-1ZD101_ZA2C4EJ1) is disabled Mar 8 18:47:00 Mancave root: Fix Common Problems: Error: parity2 (ST10000VN0004-1ZD101_ZA2C9K53) is disabled Mar 8 18:47:12 Mancave ntpd[1710]: kernel reports TIME_ERROR: 0x41: Clock Unsynchronized Mar 8 20:17:02 Mancave kernel: mpt2sas_cm0: SAS host is non-operational !!!! Mar 8 20:17:03 Mancave kernel: mpt2sas_cm0: SAS host is non-operational !!!! Mar 8 20:17:04 Mancave kernel: mpt2sas_cm0: SAS host is non-operational !!!! Mar 8 20:17:05 Mancave kernel: mpt2sas_cm0: SAS host is non-operational !!!! Mar 8 20:17:06 Mancave kernel: mpt2sas_cm0: SAS host is non-operational !!!! Mar 8 20:17:07 Mancave kernel: mpt2sas_cm0: SAS host is non-operational !!!! Mar 8 20:17:07 Mancave kernel: mpt2sas_cm0: _base_fault_reset_work: Running mpt3sas_dead_ioc thread success !!!! Mar 8 20:17:07 Mancave kernel: sd 9:0:0:0: [sdb] Synchronizing SCSI cache Mar 8 20:17:07 Mancave kernel: sd 9:0:0:0: [sdb] Synchronize Cache(10) failed: Result: hostbyte=0x01 driverbyte=0x00 Mar 8 20:17:07 Mancave kernel: sd 9:0:1:0: [sdc] tag#8360 UNKNOWN(0x2003) Result: hostbyte=0x01 driverbyte=0x00 cmd_age=6s Mar 8 20:17:07 Mancave kernel: sd 9:0:1:0: [sdc] tag#8360 CDB: opcode=0x88 88 00 00 00 00 00 a6 d3 c0 a8 00 00 04 00 00 00 Mar 8 20:17:07 Mancave kernel: blk_update_request: I/O error, dev sdc, sector 2798895272 op 0x0:(READ) flags 0x0 phys_seg 128 prio class 0 Please help, because I have some very important files and I've been working on... after 5 force restarts, I am literally afraid to start up the server again... Many thanks! Edited March 9, 2021 by gcseed Quote Link to comment
JorgeB Posted March 9, 2021 Share Posted March 9, 2021 Try a different PCIe slot if available, if the same try another HBA if you can. Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.