December 8, 2021
My new 16TB Exos parity drive keeps going into a disabled state and I'm not sure why.
Here's what I've done to try to troubleshoot:
1. Switched SATA cables
2. Moved the drive to different slots
3. Pulled the drive and updated the firmware
4. Did Extended Self-test (passed successfully)
5. Did an offline (outside of Unraid) surface scan test (no bad blocks reported)
I'm stumped, any ideas? Here are my log files:
thetower-diagnostics-20211208-1715.zip
December 8, 2021 Community Expert
5 minutes ago, Sniper00X said: 4. Did Extended Self-test (passed successfully)
Not recently
SMART Extended Self-test Log Version: 1 (1 sectors)
Num  Test_Description    Status                    Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed without error         00%             1900  -
# 2  Extended offline    Completed without error         00%              332  -
December 8, 2021 Author
I literally did it yesterday, not on the Unraid server but in another computer that I was using to test the drive.
Do the SMART runs not get updated on the drive itself?
December 8, 2021 Community Expert
2 minutes ago, Sniper00X said: Do the SMART runs not get updated on the drive itself?
Yes, but it's unclear what test the other computer ran.
December 8, 2021 Author
The only other thing I can think of: do you think my SAS controller may need a firmware update to run a 16TB drive properly? This is the first 16TB drive that I've put into the system, probably a couple of months ago, and I've been having issues on and off.
Controller version:
04:00.0 Serial Attached SCSI controller: Broadcom / LSI SAS2008 PCI-Express Fusion-MPT SAS-2 [Falcon] (rev 03)
December 8, 2021 Community Expert
33 minutes ago, Sniper00X said: Did Extended Self-test
How long did this take? I would expect 16TB to take most of a day or so.
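As a rough sanity check on the "most of a day" expectation, here is a minimal sketch that estimates how long one full read pass over the drive would take. The 200 MB/s average throughput is an assumption for illustration only, not a figure from this thread or from the drive's datasheet, and the function name is hypothetical. The drive reports its own estimate as the extended self-test polling time in `smartctl -c`, which is the better number to trust.

```python
def estimate_full_read_hours(capacity_tb: float, avg_mb_per_s: float) -> float:
    """Estimate hours needed to read an entire drive once at a steady rate."""
    capacity_bytes = capacity_tb * 1e12          # decimal terabytes, as drives are marketed
    seconds = capacity_bytes / (avg_mb_per_s * 1e6)
    return seconds / 3600


# Assumed average throughput of 200 MB/s (hypothetical value).
hours = estimate_full_read_hours(16, 200)
print(f"{hours:.1f} hours")  # prints "22.2 hours"
```

An extended test that finishes in an hour or two on a 16TB drive would therefore suggest that a different, shorter test was actually run.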
December 9, 2021 Author
Should I run the extended SMART test again? This time while mounted in Unraid?
December 9, 2021 Author
Woke up this morning to the following error:
Last SMART test result: Interrupted (host reset)
I've attached the new diag files:
thetower-diagnostics-20211209-0815.zip
December 9, 2021 Author
OK, did that and re-kicked off the test about 2 hours ago. It's still saying it's running and at 10%.
Will let it run its course and report back.
December 9, 2021 Community Expert
4 minutes ago, Sniper00X said: OK, did that and re-kicked off the test about 2 hours ago. It's still saying it's running and at 10%. Will let it run its course and report back.
I think the progress only tends to update in 10% increments.
December 10, 2021 Author
OK, it says the test completed without error:
Num  Test_Description    Status                    Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error         00%             1962  -
# 2  Extended offline    Interrupted (host reset)        00%             1929  -
# 3  Short offline       Completed without error         00%             1900  -
# 4  Extended offline    Completed without error         00%              332  -
I've attached the new diag file. The drive is still spun down and in a disabled state; I haven't changed anything. Awaiting advice.
thetower-diagnostics-20211210-1625.zip
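For anyone who wants to check a self-test log like this one without reading it by eye, here is a minimal sketch that parses the rows and picks out the most recent completed extended test. The row values are copied from the post above; the column spacing (at least two spaces between the test name and the status) and the names `ROW`, `LOG`, and `parse_selftest_log` are assumptions made for this example, not part of smartctl itself.

```python
import re

# Rows copied from the post above; the spacing is an assumed layout.
LOG = """\
# 1  Extended offline    Completed without error       00%      1962         -
# 2  Extended offline    Interrupted (host reset)      00%      1929         -
# 3  Short offline       Completed without error       00%      1900         -
# 4  Extended offline    Completed without error       00%       332         -
"""

# One log row: "# N", test name, status, remaining %, lifetime hours, first-error LBA.
ROW = re.compile(
    r"^#\s*(?P<num>\d+)\s+(?P<test>.+?)\s{2,}(?P<status>.+?)"
    r"\s+(?P<remaining>\d+)%\s+(?P<hours>\d+)\s+(?P<lba>\S+)\s*$"
)


def parse_selftest_log(text):
    """Return one dict per self-test row, skipping lines that do not match."""
    rows = []
    for line in text.splitlines():
        m = ROW.match(line.strip())
        if m:
            rows.append({
                "num": int(m.group("num")),
                "test": m.group("test"),
                "status": m.group("status"),
                "hours": int(m.group("hours")),
                "lba": m.group("lba"),
            })
    return rows


rows = parse_selftest_log(LOG)
completed_extended = [
    r for r in rows
    if r["test"] == "Extended offline" and r["status"].startswith("Completed")
]
latest = max(completed_extended, key=lambda r: r["hours"])
print(f"Latest completed extended test: {latest['hours']} power-on hours")
```

Comparing that hour count with the drive's current power-on hours shows whether the passing result is recent. In the first log posted in this thread, the only extended test was recorded at 332 hours while the short test was at 1900, which is why the earlier pass was described as not recent.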
December 10, 2021 Community Expert
Disk looks good. Is this the only controller?
04:00.0 Serial Attached SCSI controller [0107]: Broadcom / LSI SAS2008 PCI-Express Fusion-MPT SAS-2 [Falcon] [1000:0072] (rev 03)
    Subsystem: Broadcom / LSI SAS2008 PCI-Express Fusion-MPT SAS-2 [Falcon] [1000:3020]
    Kernel driver in use: mpt3sas
    Kernel modules: mpt3sas
Maybe a cable or power issue?
December 11, 2021 Author
Yes, that's the only controller. The onboard SATA ports are unused.
OK, I think it's time I unrack the server, crack it open, and do some troubleshooting.
Will report back with the steps.