October 3, 20178 yr Diags attached. I've replaced cables.... At this point I think it may be a backplane issue. Comments welcome. ffs2-diagnostics-20171003-1609.zip
October 4, 20178 yr Community Expert Almost all your disks have several UDMA CRC errors, this is very unusual, while 1 or 2 disks with errors would suggest bad SATA cables, almost all suggests a more serious issue, backplane is a strong possibility.
October 4, 20178 yr Author Thanks Johnnie. Disks are spread out over three backplanes and two three controllers (onboard plus two PCIe controllers) Motherboard or power supply issue maybe?
October 4, 20178 yr Community Expert Power supply is a strong possibility, first thing you should do is to add 199 to the monitored SMART attributes so you get notified if/when more errors increase, then start ruling things out.
October 4, 20178 yr Author Power supply was recently upgraded, but it is a well used unit with a lot of miles on it. How do I add the extra SMART attributes that you mentioned?
October 4, 20178 yr Community Expert 15 minutes ago, tucansam said: Power supply was recently upgraded, but it is a well used unit with a lot of miles on it. PSU is a possibility but this type o error is usually connection related, cable, backplane or controller are the most probable causes. 17 minutes ago, tucansam said: How do I add the extra SMART attributes that you mentioned? Settings -> Disk Settings -> Global SMART Settings -> Default SMART attribute notifications
October 4, 20178 yr Author Well the controller is an LSI 8-port, upgraded from another (SuperMicro) card from a while ago. The common thread between all iterations of this system are the SATA break-out cables that connect to the two ports on the controller. Perhaps I'll start there since its the cheapest thing to test.
October 5, 20178 yr Author So I turned on the additional SMART attribute yesterday..... All disks have errors. This can't be good. Its spread out across all backplanes, all controllers, all cables. So the common denominators are memory, motherboard, and power supply. ffs2-diagnostics-20171005-0752.zip
October 6, 20178 yr Community Expert Work normally and check if those CRC errors continue to increase and if so you need to start ruling things out.
Archived
This topic is now archived and is closed to further replies.