Vossy Posted March 2, 2021 Share Posted March 2, 2021 Hello, Not sure what's goings on here. I have been using UNRAID for a few years and going all fine up until a few months ago, lost a drive and when trying to replace found mobo not seeing all ram, further investigation reviled defect mobo. Removed all equipment m(SAS contollers, etc (not RAM or CPU)) used old (Spare) mobo and back up and running all data assessable and present. Showed failures on two drives whilst waiting for new mobo to arrive, ordered replacement disk (was a little shocked as one disk was only 18 months old). Got new board installed and changed out failed disks, started getting random failures on different drives, suspected new Mobo (it was used of EBAY), again no seeing all RAM as well. Put back into old (Spare) and again now seeing random failures of disks. Disks always test out fine in smart tests etc. Using TV show XYZ as example (it has 3 seasons). Now when I browse through windows or plex or unraid UI share it only Season 2 - episode 12, but when I go main and click on another disk and drill down (if it contains that show) it will show some episodes... At a loss any help or advice would be great. tower-diagnostics-20210303-0445.zip Quote Link to comment
JorgeB Posted March 3, 2021 Share Posted March 3, 2021 There's no syslog in your diags, try again or post just the syslog. Quote Link to comment
Vossy Posted March 3, 2021 Author Share Posted March 3, 2021 Ill try again.. tower-syslog-20210303-0935.zip tower-diagnostics-20210303-1726.zip Quote Link to comment
Vossy Posted March 3, 2021 Author Share Posted March 3, 2021 (edited) Any ideas, if I clear these two, another different two fail....this time it is 5 and 7...if I had hair id be pulling it out... tower-diagnostics-20210303-1743.zip tower-syslog-20210303-0943.zip Edited March 3, 2021 by Vossy Quote Link to comment
Vossy Posted March 3, 2021 Author Share Posted March 3, 2021 reboot and all files back but still drives now unmountable .. tower-diagnostics-20210303-1751.zip tower-syslog-20210303-0951.zip Quote Link to comment
JorgeB Posted March 3, 2021 Share Posted March 3, 2021 Doesn't look like a disk problem, I would start by updating LSI firmware to latest. Quote Link to comment
Vossy Posted March 3, 2021 Author Share Posted March 3, 2021 Thanks JorgeB vie done that and will let you know. Quote Link to comment
Vossy Posted March 3, 2021 Author Share Posted March 3, 2021 Nope now disk 1 and 7 failed again... It seems that is likes two disk failed all the time.... tower-syslog-20210303-1222.zip tower-diagnostics-20210303-2022.zip Quote Link to comment
JorgeB Posted March 3, 2021 Share Posted March 3, 2021 Reboot and post new diags. Quote Link to comment
Vossy Posted March 3, 2021 Author Share Posted March 3, 2021 As requested tower-syslog-20210303-2018.zip tower-diagnostics-20210304-0418.zip Quote Link to comment
JorgeB Posted March 4, 2021 Share Posted March 4, 2021 Since the disks look healthy probably better to do a new config and re-sync parity, if there are still disk issues it could be bad controller, cables. power, etc. Quote Link to comment
Vossy Posted March 4, 2021 Author Share Posted March 4, 2021 I have done a new config a few times but not re-sync, is it worth doing it and re-syncing first before looking else? Or if I have had the failures after a new config it is more than likely a cable, backplane or card issue? Quote Link to comment
Vossy Posted March 7, 2021 Author Share Posted March 7, 2021 So ran for a few days with not Parity disk, not a single error as soon as I put on in and tired to do a sync errors on 2 x drives (tired a few times different drive each time). Tired with only one parity disk in various locations/slots (I have a NORCO 4224 case, through 3 x LSI card using 6 SAS cables). Still same result. Removed the Parity drives and no read errors. tower-diagnostics-20210307-1111.zip tower-syslog-20210307-0311.zip Quote Link to comment
Maticks Posted March 7, 2021 Share Posted March 7, 2021 (edited) I've run into this before you need to pull the boot bios from the LSI Cards they fight for the boot prompt. you will also find when you boot up it pauses for a few seconds longer than normal. sas2flash -o -e 5 to erase the boot services area of the flash chip. (thats the one on my 9211 cards for section 5 being the BIOS). You need to pull the boot bios from all 3 cards, you only need the boot bios if you plan to boot on a drive on the LSI Controller which you are booting off the USB so you don't need it. I don't know why it throws random errors but if you ran with only 1 LSI Card you will have no issues. Edited March 7, 2021 by Maticks Quote Link to comment
Vossy Posted March 7, 2021 Author Share Posted March 7, 2021 No sure they have BIOS, been running fine for years, now this.... see below LSI Corporation SAS2 Flash Utility Version 20.00.00.00 (2014.09.18) Copyright (c) 2008-2014 LSI Corporation. All rights reserved Adapter Selected is a LSI SAS: SAS2008(B2) Num Ctlr FW Ver NVDATA x86-BIOS PCI Addr ---------------------------------------------------------------------------- 0 SAS2008(B2) 20.00.07.00 14.01.00.08 No Image 00:09:00:00 1 SAS2008(B2) 20.00.07.00 14.01.00.08 No Image 00:08:00:00 2 SAS2008(B2) 20.00.07.00 14.01.00.08 No Image 00:02:00:00 Finished Processing Commands Successfully. Exiting SAS2Flash. Quote Link to comment
JorgeB Posted March 7, 2021 Share Posted March 7, 2021 7 hours ago, Vossy said: Tired with only one parity disk in various locations/slots (I have a NORCO 4224 case, through 3 x LSI card using 6 SAS cables). Still same result. The errors were not on the parity drive, though they are more likely to happen during a parity sync due to the heavy IO, do the errors happen to any disk on the different controllers? Maybe they are limited to a controller or backplane? Quote Link to comment
chizll Posted March 10, 2021 Share Posted March 10, 2021 (edited) I don't post much and usually when I go on to forums I am the one asking for help. I had a similar issue where I updated to Unraid 6.9.0 and after I rebooted, Disk 5 was not found. I tried several reboots but to no avail. Eventually, I powered down the system, opened up the server chassis, unplugged Disk 5's data and power connection, re-seated the LSI 9211-8i expansion card. Started it back up and viola! Disk 5 available again. Not sure if the update caused it, but decided to post my solution in case someone else has the same issue. Edited March 10, 2021 by chizll Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.