DigiDuncan Posted May 14, 2023 Share Posted May 14, 2023 I'll do my best to detail what happened as well as I can, but I'd be lying if I said I wasn't frazzled right now. For a few days, Disk 1 was showing disabled with contents emulated, but with no SMART errors. This happened after my SAS card overheated which necessitated a restart. Files on shares were not showing up in File Explorer and giving Disk I/O errors in Linux. I ordered new drives, but they are not in yet. Just now, the server crawled to a halt as services and containers stopped responding before the entire machine locked up and I was unable to do anything except power down the machine by force. My concern now is whether or not this machine is even safe to turn on, lest I lose all the data that was stored on Drive 1 (which is at least half my data.) Please, any advice would be really helpful right now. digiserver-diagnostics-20230510-0455(1).zip Quote Link to comment
trurl Posted May 14, 2023 Share Posted May 14, 2023 On mobile now so haven't looked at diagnostics yet. Do you have backups of anything important and irreplaceable? Quote Link to comment
DigiDuncan Posted May 14, 2023 Author Share Posted May 14, 2023 I do not, unfortunately. I know, I know. It's bad, but I'm really new to this and I just didn't have the funds to have a full backup yet. Quote Link to comment
trurl Posted May 14, 2023 Share Posted May 14, 2023 39 minutes ago, DigiDuncan said: a full backup You don't have to have a full backup, but you must always have another copy of anything important and irreplaceable. You get to decide what qualifies. Quote Link to comment
DigiDuncan Posted May 14, 2023 Author Share Posted May 14, 2023 1 minute ago, trurl said: [...] you must always have another copy of anything important and irreplaceable. I know. I'm sorry that I didn't have that yet; this server was multi-purpose, but it was bulk storage for a lot of important files (including every stream VOD I've ever made) and I just didn't have a good solution yet for storing them off-site. If and when this is fixed, that will be made as soon as possible. I don't have a lot of money, but I'll do what I can. Quote Link to comment
trurl Posted May 14, 2023 Share Posted May 14, 2023 Looks like all of your array disks have disconnected, probably that controller problem. I think there's a pretty good chance your data is OK but you need to get those disks connected to a working controller again and post new diagnostics. Quote Link to comment
JorgeB Posted May 15, 2023 Share Posted May 15, 2023 May 9 17:26:25 DigiServer kernel: mpt2sas_cm1: SAS host is non-operational !!!! A reboot should bring the array back up, possibly with a disabled disk, then make sure the HBA is well seated and sufficiently cooled, you can also try a different PCIe slot if available Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.