• No SMART reports for some cache SSDs


    luzfcb
    • Minor

    Hello, on my Unraid 6.12.8 server I have two 1 TB NVMe SSDs from the same manufacturer, acting as a cache pool using BTRFS.

    I recently noticed in the logs that apparently one of the SSDs might be dying.

    Mar 13 18:35:13 f kernel: btrfs_end_super_write: 282 callbacks suppressed
    Mar 13 18:35:13 f kernel: BTRFS warning (device nvme0n1p1): lost page write due to IO error on /dev/nvme0n1p1 (-5)
    Mar 13 18:35:13 f kernel: BTRFS error (device nvme0n1p1): bdev /dev/nvme0n1p1 errs: wr 326893147, rd 40441477, flush 23342600, corrupt 3, gen 0
    Mar 13 18:35:13 f kernel: BTRFS error (device nvme0n1p1): error writing primary super block to device 1
    Mar 13 18:35:13 f kernel: BTRFS error (device nvme0n1p1): bdev /dev/nvme0n1p1 errs: wr 326893148, rd 40441477, flush 23342600, corrupt 3, gen 0
    Mar 13 18:35:13 f kernel: BTRFS error (device nvme0n1p1): bdev /dev/nvme0n1p1 errs: wr 326893149, rd 40441477, flush 23342600, corrupt 3, gen 0
    Mar 13 18:35:13 f kernel: BTRFS warning (device nvme0n1p1): lost page write due to IO error on /dev/nvme0n1p1 (-5)
    Mar 13 18:35:13 f kernel: BTRFS error (device nvme0n1p1): error writing primary super block to device 1
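    For reference, BTRFS keeps the cumulative per-device counters shown in those `errs: wr ... rd ...` lines, and they can be read with `btrfs device stats`. A minimal sketch of flagging the nonzero counters, using a captured sample of that output so the script is self-contained (on the live pool you would pipe in `btrfs device stats /mnt/cache` instead):

    ```shell
    #!/bin/sh
    # Sketch: flag nonzero BTRFS per-device error counters.
    # "sample" below mimics `btrfs device stats /mnt/cache` output; on a
    # real system, pipe the actual command output into the awk filter.
    sample='[/dev/nvme0n1p1].write_io_errs    326893149
    [/dev/nvme0n1p1].read_io_errs     40441477
    [/dev/nvme0n1p1].flush_io_errs    23342600
    [/dev/nvme0n1p1].corruption_errs  3
    [/dev/nvme0n1p1].generation_errs  0
    [/dev/nvme1n1p1].write_io_errs    0
    [/dev/nvme1n1p1].read_io_errs     0'

    # Print every counter that is greater than zero.
    printf '%s\n' "$sample" | awk '$2 > 0 { print "NONZERO:", $1, $2 }'
    ```

    Counters this high usually mean the device has been failing writes for a long time; `btrfs device stats -z` resets them after the underlying problem is fixed.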



    [screenshot attached]

    When I click the device name, go to the Self-Test tab, and click Download, a .zip file is downloaded containing a TXT file with the result of the SMART analysis.

    It turns out that only for one of the SSDs does the TXT file contain the SMART report; for the other SSD, the TXT file contains only:

     

    smartctl 7.4 2023-08-01 r5530 [x86_64-linux-6.1.74-Unraid] (local build)
    Copyright (C) 2002-23, Bruce Allen, Christian Franke, www.smartmontools.org
    
    Smartctl open device: /dev/nvme0 failed: No such device
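    The "No such device" error suggests the GUI is still pointing smartctl at `/dev/nvme0` even though that controller node is gone. As a sketch, one could enumerate the NVMe controller nodes that actually exist before invoking smartctl; the `list_nvme` helper and its directory parameter are mine, added so the logic can also be exercised against a fake `/dev`:

    ```shell
    #!/bin/sh
    # Sketch: only run smartctl against NVMe controller nodes that exist.
    # list_nvme scans a directory for nvme controller nodes (nvme0, nvme1,
    # ...); the directory is a parameter purely for testability.
    list_nvme() {
        dir=$1
        for d in "$dir"/nvme[0-9]; do
            # Guard against the glob not matching anything (literal remains).
            [ -e "$d" ] && printf '%s\n' "$d"
        done
    }

    # On a real system:
    #   for dev in $(list_nvme /dev); do smartctl -H "$dev"; done
    list_nvme /dev
    ```

    On the affected server this would print only `/dev/nvme1`, confirming the controller itself (not just the partition) has disappeared.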

     

    Questions:

    1 - Is this a bug?
    2 - Why is apparently only one of the SSDs being used? (Cache 2 is used but the )
    3 - In the Self-Test tab of Cache 2, clicking SMART error log displays this error:

    Num   ErrCount  SQId   CmdId  Status  PELoc          LBA  NSID    VS  Message
      0        106     0  0x0001  0x4004      -            0     0     -  Invalid Field in Command


    Why is the error not counted in "Pool Devices" -> "Errors" column?



    Some extra information:

     

    root@f:~# cat /etc/unraid-version 
    version="6.12.8"


     

    root@f:~# mount | grep nvm
    /dev/nvme0n1p1 on /mnt/cache type btrfs (rw,noatime,ssd,discard=async,space_cache=v2,subvolid=5,subvol=/)

     

    root@f:~# ls /dev/ | grep nvm
    nvme1
    nvme1n1
    nvme1n1p1
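    Since `/dev/nvme0*` has vanished entirely while `nvme1` remains, the controller has dropped off the bus. As a general Linux sketch (not Unraid-specific advice): the kernel exposes `/sys/bus/pci/rescan`, which can sometimes re-detect a dropped device without a reboot, though a full power cycle is often still required:

    ```shell
    #!/bin/sh
    # Sketch: request a PCIe bus rescan so a dropped NVMe controller may be
    # re-probed. /sys/bus/pci/rescan is a standard Linux sysfs interface;
    # whether the drive actually comes back depends on why it dropped.
    pci_rescan() {
        knob=/sys/bus/pci/rescan
        if [ -w "$knob" ]; then
            echo 1 > "$knob"        # triggers the rescan (needs root)
            echo "rescan requested"
        else
            echo "cannot write $knob (not root, or sysfs unavailable)"
        fi
    }

    pci_rescan
    # Afterwards, check whether the missing controller reappeared:
    #   ls /dev/ | grep nvm
    ```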



    Cache 1:

    [screenshot attached]

    Cache 2:

    [screenshot attached]


    User Feedback

    Recommended Comments

    Quote

    It turns out that only on 1 of the SSDs, the TXT file contains the SMART test report, on the other SSD the TXT file only contains

     

    That, and all the other errors above, suggest the device dropped offline; please post the diagnostics.

    Link to comment

    Same issue here. RAID-1 SSD cache, 2 drives. One seems to be completely gone; there is no device under /dev anymore.

    Unraid reports "Healthy" and did not send any notification. BTRFS seems to be shot; again, no error is visible anywhere.

    This is highly dangerous behavior, to be honest. How can Unraid report "Healthy" SMART when there isn't even any SMART data?

    [screenshot attached]

     

    [screenshot attached]

     

    The only indication is the missing temperature and the weirdly low number of reads/writes. The drive is not in the system anymore.

     

    Notice the lack of /dev/sdf:

    [screenshot attached]

     

    The discrepancy becomes visible in the UI as well, when looking at the SMART attributes for that drive:

    [screenshot attached]

     

    Again, how is this reported as "Healthy" with no notification whatsoever... I probably lost significant data, because the first drive of the cache seems to have a corrupted BTRFS as well.

     

    The file system check in the UI seems useless as well. The UI shows this (currently running):

    [screenshot attached]

     

    0 errors; nice, isn't it.

     

    This is the current log; /dev/sdd seems shot too:

    [screenshot attached]

     

    Tens of thousands of BTRFS errors...

     

    At this point I don't trust the UI for either SMART or BTRFS status anymore.

     

    A serious problem, from my point of view.

     

    Maik.

     

    Link to comment
    15 minutes ago, Maik75 said:

    Tens of thousands of BTRFS errors...

    Same issue: one of your devices dropped offline. Unraid does not currently monitor pool devices; it's an old feature request of mine. For now, see here for better pool monitoring.
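    Until such monitoring lands natively, a user script along these lines could watch the pool's cumulative error counters and raise an alert. This is only a sketch: the pool path and the `notify` helper location and flags are my assumptions about a stock Unraid install, so verify them on your own system before relying on this:

    ```shell
    #!/bin/sh
    # Sketch of a pool-monitoring user script (e.g. run on a cron schedule
    # via the User Scripts plugin). Counts nonzero counters in
    # `btrfs device stats` output and raises an Unraid notification.
    # ASSUMPTIONS: /mnt/cache is the btrfs pool, and
    # /usr/local/emhttp/webGui/scripts/notify is Unraid's notify helper.
    POOL=/mnt/cache
    NOTIFY=/usr/local/emhttp/webGui/scripts/notify

    # count_nonzero reads `btrfs device stats` style lines on stdin and
    # prints how many counters are above zero.
    count_nonzero() {
        awk '$2 > 0' | wc -l
    }

    errors=$(btrfs device stats "$POOL" 2>/dev/null | count_nonzero)
    if [ "${errors:-0}" -gt 0 ]; then
        msg="$errors nonzero BTRFS error counter(s) on $POOL"
        if [ -x "$NOTIFY" ]; then
            "$NOTIFY" -i alert -s "BTRFS pool errors" -d "$msg"
        else
            echo "ALERT: $msg"
        fi
    fi
    ```

    Note this only catches accumulating BTRFS errors; a device that drops cleanly off the bus would also need a check that its /dev node still exists.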

    Link to comment

    I could accept the BTRFS issue; however, what about the "SMART Healthy" information? If you display utterly wrong information, then why display SMART for pool devices at all? This is misleading at the least.

    Your own UI display for SMART details shows "failed", by the way; this I consider a bug. Why show "failed" in one part of the UI and "Healthy" in another?

     

    Edited by Maik75
    Link to comment
    7 minutes ago, Maik75 said:

    however what about the "SMART Healthy" information?

    Since the device dropped offline there is no SMART data; you need to reconnect the device to get that again.

    Link to comment

    Yes, I understand that. Displaying "Healthy" for the drive in question, however, seems like a serious UI bug, wouldn't you agree?

     

    Link to comment
    1 hour ago, Maik75 said:

    Displaying "Healthy" for the drive in question, however, seems like a serious UI bug, wouldn't you agree?

    I do; the only clue you get for now when a pool device drops is that it stops showing the temp.

    Link to comment



