Jump to content
  • Wrong Disk Utilization in WebGUI


    Lukáš Řádek
    • Closed

    I got an alert saying "Disk 3 returned to a normal utilization level" which surprised me since it was quite high before and I didn't delete anything (substantial) from it. I have done some writes and deletes to a share that is located on that drive (among others).

     

    I checked the WebUI's Main page which said that the disk is at 2 TB used out of 8 TB total, which alarmed me quite a bit because before it was somewhere around 7 TB used.

     

    Since I have just one main share that holds all my data, I went to Shares page and computed the share's size. That turned out to show the correct disk space used - Disk 3 has 7.06 TB used (which is correct) BUT 5.97 TB free (which is impossible since it is 8 TB drive).

    image.thumb.png.df516af0beb730cbdb47f5d10a55b9d9.png

     

    Windows (via Samba) and Krusader also show the correct disk space used.

    BUT when inspecting disk's properties in Krusader, it shows the incorrect values again (in contrast to the Size in the same dialog).

     

    image.png.e41adaa1b76d2f037603263fdaa3acd7.png

     

     

    SMART short test of the disk ran without a problem.

    Other disks behave as normal.

     

    What do you make of this and what do you suggest I do?

     

    I assume this might not neccessarily be an UnRAID bug, but at least UnRAID could handle such situations and notify user that the filesystem is corrupted... possibly???

     

     

    I am marking this as urgent since it can lead to false alarms and possible overreactions presuming data loss.




    User Feedback

    Recommended Comments

    Alright, that fixed it, thank you 🙂.

     

    Follow up: What to do now? Can I just carry on since file system is not included in parity (don't know it that is the case) or can the parity be potentially affected? In that case should I trust parity or the data and rebuild?

    Link to comment

    File system corruption is most often caused by un unclean shutdown, it can also be caused by hardware issues,  e.g., bad RAM, if it keeps happening without an apparent reason you'll need to investigate.

    Link to comment
    On 9/1/2023 at 9:59 AM, JorgeB said:

    File system corruption is most often caused by un unclean shutdown, it can also be caused by hardware issues,  e.g., bad RAM, if it keeps happening without an apparent reason you'll need to investigate.

     

    Sure. But can I consider this case fixed by running the xfs file system repair? Or does it somehow affect parity?

    Link to comment
    2 hours ago, Lukáš Řádek said:

     

    Sure. But can I consider this case fixed by running the xfs file system repair? Or does it somehow affect parity?

    If you run a xfs file system repair via the GUI this will update parity as it is doing any repair.   

     

    Having said that it is possible that a parity check may find a small number of errors that need correcting after an unclean shuitdown as some writes at the unclean shutdown stage did not make it to the parity drive as well.

    Link to comment
    On 9/3/2023 at 3:02 PM, itimpi said:

    If you run a xfs file system repair via the GUI this will update parity as it is doing any repair.   

     

    Having said that it is possible that a parity check may find a small number of errors that need correcting after an unclean shuitdown as some writes at the unclean shutdown stage did not make it to the parity drive as well.

     

    Thanks. A situation that I feared with parity system is how to know if the parity error actually means that there is bad data on the parity drive (for example due to unclean shutdown) or on the data drive since that is the one that has been corrupted.

    Link to comment
    1 hour ago, Lukáš Řádek said:

    Thanks. A situation that I feared with parity system is how to know if the parity error actually means that there is bad data on the parity drive (for example due to unclean shutdown) or on the data drive since that is the one that has been corrupted.

    You cannot.   Parity is not that clever - it only knows that something has gone wrong somewhere.

     

    The sequence of the way that the writes happen that as long as a drive has not failed a lost write to the parity drive is far more likely than the data drive so this is the assumption that is made.   The only way to make sure that a data drive does not have bad data is to either have checksums for the data or use BTRFS or ZFS as the file system type as they have built in check-summing.

     

    This is one of the reasons we recommend that the scheduled parity check is set to be non-correcting so that you only run a correcting check when you think none of your data drives has a problem.

    Link to comment
    On 8/31/2023 at 2:06 PM, JorgeB said:

    First thing is to check filesystem on that disk, assuming it' XFS, run it without -n.

    After a moment of terror, i ran the filesystem check which corrected it for me, too 

    Thanks for the easy solution! :)

    Edited by Jabberwocky
    • Like 1
    Link to comment


    Join the conversation

    You can post now and register later. If you have an account, sign in now to post with your account.
    Note: Your post will require moderator approval before it will be visible.

    Guest
    Add a comment...

    ×   Pasted as rich text.   Restore formatting

      Only 75 emoji are allowed.

    ×   Your link has been automatically embedded.   Display as a link instead

    ×   Your previous content has been restored.   Clear editor

    ×   You cannot paste images directly. Upload or insert images from URL.


  • Status Definitions

     

    Open = Under consideration.

     

    Solved = The issue has been resolved.

     

    Solved version = The issue has been resolved in the indicated release version.

     

    Closed = Feedback or opinion better posted on our forum for discussion. Also for reports we cannot reproduce or need more information. In this case just add a comment and we will review it again.

     

    Retest = Please retest in latest release.


    Priority Definitions

     

    Minor = Something not working correctly.

     

    Urgent = Server crash, data loss, or other showstopper.

     

    Annoyance = Doesn't affect functionality but should be fixed.

     

    Other = Announcement or other non-issue.

×
×
  • Create New...