berizzle

Posts posted by berizzle

  1. Precleared a brand new drive. Added lots of data to the machine and the disk is now disabled.

    Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
    # 1  Extended offline    Completed: read failure       70%       211         20836536
    # 2  Extended offline    Completed: read failure       40%       207         20836536
    # 3  Short offline       Completed without error       00%       201         -

    See SMART log

    What do I do next?

    WDC_WD60EFRX-68L0BN1.txt
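
A minimal sketch, assuming smartmontools is installed; `first_error_lbas` is a hypothetical helper name and `/dev/sdX` is a placeholder. It pulls the distinct `LBA_of_first_error` values out of a self-test log like the one above, which makes it easy to see whether repeated tests fail at the same sector.

```shell
#!/bin/sh
# Print each distinct LBA_of_first_error from a SMART self-test log on stdin.
# Rows whose last field is "-" (no error) are skipped.
first_error_lbas() {
    awk '/^ *# *[0-9]+/ && $NF ~ /^[0-9]+$/ { print $NF }' | sort -u
}

# Usage on a live system (device name is a placeholder):
#   smartctl -l selftest /dev/sdX | first_error_lbas
```

For the log in this post it prints a single LBA, since both extended tests stopped at the same place.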

  2. 1 minute ago, johnnie.black said:

     

    Reiserfs disks seem to be the #1 reason for this issue. Convert one of your disks to XFS, limit all writes to that disk for a few days/weeks by changing your share(s)' included disks, and see if the crashing stops. If yes, convert the remaining disks.

     

    PS: IMO you should convert even if this isn't the source of the problem, there have been multiple issues with reiser lately and they have terrible performance in certain situations.

     

    I have 23 drives: 21 are Reiserfs (42TB) and 2 are XFS (6TB).

    There is 9TB free across all the drives.

    Is there any process that makes sense to convert these disks?

  3. Just now, RobJ said:

     

    This is Linux, those are completely allowable.  It's only a problem when you interface to Windows stations, where it's not allowable.

     

    You are right. I was just thinking of other users who may hit this same issue: unRAID could warn you when the same name has been created in a different case.

     

    53 minutes ago, Squid said:

    None of these problems however should cause the system to stop responding....  But it does confuse the hell out of SMB so who knows.

     

    HTTP and AFP were non responsive. I think I saw out of memory messages in the video too.

     

     

    Anyway, I think this should solve the issues. Thanks all. Maybe this should be a plugin installed by default?

  4. So it looks like these may be the problems. Too bad the interface doesn't disallow or alert the user when trying to create a share with the same spelling but different case. I am not sure how some of those happened.

     

    I'll work on these and see what happens.

    Screen Shot 2017-03-18 at 6.05.50 PM.png
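
A rough sketch of how such clashes could be spotted from the console. `case_collisions` is a hypothetical helper name, and listing `/mnt/user` assumes that is where the shares appear on this server.

```shell
#!/bin/sh
# Read one name per line on stdin and print the groups that
# differ only by upper/lower case, one group per output line.
case_collisions() {
    awk '
        {
            key = tolower($0)
            if (key in seen) seen[key] = seen[key] " | " $0
            else seen[key] = $0
            count[key]++
        }
        END { for (k in count) if (count[k] > 1) print seen[k] }
    ' | sort
}

# Usage (path is an assumption):
#   ls -1 /mnt/user | case_collisions
```

Any line it prints is a set of names that Linux treats as distinct but that a Windows or SMB client would see as the same.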

  5. If you want to move it sure go ahead. 

    This only started in the last couple of weeks; before then I had not seen it. This machine has been running unRAID for 5+ years now.

     

    It would be nice to know how to properly stop the array from the local console before I reboot it again.

  6. Updated from 6.0.0 to 6.0.1. I did not start the array after the reboot. I changed the array slots from 24 to 22 and the cache slots from 1 to 2, only to see the options for additional cache slots. Now I see the cache drive FS is btrfs and not reiserfs. I started the array and, you know the rest, I have an option to format the cache drive. :(

     

     

    How do I undo what happened? Or at least mount the drive to recover the data?

    I am surprised this happened.

  7. Thank you for being so kind, htcnewbie.

     

    Kaveh see http://lime-technology.com/forum/index.php?topic=33341.885

     

    For every app that you have installed, create a copy of the last 6 lines.

    Add this to your go script.

    What is the app name? See the url that you would normally go to.

     

    Example: in http://10.5.4.5:58000/Settings/flexget, "flexget" is the app name.

    
    ### Fix plugins
    cp -r /usr/local/emhttp/plugins/webGui/phaze.page /usr/local/emhttp/plugins/dynamix
    
    echo "" >> /usr/local/emhttp/plugins/plexmediaserver/plexmediaserver.page 
    echo "---" >> /usr/local/emhttp/plugins/plexmediaserver/plexmediaserver.page 
    echo "" >> /usr/local/emhttp/plugins/plexmediaserver/plexmediaserver.page 
    cat /usr/local/emhttp/plugins/plexmediaserver/plexmediaserver.php >> /usr/local/emhttp/plugins/plexmediaserver/plexmediaserver.page
    rm -f /usr/local/emhttp/plugins/plexmediaserver/plexmediaserver.php
    sed -i 's!\r!!g' /usr/local/emhttp/plugins/plexmediaserver/plexmediaserver.page
    
    echo "" >> /usr/local/emhttp/plugins/couchpotato/couchpotato.page 
    echo "---" >> /usr/local/emhttp/plugins/couchpotato/couchpotato.page 
    echo "" >> /usr/local/emhttp/plugins/couchpotato/couchpotato.page 
    cat /usr/local/emhttp/plugins/couchpotato/couchpotato.php >> /usr/local/emhttp/plugins/couchpotato/couchpotato.page
    rm -f /usr/local/emhttp/plugins/couchpotato/couchpotato.php
    sed -i 's!\r!!g' /usr/local/emhttp/plugins/couchpotato/couchpotato.page
    
    echo "" >> /usr/local/emhttp/plugins/transmission/transmission.page 
    echo "---" >> /usr/local/emhttp/plugins/transmission/transmission.page 
    echo "" >> /usr/local/emhttp/plugins/transmission/transmission.page 
    cat /usr/local/emhttp/plugins/transmission/transmission.php >> /usr/local/emhttp/plugins/transmission/transmission.page
    rm -f /usr/local/emhttp/plugins/transmission/transmission.php
    sed -i 's!\r!!g' /usr/local/emhttp/plugins/transmission/transmission.page
    

     

     

    So, after looking at the new unraid beta 12, it seems that if I update my plugins, all users below beta 12 will get broken plugins. Also, since they are making changes on each beta release, it's getting hard to keep updating them all with the changes. I will likely wait for the first release candidate before updating all 17 plugins for full compatibility. In the meantime, you can do the following to get any plugin working on beta 12. You can set this in your GO script to automate it, but you need to add this for each plugin of mine you have that doesn't work on beta 12 (replace APPNAME with the name of the app plugin in lowercase):

     

    cp -r /usr/local/emhttp/plugins/webGui/phaze.page /usr/local/emhttp/plugins/dynamix
    echo "" >> /usr/local/emhttp/plugins/(APPNAME)/(APPNAME).page
    echo "---" >> /usr/local/emhttp/plugins/(APPNAME)/(APPNAME).page
    echo "" >> /usr/local/emhttp/plugins/(APPNAME)/(APPNAME).page
    cat /usr/local/emhttp/plugins/(APPNAME)/(APPNAME).php >> /usr/local/emhttp/plugins/(APPNAME)/(APPNAME).page
    rm -f /usr/local/emhttp/plugins/(APPNAME)/(APPNAME).php
    sed -i 's!\r!!g' /usr/local/emhttp/plugins/(APPNAME)/(APPNAME).page

     

    What this does is copy the page file out of the webGUI folder, which no longer exists, and into the dynamix folder, which is now standard from unraid beta 12 and up. It also copies the PHP page into the PAGE file, since they no longer allow separate files. Then it removes the PHP file, since having it there will cause weirdness on the plugin page. Lastly, it clears out the ^M carriage returns that Windows left, which break the plugin from showing. I'll have my version 2.0.0 plugins compatible with the RC / Final when it's closer to release, but updating these plugins right now would make earlier beta users' plugins inoperable.
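
Rather than pasting six lines per app, the per-plugin steps could be wrapped in one function and looped over. This is only a sketch: `fix_plugin` is a hypothetical name, the second argument (an alternate plugins root) exists purely so it can be tried on a scratch copy, and `tr` is swapped in for `sed -i` to strip the carriage returns. The one-time `cp` of phaze.page is left out and would still be needed.

```shell
#!/bin/sh
# Merge APPNAME.php into APPNAME.page for one plugin, as in the steps above.
# $1 = app name in lowercase
# $2 = plugins root (defaults to the path used in the post)
fix_plugin() {
    app=$1
    root=${2:-/usr/local/emhttp/plugins}
    page="$root/$app/$app.page"
    php="$root/$app/$app.php"

    # Nothing to merge if the separate PHP file is already gone.
    [ -f "$php" ] || return 0

    # Append a blank line, the "---" separator, a blank line, then the PHP body.
    { echo ""; echo "---"; echo ""; cat "$php"; } >> "$page"
    rm -f "$php"

    # Strip Windows carriage returns (^M) from the merged page.
    tr -d '\r' < "$page" > "$page.tmp" && mv "$page.tmp" "$page"
}

# Example go-script usage with the three apps named in the post:
#   for app in plexmediaserver couchpotato transmission; do fix_plugin "$app"; done
```

Because it returns early when the PHP file is missing, running it twice on the same plugin does not append the separator a second time.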

  8. I followed the simple steps from http://lime-technology.com/wiki/index.php/UnRAID_Manual_5#Replace_a_failed_disk.

    After powering on the machine I let it sync, and I am now running the parity check with "correct any parity errors" turned off. I still see disk5 as unformatted while the parity sync is running.

     

     

    Should I scrap the sync and then format the drive and run rebuild, then parity check without correction again? I did preclear the drive.

  9. Frank1940 and bkastner you have both contributed a lot of very good information and to the point. I really appreciate your help.

     

     

    I spent the rest of my Sunday researching the issue. I wonder if the semi-seated SATA cable to the controller caused this. There is a train that rolls by about 100 feet away. Maybe the vibrations caused this? I have no issues doing what was suggested; I am just curious as to how disk5 developed issues, besides the high power-on hours.

     

     

    After looking at the syslog during bootup I took a more in depth look at disk5. I mounted the drive with mount -t reiserfs /dev/sdp1 /tmp/x. I cd'd to the dir and could view the folders. I was going to attempt to run Joe L's unraid_partition_disk but have now decided against it for the time being as I will just replace the drive with a precleared drive.

     

     

    Do either of you folks have any recommendations / opinions on what I can do to this machine to extend the life of the hard drives, both current and new ones? Settings, configurations, scripts, spin down, etc.

  10. OK, good info regarding the bathtub curve.

    Some drives have been on that long, huh? Wow

     

     

     

    What is DSBL on disk5?

    I have two 3TB drives still in the package. I would preclear them, then what? What are the next steps to get the data from disk5?

    Also, how do I properly swap a drive and have the data put back on?

    Lastly, how do I determine the drives to replace from looking at the SMART Report?

     

     

    I pulled the case off the rack. A few of the SATA cables may not have been connected securely. I snugged them all up and powered on.

     

     

    I now have this:

    SMART Report

    Syslog

     

     

    This is array off.

    2014-07-06_18-27-42.png

     

     

    This is array on. This does not look good.

    2014-07-06_18-52-03.png

     

     

     

     

     

     

  11. http://lime-technology.com/wiki/index.php?title=The_Analysis_of_Drive_Issues#Drive_interface_issue_.231

     

    Your syslog shows both sdr and sdq having issues, and at first glance they appear to be physical. Try the link above for details, but in summary, the SATA cable may be bad/loose. It could also be a power supply problem, so check that all connections are solid.

     

    Perhaps those two drives share a common controller, are in the same drive cage, or share a SAS cable?

     

     

    I have a 24 bay case. I'll need to open it up to get info. I no longer have it.

    As far as I remember there are 3 cards for all 24 bays.

     

     

    Is there anything I can do to test the hardware?

     

     

    This is what I have today.

     

    unRAID Version: 5.0

    What should be my next step?

     

     

    It also looks like you have an issue with disk4 as well as disk5. You should probably also list your hardware, particularly what you are using for your SATA expansion cards. You also had two disks that failed to return SMART data. (I suspect that these disks were disk4 and disk5, since they were not indicating that they were powered up: no disk temperatures in the table.)

     

    It also appears that you have a lot of disks with very high 'power on' hours.  While the actual hours are only an indication that you could be reaching the end-of-life area of the bathtub failure curve, it does raise the risk for multiple disk failures substantially!  I hope you have another backup of your critical data.

     

    Why do they have high power on hours? How do I fix that? Is this bad?
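
On the power-on hours themselves: the counter simply accumulates for as long as a drive is powered, so it reads as age rather than as a fault. A small sketch for putting the number in perspective, assuming the usual `smartctl -A` attribute table where the raw value is the tenth column; `power_on_years` is a hypothetical helper name and `/dev/sdX` a placeholder.

```shell
#!/bin/sh
# Read `smartctl -A` output on stdin and print Power_On_Hours
# as hours and approximate years (8760 hours per year).
power_on_years() {
    awk '$2 == "Power_On_Hours" {
        hours = $10 + 0   # raw value; any trailing text after the number is ignored
        printf "%d hours (~%.1f years)\n", hours, hours / 8760
    }'
}

# Usage on a live system (device name is a placeholder):
#   smartctl -A /dev/sdX | power_on_years
```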

     

  12. I think it has to do with http://lime-technology.com/wiki/index.php/UnRAID_Manual#User_shares_2, but I am not sure.

    I can't wrap my head around that.

     

     

    Yes, shares are on. I am not sure how you would see data with shares being off.

    What more info can I give you?

     

    It may have something to do with the settings here, I believe: http://tower/Settings/ShareSettingsMenu

    Included disk(s): All

    Excluded disk(s): None

    Those are the current settings there.

  13.  

    What settings do I need to change so that when I create a folder on one disk, it doesn't automatically get copied to other disks?

     

    I have disk1, disk2, disk3 etc. If I create "foldera" in disk1 I see it in disk2 and disk3 but it may be empty or it may have files.

     

    I would like to keep every drive separate from the others in the future.

     

    How do I go about cleaning up what has been done? I have 15 drives.

     

    I am using 5.0-rc11 currently but I don't believe my issue is with this version.
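
Before cleaning anything up, it may help to see where a folder really exists on each disk, as opposed to the merged view a user share presents. A minimal sketch, assuming the data disks are mounted as `/mnt/disk1`, `/mnt/disk2`, and so on; `folder_by_disk` is a hypothetical helper name.

```shell
#!/bin/sh
# For each disk directory under a root, report whether a top-level
# folder exists there and how many files it holds.
# $1 = folder name
# $2 = root containing disk1, disk2, ... (assumed to be /mnt)
folder_by_disk() {
    name=$1
    root=${2:-/mnt}
    for disk in "$root"/disk*; do
        [ -d "$disk/$name" ] || continue
        files=$(find "$disk/$name" -type f | wc -l | tr -d ' ')
        echo "$(basename "$disk"): $files file(s)"
    done
}

# Usage:
#   folder_by_disk foldera
```

Disks that report 0 file(s) hold only an empty copy of the folder; `rmdir` removes a directory only when it is empty, which makes it a cautious way to clear those out.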