UnRAID failed to mount REISERFS [SOLVED]

snsumner · March 11, 2011

Hello,

I'm freaking out my HDD used by the parity drive was starting to fail so I installed a new HDD to replace it. When I ran the parity sync one of my drive (disk4) started generating a bunch of errors so I shutdown the system to figure out what was happening. From investigating the syslog there is something wrong withy disk4. It passed all the smartctl but according to the syslog it can't mount the REISERFS. I'm getting this message:

Mar 10 17:34:38 sumner-nas logger: mount: wrong fs type, bad option, bad superblock on /dev/md4,

Mar 10 17:34:38 sumner-nas logger: missing codepage or helper program, or other error

Mar 10 17:34:38 sumner-nas logger: In some cases useful info is found in syslog - try

Mar 10 17:34:38 sumner-nas logger: dmesg | tail or so

Mar 10 17:34:38 sumner-nas logger:

Mar 10 17:34:38 sumner-nas emhttp: _shcmd: shcmd (26): exit status: 32

Mar 10 17:34:38 sumner-nas emhttp: disk4 mount error: 32

Mar 10 17:34:38 sumner-nas emhttp: shcmd (27): rmdir /mnt/disk4

Mar 10 17:34:38 sumner-nas kernel: REISERFS warning (device md4): sh-2006 read_super_block: bread failed (dev md4, block 2, size 4096)

Mar 10 17:34:38 sumner-nas kernel: REISERFS warning (device md4): sh-2006 read_super_block: bread failed (dev md4, block 16, size 4096)

Mar 10 17:34:38 sumner-nas kernel: REISERFS warning (device md4): sh-2021 reiserfs_fill_super: can not find reiserfs on md4

Does anyone know what the I should do in this situation when I can't mount the REISERFS? I've attached my syslog from the previous reboot which hit the the same problem.

syslog-2011-03-10.txt

snsumner · March 11, 2011

BTW, I followed the instruction to check the file system from here:

http://lime-technology.com/wiki/index.php?title=Check_Disk_Filesystems

And this is the output I got:

root@sumner-nas:~# umount /dev/md4

umount: /dev/md4: not mounted

root@sumner-nas:~# reiserfsck --check /dev/md4

reiserfsck 3.6.21 (2009 www.namesys.com)

*************************************************************

** If you are using the latest reiserfsprogs and it fails **

** please email bug reports to [email protected], **

** providing as much information as possible -- your **

** hardware, kernel, patches, settings, all reiserfsck **

** messages (including version), the reiserfsck logfile, **

** check the syslog file for any related information. **

** If you would like advice on using this program, support **

** is available for $25 at www.namesys.com/support.html. **

*************************************************************

Will read-only check consistency of the filesystem on /dev/md4

Will put log info to 'stdout'

Do you want to run this program?[N/Yes] (note need to type Yes if you do):Yes

The problem has occurred looks like a hardware problem. If you have

bad blocks, we advise you to get a new hard drive, because once you

get one bad block that the disk drive internals cannot hide from

your sight,the chances of getting more are generally said to become

much higher (precise statistics are unknown to us), and this disk

drive is probably not expensive enough for you to you to risk your

time and data on it. If you don't want to follow that follow that

advice then if you have just a few bad blocks, try writing to the

bad blocks and see if the drive remaps the bad blocks (that means

it takes a block it has in reserve and allocates it for use for

of that block number). If it cannot remap the block, use badblock

option (-B) with reiserfs utils to handle this block correctly.

bread: Cannot read the block (2): (Input/output error).

Aborted

Joe L. · March 11, 2011

The disk is apparently not readable.

Try power cycling it. (removing power, then re-applying power, then trying the reiserfsck once more)

Joe L.

snsumner · March 11, 2011

The disk is apparently not readable.

Try power cycling it. (removing power, then re-applying power, then trying the reiserfsck once more)

Joe L.

I tried power cycling the unraid server and now it hanging at the BIOS screen. The second HDD busy light is flashing endlessly, none of the other HDD are flaashing. I took out each individual HDD (six total) and put them in another server and ran the DOS boot version of Seagate Tool (short test) and they all passed.

This all begin when I noticed slow I/O performance on my parity drive:

http://lime-technology.com/forum/index.php?topic=11548.0

I ordered a replacement drive and replaced the parity drive so I could ship it back to Seagate for warranty repair. I run daily smart reports and all my HDD and all the others appeared to be healthy. The parity drive had a high number of bad sectors which is why I was being proactive and swapping it out.

When I activated the parity sync I immediately got tons of errors on disk 4. About 7% into the parity sync I stopped the array and rebooted the system. On the reboot the /dev/md4 couldn't mount and now my configuration is invalid.

This might be some type of hardware issue with my motherboard or controllers. I might have to swap out the motherboard and controller to resolve this problem. Either way I'm very concerned about losing data since I have many important files on my unraid system.

I will continue to figure out how to get the system to reboot otherwise I will replace the motherboard and maybe controllers to see if that will resolve my problem.

Any suggestions are greatly appreciated.

Joe L. · March 11, 2011

The disk is apparently not readable.

Try power cycling it. (removing power, then re-applying power, then trying the reiserfsck once more)

Joe L.

I tried power cycling the unraid server and now it hanging at the BIOS screen. The second HDD busy light is flashing endlessly, none of the other HDD are flaashing. I took out each individual HDD (six total) and put them in another server and ran the DOS boot version of Seagate Tool (short test) and they all passed.

This all begin when I noticed slow I/O performance on my parity drive:

http://lime-technology.com/forum/index.php?topic=11548.0

I ordered a replacement drive and replaced the parity drive so I could ship it back to Seagate for warranty repair. I run daily smart reports and all my HDD and all the others appeared to be healthy. The parity drive had a high number of bad sectors which is why I was being proactive and swapping it out.

When I activated the parity sync I immediately got tons of errors on disk 4. About 7% into the parity sync I stopped the array and rebooted the system. On the reboot the /dev/md4 couldn't mount and now my configuration is invalid.

This might be some type of hardware issue with my motherboard or controllers. I might have to swap out the motherboard and controller to resolve this problem. Either way I'm very concerned about losing data since I have many important files on my unraid system.

I will continue to figure out how to get the system to reboot otherwise I will replace the motherboard and maybe controllers to see if that will resolve my problem.

Any suggestions are greatly appreciated.

are you sure the flash drive is still selected as the boot device?

lionelhutz · March 11, 2011

My first advice is to not expect unRAID to be a backup for your files. Setup another off-site backup system. Try Crashplan backing up to another computer or online storage.

Peter

dgaschk · March 11, 2011

I use unRAID as a backup server. If I loose my backups they are very easy to replace. unRAID is highly reliable when built and operated properly. But no storage system should not be relied upon to protect the single copy of important data.

snsumner · March 11, 2011

are you sure the flash drive is still selected as the boot device?

Yes, I got the system to reboot fine, not sure what was causing the problem. The bad news is that disk4 /dev/md4 still refuses to mount. As I stated before I ran SeaTools on it short test and it passed. I also make sure all my cables and controller are tightly connected. And disk4 was detected by the unraid computer it just can't mount the volume! I'm going to run the long test overnight on disk4 and see if it reports any problems.

I attached a zip file which contains screenshots within UnRAID interface of the problem. Screenshot of my device options before and after I swapped my parity drive. I included output of the smart test I ran on the drive. I included output of my attempt to run reiserfsck which failed. I included my syslog from my last reboot.

I ran fdisk /dev/sdc -l and got the following output:

root@sumner-nas:~# fdisk /dev/sdc -l

Disk /dev/sdc: 1500.3 GB, 1500301910016 bytes

1 heads, 63 sectors/track, 46512336 cylinders

Units = cylinders of 63 * 512 = 32256 bytes

Disk identifier: 0x00000000

Device Boot Start End Blocks Id System

/dev/sdc1 2 46512336 1465138552+ 83 Linux

Partition 1 does not end on cylinder boundary.

root@sumner-nas:~#

All my data appears to be in tack on disk1, disk2, disk3, disk5 but disk4 I cannot mount. Is there some recovery utility I can run to get whatever I can from this disk? The drive appears to be in working condition but something corrupted the reiser fs. Is there away to copy what is not corrupted to another disk? Is there anything I can do?

Any help is greatly appreciated.

diagnostic_files.zip

snsumner · March 11, 2011

So I found this thread where somebody had a very similar problem:

http://lime-technology.com/forum/index.php?topic=4021.msg35622;topicseen#msg35622

They ran --rebuild-tree and it fixed their problem. I think I'm at the point where I need to do the same thing but I want the powers to be to tell me whether I've exhausted all my other options. Please let me know if I should try running --rebuild-tree

Thanks,

Scott

lionelhutz · March 11, 2011

Yes, give it a try.

Peter

snsumner · March 11, 2011

I tried running --rebuild-tree and got the same error:

root@sumner-nas:~# reiserfsck --rebuild-tree /dev/md4

reiserfsck 3.6.21 (2009 www.namesys.com)

*************************************************************

** Do not run the program with --rebuild-tree unless **

** something is broken and MAKE A BACKUP before using it. **

** If you have bad sectors on a drive it is usually a bad **

** idea to continue using it. Then you probably should get **

** a working hard drive, copy the file system from the bad **

** drive to the good one -- dd_rescue is a good tool for **

** that -- and only then run this program. **

** If you are using the latest reiserfsprogs and it fails **

** please email bug reports to [email protected], **

** providing as much information as possible -- your **

** hardware, kernel, patches, settings, all reiserfsck **

** messages (including version), the reiserfsck logfile, **

** check the syslog file for any related information. **

** If you would like advice on using this program, support **

** is available for $25 at www.namesys.com/support.html. **

*************************************************************

Will rebuild the filesystem (/dev/md4) tree