Jump to content

[SOLVED] Strange messup with disks


Recommended Posts

Sorry for not having a syslog, but for some reason the "telnet tower" or "telnet + ip" commands don't work. Is it because of Win7?

 

I'm running unraid 4.7

 

My problem:

 

A couple of days ago I replaced one Seagate disc with a WD 2T EARS disc. I had it MBR 4k aligned. I have not used jumpers on my previous EARS hdds and I have on unaligned EARS in my array ATM.

 

Did the data-rebuild and everything was fine. I have power cycled it with no probs.

 

But the new drive was outside the case the whole time and I thought I would install it inside. I did it, powered on and one other disk had a blue dot (Asking for a data-rebuild)... First I thought it was the cables, but tried to change the disk to other cable and had the same blue dot on that same disk. Then I thought if I would just change four cables randomly. Then it had four red dots. The disks order was not correct, so I was putting them in order when I noticed that the one slot that had the new disk was telling that the correct disc was the one that had been replaced?

 

There was the option to start and let the unraid to save the new disc order and I did that and started to run parity check straight after but I was getting major sync errors.

 

What could be wrong? How would the array understand that there is a new disc that is correct?

 

Thanks in advance!

 

syslog.txt

Link to comment

I'll withhold my comments on randomly changing cables...

 

In any case, when you moved the disk inside the machine, did you connect it to the same cable / sata port as it was connected to when outside the box?  If not, its quite possible the disk order changed when you moved it inside.

 

Are you CERTAIN the parity disk is assigned correctly?  If it isn't, and started Unraid, you just wiped whatever data was on that drive.

 

If you are comfortable that parity has been correctly assigned, and the data on the other drives in intact, you can type initconfig at the server prompt, which will remove the current configuration, allow you to reassign the drives, start the array, and it will recalculate parity.  NOTE:  I'd strongly suggest posting a syslog before going this route if at all possible.

Link to comment

Sorry for not having a syslog, but for some reason the "telnet tower" or "telnet + ip" commands don't work. Is it because of Win7?

 

Yes, the telnet client is disabled by default in Win7.  Here's how to enable it:

 

Start Menu,

Control Panel,

Program and Features,

Turn Windows features on or off,

Telnet Client (ensure it's checked/selected),

Click OK

 

PuTTY works fine too.

Link to comment

Thanks Rajahal!

 

I think the problem lies here:

 

Jun  3 03:02:57 Tower logger: mount: /dev/md7: can't read superblock

Jun  3 03:02:57 Tower emhttp: _shcmd: shcmd (40): exit status: 32

Jun  3 03:02:57 Tower emhttp: disk7 mount error: 32

Jun  3 03:02:57 Tower emhttp: shcmd (42): rmdir /mnt/disk7

Jun  3 03:02:57 Tower kernel: REISERFS warning: reiserfs-5090 is_tree_node: node level 1 does not match to the expected one 4

Jun  3 03:02:57 Tower kernel: REISERFS error (device md7): vs-5150 search_by_key: invalid format found in block 94471506. Fsck?

Jun  3 03:02:57 Tower kernel: REISERFS (device md7): Remounting filesystem read-only

Jun  3 03:02:57 Tower kernel: REISERFS error (device md7): vs-13070 reiserfs_read_locked_inode: i/o failure occurred trying to find stat data of [1 2 0x0 SD]

 

What does those mean?

Link to comment

Thanks Rajahal!

 

I think the problem lies here:

 

Jun  3 03:02:57 Tower logger: mount: /dev/md7: can't read superblock

Jun  3 03:02:57 Tower emhttp: _shcmd: shcmd (40): exit status: 32

Jun  3 03:02:57 Tower emhttp: disk7 mount error: 32

Jun  3 03:02:57 Tower emhttp: shcmd (42): rmdir /mnt/disk7

Jun  3 03:02:57 Tower kernel: REISERFS warning: reiserfs-5090 is_tree_node: node level 1 does not match to the expected one 4

Jun  3 03:02:57 Tower kernel: REISERFS error (device md7): vs-5150 search_by_key: invalid format found in block 94471506. Fsck?

Jun  3 03:02:57 Tower kernel: REISERFS (device md7): Remounting filesystem read-only

Jun  3 03:02:57 Tower kernel: REISERFS error (device md7): vs-13070 reiserfs_read_locked_inode: i/o failure occurred trying to find stat data of [1 2 0x0 SD]

 

What does those mean?

It means you must fix the file-system corruption.  TO prevent further corruption, the OS has made the file system read-only until it is fixed.

Follow the steps in the wiki here:http://lime-technology.com/wiki/index.php?title=Check_Disk_Filesystems

 

Joe L.

http://lime-technology.com/wiki/index.php?title=Check_Disk_Filesystems

Link to comment

I just thinking could the "Start and let the unraid to save the new disc order" be the reason for the red dot?

 

Could it be possible that the parity saved the broken file system when I applied that option?

 

And now that it is fixed it thinks its not correct?

 

When I saved the new disc order all dots were green, tried parity check and got massive amount of errors. Rebooted and the 7th drive showed as unformatted.

 

Would it be safer to just rebuild the drive from parity?

Link to comment

Thanks Joe!

 

I did the rebuild-tree and now seems to be fine with reiserfsck -check.

 

But it still has a red dot. Now it's not showing as unformatted, but it's not in the array.

 

Syslog included.

A disk with a "red" indicator has had a "write" to it fail.

It will not go back into service simply because you fix the reason for the the write failure (loose cable)

 

The only way to get the same drive back is to allow unRAID to re-construct the contents onto itself.

(including the "write" on that had originally failed)

 

The only way unRAID will re-construct onto a drive is for it to think it is a "new/different" drive than original.  You can fake out unRAID to think the old disk is its own replacement by

Stopping the array

Un-assigning the failed disk

Starting the array with the disk un-assigned (this causes unRAID to forget the model/serial number)

Stopping the array

Re-assigning the failed disk

Starting the array once more.  (this will re-construct the disk based on parity and all the remaining disks)

 

Once the failed disk is re-constructed you also need to perform a parity check, to ensure the contents you wrote is readable.

 

Joe L.

Link to comment

I'm getting a new hdd today, but I still would like to ask one thing.

 

When reiserfsck is used, does it just fix something that a data-rebuild would too?

 

So if I have one disk that has write errors, could I just connect a new one and start data-rebuild there?

 

Or does the reiserfsck some how change the parity data?

 

Bare with me, I'm trying to figure this situation and I'm no linux-savvy person.  :-[

Link to comment

Archived

This topic is now archived and is closed to further replies.

×
×
  • Create New...