Vova

July 18, 2017

Thanks a lot @trurl, this solved my problem.

July 18, 2017

Hello. Is there any way I can restore the unRAID system here?

July 17, 2017

up

July 14, 2017

Hello.

After a graceful shutdown, I got a message saying "LZMA data corrupt".

I went through the forum to find any suitable solution and followed the steps here: https://forums.lime-technology.com/topic/57614-lzma-data-corrupt-message/

I literally copied the contents of the 'previous' folder over the root of the flash drive, overwriting everything. I did this in Windows (in case it makes any difference).

Unfortunately, it didn't help.
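
One way to rule out a bad copy is to compare checksums of the files in 'previous' with the files now in the flash root. The Python sketch below is only an illustration under assumptions: `compare_trees` and `sha256_of` are hypothetical helper names, and the two directory paths are whatever the flash drive shows up as on your machine.

```python
import hashlib
import sys
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def compare_trees(source_dir, target_dir):
    """List files under source_dir that are missing or different in target_dir."""
    source_dir, target_dir = Path(source_dir), Path(target_dir)
    mismatches = []
    for src in sorted(source_dir.rglob("*")):
        if not src.is_file():
            continue
        rel = src.relative_to(source_dir)
        dst = target_dir / rel
        if not dst.is_file() or sha256_of(src) != sha256_of(dst):
            mismatches.append(rel.as_posix())
    return mismatches


if __name__ == "__main__" and len(sys.argv) == 3:
    # Usage: python compare.py <path to 'previous'> <path to flash root>
    for name in compare_trees(sys.argv[1], sys.argv[2]):
        print("differs or missing:", name)
```

An empty result only shows that the copy matches its source. It says nothing about whether the files in 'previous' were good to begin with.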

It now passes the initial step where it halted previously (I think it was some sort of kernel decompression?), but I get a kernel panic like this:

I've also followed the recommendation to check my memory. I ran memtest in safe mode 3 times and also ran HP's memory test in 'Insight Diagnostics'. Both report no problems. In case it's worth mentioning, I'm using ECC memory in this system.

Please help me restore the system.

June 27, 2017

Thanks Johnnie, i will leave it as it is for 1 month then.

June 26, 2017

On 6/22/2017 at 0:10 PM, johnnie.black said:

It's easy to confirm by using the disk in a different backplane, unless there's a general problem with the server/controller.

So I've attached this drive to another port of the same controller, not via the backplane but with a standard SATA cable.

B120i has 6 SATA ports (https://www.hpe.com/h20195/v2/gethtml.aspx?docname=c04168333). 4 of them are connected to the backplane, and 1 is available as a separate one.

After this I started a rebuild of the failed drive. The rebuild succeeded.

Should I assume that the backplane is faulty? Is there a solid way to verify this assumption?

Thanks for the time you spend to help us here!

June 22, 2017

Johnnie, given that I've seen the same errors previously with CentOS on completely different disks, does that provide any useful information for reconsidering the verdict?

June 21, 2017

Thank you. I've removed the Adaptec controller, which was spamming the logs.

After that I ran the rebuild procedure, and it seems the drive is flapping. I'm attaching the log.

Please give me some clues here. Should I replace the cable or the backplane?

I've seen very similar errors before on completely different drives (2TB WD SEs) in this Microserver G8 under CentOS 7.

tower-diagnostics-20170621-1421.zip

June 19, 2017

This adapter is empty; it is not connected to any drive.

Can you please point me to the instructions for rebuilding onto the same disk? Is this what I need to do? https://wiki.lime-technology.com/Replacing_a_Data_Drive

June 19, 2017

Hi. After being on vacation with my unRAID box running 24/7, today I noticed that one of the data HDDs had been put in a disabled state. There were 187 errors on the Main tab in the UI. What was also surprising is that the write count on this drive was enormous. I'm talking about a number like 18,000,000,000,000,000, or with even more zeroes. Unfortunately, I stopped and started the array without taking a screenshot first, and those counters were reset to zero.

I have a question: why did I have such a large number of writes for this HDD? I assume the drive failed because of this HUGE number of writes.
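
A number of that magnitude is close to the range of a 64-bit counter. One possible explanation (an assumption only, nothing in this thread confirms it) is that the displayed write count was a wrapped or uninitialised unsigned 64-bit value rather than a real number of writes. The Python sketch below only illustrates the arithmetic. `as_unsigned_64` is a hypothetical helper and has nothing to do with unRAID's own code.

```python
MASK_64 = (1 << 64) - 1  # largest value an unsigned 64-bit counter can hold


def as_unsigned_64(value):
    """Reinterpret a Python int as an unsigned 64-bit counter value."""
    return value & MASK_64


# A counter that drops one step below zero wraps around to the maximum value.
wrapped = as_unsigned_64(0 - 1)
print(f"{wrapped:,}")  # 18,446,744,073,709,551,615
```

If the value on the Main tab looked like this, it would point to a display or accounting glitch rather than to actual writes reaching the disk.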

attaching the diag file.

Please help to identify the root cause.

tower-diagnostics-20170619-1516.zip

June 19, 2017

I moved to the internal B120i controller and I no longer observe the issue.

thank you.

May 26, 2017

Thanks @johnnie.black

I'm now trying another backplane port for this HDD. After that I will try connecting it to other controllers.

Thanks for fast replies.

May 26, 2017

10 minutes ago, Squid said:

Have you already set up disk #2 with files?

Yes, disk #2 already has a filesystem; that was an old error from yesterday.

I saw my original error even before I inserted this new disk.

Thanks for the CSRF explanation. Now I understand the root cause.

May 26, 2017

14 minutes ago, Squid said:

post the full diagnostics.

Apologies, I didn't know about the magic "download diag" button in unRAID. Attaching.

Also, it's worth mentioning that I currently have neither a cache drive nor a parity drive in the setup.

thanks.

tower-diagnostics-20170526-1658.zip

May 26, 2017

Hello. I'm currently on the trial period and considering buying unRAID.

Once my unRAID server has been up for slightly more than a day (this is the pattern I currently observe), my Docker containers start behaving strangely: the Plex and Transmission web UIs do not open, and Transmission cannot write files.

I'm currently on the latest version of unRAID (6.3.4).

In /var/log/syslog I see the following:

Quote

May 26 16:00:58 Tower kernel: blk_update_request: I/O error, dev loop0, sector 1029568
May 26 16:00:58 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15851, flush 1, corrupt 0, gen 0
May 26 16:00:58 Tower kernel: blk_update_request: I/O error, dev loop0, sector 3126720
May 26 16:00:58 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15852, flush 1, corrupt 0, gen 0
May 26 16:00:58 Tower kernel: blk_update_request: I/O error, dev loop0, sector 1029568
May 26 16:00:58 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15853, flush 1, corrupt 0, gen 0
May 26 16:00:58 Tower kernel: blk_update_request: I/O error, dev loop0, sector 3126720
May 26 16:00:58 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15854, flush 1, corrupt 0, gen 0
May 26 16:00:59 Tower kernel: aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
May 26 16:00:59 Tower shfs/user: err: shfs_write: write: (5) Input/output error
May 26 16:01:00 Tower kernel: aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
May 26 16:01:01 Tower kernel: aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
May 26 16:01:01 Tower shfs/user: err: shfs_write: write: (5) Input/output error
May 26 16:01:02 Tower kernel: aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
May 26 16:01:02 Tower kernel: blk_update_request: 20 callbacks suppressed
May 26 16:01:02 Tower kernel: blk_update_request: I/O error, dev loop0, sector 1029568
May 26 16:01:02 Tower kernel: btrfs_dev_stat_print_on_error: 20 callbacks suppressed
May 26 16:01:02 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15875, flush 1, corrupt 0, gen 0
May 26 16:01:02 Tower kernel: blk_update_request: I/O error, dev loop0, sector 3126720
May 26 16:01:02 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15876, flush 1, corrupt 0, gen 0
May 26 16:01:02 Tower kernel: blk_update_request: I/O error, dev loop0, sector 1029568
May 26 16:01:02 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15877, flush 1, corrupt 0, gen 0
May 26 16:01:02 Tower kernel: blk_update_request: I/O error, dev loop0, sector 3126720
May 26 16:01:02 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15878, flush 1, corrupt 0, gen 0
May 26 16:01:02 Tower kernel: blk_update_request: I/O error, dev loop0, sector 1029568
May 26 16:01:02 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15879, flush 1, corrupt 0, gen 0
May 26 16:01:02 Tower kernel: blk_update_request: I/O error, dev loop0, sector 3126720
May 26 16:01:02 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15880, flush 1, corrupt 0, gen 0
May 26 16:01:03 Tower kernel: aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
May 26 16:01:03 Tower shfs/user: err: shfs_write: write: (5) Input/output error
May 26 16:01:03 Tower kernel: blk_update_request: I/O error, dev loop0, sector 1029568
May 26 16:01:03 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15881, flush 1, corrupt 0, gen 0
May 26 16:01:03 Tower kernel: blk_update_request: I/O error, dev loop0, sector 3126720
May 26 16:01:03 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15882, flush 1, corrupt 0, gen 0
May 26 16:01:03 Tower kernel: blk_update_request: I/O error, dev loop0, sector 1029568
May 26 16:01:03 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15883, flush 1, corrupt 0, gen 0
May 26 16:01:03 Tower kernel: blk_update_request: I/O error, dev loop0, sector 3126720
May 26 16:01:03 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15884, flush 1, corrupt 0, gen 0
May 26 16:01:04 Tower kernel: aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
May 26 16:01:05 Tower kernel: aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
May 26 16:01:05 Tower shfs/user: err: shfs_write: write: (5) Input/output error
May 26 16:01:06 Tower kernel: aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
May 26 16:01:07 Tower kernel: aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
May 26 16:01:07 Tower shfs/user: err: shfs_write: write: (5) Input/output error
May 26 16:01:07 Tower kernel: blk_update_request: 20 callbacks suppressed
May 26 16:01:07 Tower kernel: blk_update_request: I/O error, dev loop0, sector 1029568
May 26 16:01:07 Tower kernel: btrfs_dev_stat_print_on_error: 20 callbacks suppressed
May 26 16:01:07 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15905, flush 1, corrupt 0, gen 0
May 26 16:01:07 Tower kernel: blk_update_request: I/O error, dev loop0, sector 3126720
May 26 16:01:07 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15906, flush 1, corrupt 0, gen 0
May 26 16:01:07 Tower kernel: blk_update_request: I/O error, dev loop0, sector 1029568
May 26 16:01:07 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15907, flush 1, corrupt 0, gen 0
May 26 16:01:07 Tower kernel: blk_update_request: I/O error, dev loop0, sector 3126720
May 26 16:01:07 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15908, flush 1, corrupt 0, gen 0
May 26 16:01:07 Tower kernel: blk_update_request: I/O error, dev loop0, sector 1029568
May 26 16:01:07 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15909, flush 1, corrupt 0, gen 0
May 26 16:01:07 Tower kernel: blk_update_request: I/O error, dev loop0, sector 3126720
May 26 16:01:07 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15910, flush 1, corrupt 0, gen 0

I receive the "Host adapter dead -1" lines every second, even when there is no problem with unRAID.
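
To see which kinds of messages dominate an excerpt like the one above, the lines can be sorted into a few classes and counted. This is a minimal Python sketch. The class names and regular expressions are my own assumptions, based only on the message shapes in the quoted log, and `classify_lines` is a hypothetical helper.

```python
import re
from collections import Counter

# Hypothetical classes, derived from the message formats quoted above.
PATTERNS = {
    "aacraid adapter dead": re.compile(r"aacraid .*Host adapter dead"),
    "block I/O error": re.compile(r"blk_update_request: I/O error, dev \S+,"),
    "btrfs device error": re.compile(r"BTRFS error \(device \S+\)"),
    "shfs write error": re.compile(r"shfs_write: write: .*Input/output error"),
}


def classify_lines(lines):
    """Count how many lines match each known message class."""
    counts = Counter()
    for line in lines:
        for label, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[label] += 1
                break
        else:
            counts["other"] += 1
    return counts


if __name__ == "__main__":
    import sys
    if len(sys.argv) == 2:
        # Usage: python classify.py /var/log/syslog
        with open(sys.argv[1], errors="replace") as handle:
            for label, count in classify_lines(handle).most_common():
                print(f"{count:8d}  {label}")
```

Counting lines understates the real error rate, because the kernel rate-limits these messages ("20 callbacks suppressed").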

In /mnt/ I see this strange picture:

Quote

root@Tower:/mnt# ls -la
/bin/ls: cannot access 'user': Input/output error
/bin/ls: cannot access 'disk1': Input/output error
total 0
drwxr-xr-x 6 root root 120 May 25 21:44 ./
drwxr-xr-x 18 root root 400 May 26 10:09 ../
d????????? ? ? ? ? ? disk1/
drwxrwxrwx 3 nobody users 17 May 25 23:06 disk2/
drwxrwxrwx 3 nobody users 60 May 25 23:16 disks/
d????????? ? ? ? ? ? user/
root@Tower:/mnt#
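
The `d?????????` entries appear because `ls` can read the names from the directory but the follow-up stat call on each one fails with an I/O error. The same check can be scripted. This is a minimal Python sketch, and `probe_entries` is a hypothetical helper name.

```python
import errno
import os


def probe_entries(parent):
    """Stat every entry under parent; return 'ok' or the errno name per entry."""
    results = {}
    for name in sorted(os.listdir(parent)):
        path = os.path.join(parent, name)
        try:
            os.stat(path)
            results[name] = "ok"
        except OSError as exc:
            # For example EIO ("Input/output error") for a mount whose device is gone.
            results[name] = errno.errorcode.get(exc.errno, str(exc.errno))
    return results


if __name__ == "__main__":
    import sys
    if len(sys.argv) == 2:
        # Usage: python probe.py /mnt
        for name, status in probe_entries(sys.argv[1]).items():
            print(f"{name:12s} {status}")
```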

What am I doing wrong?

The full dmesg, which I'm attaching, is full of:

Quote

[66614.291345] btrfs_dev_stat_print_on_error: 20 callbacks suppressed
[66614.291348] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19235, flush 1, corrupt 0, gen 0
[66614.291392] blk_update_request: I/O error, dev loop0, sector 3126720
[66614.291394] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19236, flush 1, corrupt 0, gen 0
[66614.291571] blk_update_request: I/O error, dev loop0, sector 1029568
[66614.291573] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19237, flush 1, corrupt 0, gen 0
[66614.291594] blk_update_request: I/O error, dev loop0, sector 3126720
[66614.291595] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19238, flush 1, corrupt 0, gen 0
[66614.291673] blk_update_request: I/O error, dev loop0, sector 1029568
[66614.291677] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19239, flush 1, corrupt 0, gen 0
[66614.291691] blk_update_request: I/O error, dev loop0, sector 3126720
[66614.291693] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19240, flush 1, corrupt 0, gen 0
[66614.326048] aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
[66615.291837] blk_update_request: I/O error, dev loop0, sector 1029568
[66615.291842] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19241, flush 1, corrupt 0, gen 0
[66615.291861] blk_update_request: I/O error, dev loop0, sector 3126720
[66615.291863] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19242, flush 1, corrupt 0, gen 0
[66615.291968] blk_update_request: I/O error, dev loop0, sector 1029568
[66615.291970] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19243, flush 1, corrupt 0, gen 0
[66615.292228] blk_update_request: I/O error, dev loop0, sector 3126720
[66615.292230] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19244, flush 1, corrupt 0, gen 0
[66615.350036] aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
[66616.310040] aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
[66617.334057] aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
[66618.358064] aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
[66619.294418] blk_update_request: 20 callbacks suppressed
[66619.294420] blk_update_request: I/O error, dev loop0, sector 1029568
[66619.294422] btrfs_dev_stat_print_on_error: 20 callbacks suppressed
[66619.294425] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19265, flush 1, corrupt 0, gen 0
[66619.294451] blk_update_request: I/O error, dev loop0, sector 3126720
[66619.294453] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19266, flush 1, corrupt 0, gen 0
[66619.294569] blk_update_request: I/O error, dev loop0, sector 1029568
[66619.294572] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19267, flush 1, corrupt 0, gen 0
[66619.294593] blk_update_request: I/O error, dev loop0, sector 3126720
[66619.294595] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19268, flush 1, corrupt 0, gen 0
[66619.294668] blk_update_request: I/O error, dev loop0, sector 1029568
[66619.294670] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19269, flush 1, corrupt 0, gen 0
[66619.294682] blk_update_request: I/O error, dev loop0, sector 3126720
[66619.294684] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19270, flush 1, corrupt 0, gen 0
[66619.318066] aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
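
The `rd` counter in the BTRFS lines keeps increasing even when messages are suppressed, so it is a better measure of the error rate than the number of log lines. Here is a minimal Python sketch that pulls the first and last counter values out of a dmesg excerpt. The regular expression is an assumption based on the line format quoted above, and `read_error_growth` is a hypothetical helper.

```python
import re

# Matches lines such as:
# [66614.291348] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19235, ...
ERRS_RE = re.compile(
    r"^\[\s*(?P<ts>\d+\.\d+)\]\s+BTRFS error \(device \S+\): "
    r".*errs: wr \d+, rd (?P<rd>\d+),"
)


def read_error_growth(lines):
    """Return (increase in rd counter, elapsed seconds) between first and last sample."""
    samples = []
    for line in lines:
        match = ERRS_RE.match(line.strip())
        if match:
            samples.append((float(match.group("ts")), int(match.group("rd"))))
    if len(samples) < 2:
        return None
    (t_first, rd_first), (t_last, rd_last) = samples[0], samples[-1]
    return rd_last - rd_first, t_last - t_first


if __name__ == "__main__":
    import sys
    growth = read_error_growth(sys.stdin)
    if growth:
        print(f"rd errors grew by {growth[0]} in {growth[1]:.1f} s")
```

Run it as `dmesg | python growth.py` (the script name is arbitrary).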

As for the system, I'm using an HP Microserver G8 with an Adaptec 3405. The disks are in Volume (JBOD) mode.

Please help

Posts posted by Vova

[SOLVED] LZMA Data Corrupt.

HDD failed with HUGE write count

errors in logs (SOLVED)
