Vova
-
Posts
15 -
Joined
-
Last visited
Content Type
Profiles
Forums
Downloads
Store
Gallery
Bug Reports
Documentation
Landing
Posts posted by Vova
-
-
hello. is there any way i can restore unraid system here?
-
-
Hello.
After graceful shutdown, i've got the message saying LZMA data corrupt.
I went through the forum to find any suitable solution and followed the steps here: https://forums.lime-technology.com/topic/57614-lzma-data-corrupt-message/
Literally, i've copied the contents of the 'previous' folder over the root of the flash, overwriting everything. I've done it in Windows OS (if it makes any difference)
Unfortunately, it didn't help.
It passes the initial step (where it halted previously, i think it was some sort of decompression of the kernel?), but now i get kernel panic like this:
i've also followed the recommendations to check my memory. I've run memtest in safe mode 3 times and also have run HP's memory test in 'Insight Diagnostics'. Both of them say no reason to worry. If it's worth mentioning, i'm using ECC memory in this system.
please help me to restore the system now.
-
Thanks Johnnie, i will leave it as it is for 1 month then.
-
On 6/22/2017 at 0:10 PM, johnnie.black said:
It's easy to confirm by using the disk in a different backplane, unless there's a general problem with the server/controller.
so, i've attached this drive to another port of the same controller, not via the backplane but via standard SATA cable.
B120i has 6 SATA ports (https://www.hpe.com/h20195/v2/gethtml.aspx?docname=c04168333). 4 of them are connected to the backplane, and 1 is available as a separate one.
After this i've initiated rebuild of the failed drive. rebuild succeeded.
Should i assume that the backplane is faulty? is there a solid way to verify this assumption?
Thanks for the time you spend to help us here!
-
Johhnie, given that i've seen the same errors previously with centos on completely other disks - does it provide any valuable info to reconsider the verdict?
-
Thank you. i've removed the Adaptec controller which was rubbishing the logs.
after that i've run the rebuild procedure and seems the drive is flapping. i'm attaching the log.
please give me some clues here. should i replace the cable/backplane?
i've seen very similar errors on completely other drives on this Microserver G8 on Centos 7 before (2TB WD SEs)
-
this adapter is empty, not connected to any drive.
can you please point me to the instruction how to rebuild to the same disk? Is this what do i need to do? https://wiki.lime-technology.com/Replacing_a_Data_Drive
-
Hi. after being on a vacation with my unraid box turned on 24/7 today i've noticed that one of the data HDD was put in a disabled state. there were 187 errors on the Main tab in UI. Also, what was surprising is that the write count on this drive was very huge. i'm talking about very big number like 18,000,000,000,000,000 or even more zeroes. Unfortunately, i've stopped/starter the array w/o taking the screenshot before and that counters were zeroed out.
I have a question - why did i have such a big number of writes for this HDD? i assume that because of this HUGE number of writes the drive has failed.
attaching the diag file.
Please help to identify the root cause.
-
i moved to internal b120i controller and i do not observe the issue any more.
thank you.
-
Thanks @johnnie.black
i'm now trying other backplane port for this HDD. after that i will try to connect to other controllers.
Thanks for fast replies.
-
10 minutes ago, Squid said:
Have you already set up disk #2 with files?
yes, disk#2 is already with an fs, this was an old error yesterday.
i've seen my root error even before i pushed in this new disk.
thanks for csrf explanation. now i understand the root cause.
-
14 minutes ago, Squid said:
post the full diagnostics.
Apologize, i didn't know the magic "download diag" button in unraid. attaching.
also, it's worth mentioning, i do not have cache neither parity drive now in the setup
thanks.
-
Hello. I'm now on trial period, considering to buy unraid.
If my unraid setup is uptime for slightly more than 1 day (i currently observe such a pattern), i'm starting to observe weird behavior of my dockers (plex not opening the webUI, transmission the same, transmission not being able to write the file)
i'm on the latest version of unraid currently (6.3.4)
in /var/log/syslog i see the following:
QuoteMay 26 16:00:58 Tower kernel: blk_update_request: I/O error, dev loop0, sector 1029568
May 26 16:00:58 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15851, flush 1, corrupt 0, gen 0
May 26 16:00:58 Tower kernel: blk_update_request: I/O error, dev loop0, sector 3126720
May 26 16:00:58 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15852, flush 1, corrupt 0, gen 0
May 26 16:00:58 Tower kernel: blk_update_request: I/O error, dev loop0, sector 1029568
May 26 16:00:58 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15853, flush 1, corrupt 0, gen 0
May 26 16:00:58 Tower kernel: blk_update_request: I/O error, dev loop0, sector 3126720
May 26 16:00:58 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15854, flush 1, corrupt 0, gen 0
May 26 16:00:59 Tower kernel: aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
May 26 16:00:59 Tower shfs/user: err: shfs_write: write: (5) Input/output error
May 26 16:01:00 Tower kernel: aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
May 26 16:01:01 Tower kernel: aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
May 26 16:01:01 Tower shfs/user: err: shfs_write: write: (5) Input/output error
May 26 16:01:02 Tower kernel: aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
May 26 16:01:02 Tower kernel: blk_update_request: 20 callbacks suppressed
May 26 16:01:02 Tower kernel: blk_update_request: I/O error, dev loop0, sector 1029568
May 26 16:01:02 Tower kernel: btrfs_dev_stat_print_on_error: 20 callbacks suppressed
May 26 16:01:02 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15875, flush 1, corrupt 0, gen 0
May 26 16:01:02 Tower kernel: blk_update_request: I/O error, dev loop0, sector 3126720
May 26 16:01:02 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15876, flush 1, corrupt 0, gen 0
May 26 16:01:02 Tower kernel: blk_update_request: I/O error, dev loop0, sector 1029568
May 26 16:01:02 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15877, flush 1, corrupt 0, gen 0
May 26 16:01:02 Tower kernel: blk_update_request: I/O error, dev loop0, sector 3126720
May 26 16:01:02 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15878, flush 1, corrupt 0, gen 0
May 26 16:01:02 Tower kernel: blk_update_request: I/O error, dev loop0, sector 1029568
May 26 16:01:02 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15879, flush 1, corrupt 0, gen 0
May 26 16:01:02 Tower kernel: blk_update_request: I/O error, dev loop0, sector 3126720
May 26 16:01:02 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15880, flush 1, corrupt 0, gen 0
May 26 16:01:03 Tower kernel: aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
May 26 16:01:03 Tower shfs/user: err: shfs_write: write: (5) Input/output error
May 26 16:01:03 Tower kernel: blk_update_request: I/O error, dev loop0, sector 1029568
May 26 16:01:03 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15881, flush 1, corrupt 0, gen 0
May 26 16:01:03 Tower kernel: blk_update_request: I/O error, dev loop0, sector 3126720
May 26 16:01:03 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15882, flush 1, corrupt 0, gen 0
May 26 16:01:03 Tower kernel: blk_update_request: I/O error, dev loop0, sector 1029568
May 26 16:01:03 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15883, flush 1, corrupt 0, gen 0
May 26 16:01:03 Tower kernel: blk_update_request: I/O error, dev loop0, sector 3126720
May 26 16:01:03 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15884, flush 1, corrupt 0, gen 0
May 26 16:01:04 Tower kernel: aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
May 26 16:01:05 Tower kernel: aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
May 26 16:01:05 Tower shfs/user: err: shfs_write: write: (5) Input/output error
May 26 16:01:06 Tower kernel: aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
May 26 16:01:07 Tower kernel: aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
May 26 16:01:07 Tower shfs/user: err: shfs_write: write: (5) Input/output error
May 26 16:01:07 Tower kernel: blk_update_request: 20 callbacks suppressed
May 26 16:01:07 Tower kernel: blk_update_request: I/O error, dev loop0, sector 1029568
May 26 16:01:07 Tower kernel: btrfs_dev_stat_print_on_error: 20 callbacks suppressed
May 26 16:01:07 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15905, flush 1, corrupt 0, gen 0
May 26 16:01:07 Tower kernel: blk_update_request: I/O error, dev loop0, sector 3126720
May 26 16:01:07 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15906, flush 1, corrupt 0, gen 0
May 26 16:01:07 Tower kernel: blk_update_request: I/O error, dev loop0, sector 1029568
May 26 16:01:07 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15907, flush 1, corrupt 0, gen 0
May 26 16:01:07 Tower kernel: blk_update_request: I/O error, dev loop0, sector 3126720
May 26 16:01:07 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15908, flush 1, corrupt 0, gen 0
May 26 16:01:07 Tower kernel: blk_update_request: I/O error, dev loop0, sector 1029568
May 26 16:01:07 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15909, flush 1, corrupt 0, gen 0
May 26 16:01:07 Tower kernel: blk_update_request: I/O error, dev loop0, sector 3126720
May 26 16:01:07 Tower kernel: BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 15910, flush 1, corrupt 0, gen 0The
Host adapter dead -1
lines i receive each second, even if there is no problem with the unraid.
in /mnt/ i see this strange picture:
Quoteroot@Tower:/mnt# ls -la
/bin/ls: cannot access 'user': Input/output error
/bin/ls: cannot access 'disk1': Input/output error
total 0
drwxr-xr-x 6 root root 120 May 25 21:44 ./
drwxr-xr-x 18 root root 400 May 26 10:09 ../
d????????? ? ? ? ? ? disk1/
drwxrwxrwx 3 nobody users 17 May 25 23:06 disk2/
drwxrwxrwx 3 nobody users 60 May 25 23:16 disks/
d????????? ? ? ? ? ? user/
root@Tower:/mnt#
What am i doing wrong?
attaching full dmesg is full of:
Quote[66614.291345] btrfs_dev_stat_print_on_error: 20 callbacks suppressed
[66614.291348] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19235, flush 1, corrupt 0, gen 0
[66614.291392] blk_update_request: I/O error, dev loop0, sector 3126720
[66614.291394] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19236, flush 1, corrupt 0, gen 0
[66614.291571] blk_update_request: I/O error, dev loop0, sector 1029568
[66614.291573] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19237, flush 1, corrupt 0, gen 0
[66614.291594] blk_update_request: I/O error, dev loop0, sector 3126720
[66614.291595] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19238, flush 1, corrupt 0, gen 0
[66614.291673] blk_update_request: I/O error, dev loop0, sector 1029568
[66614.291677] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19239, flush 1, corrupt 0, gen 0
[66614.291691] blk_update_request: I/O error, dev loop0, sector 3126720
[66614.291693] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19240, flush 1, corrupt 0, gen 0
[66614.326048] aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
[66615.291837] blk_update_request: I/O error, dev loop0, sector 1029568
[66615.291842] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19241, flush 1, corrupt 0, gen 0
[66615.291861] blk_update_request: I/O error, dev loop0, sector 3126720
[66615.291863] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19242, flush 1, corrupt 0, gen 0
[66615.291968] blk_update_request: I/O error, dev loop0, sector 1029568
[66615.291970] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19243, flush 1, corrupt 0, gen 0
[66615.292228] blk_update_request: I/O error, dev loop0, sector 3126720
[66615.292230] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19244, flush 1, corrupt 0, gen 0
[66615.350036] aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
[66616.310040] aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
[66617.334057] aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
[66618.358064] aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
[66619.294418] blk_update_request: 20 callbacks suppressed
[66619.294420] blk_update_request: I/O error, dev loop0, sector 1029568
[66619.294422] btrfs_dev_stat_print_on_error: 20 callbacks suppressed
[66619.294425] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19265, flush 1, corrupt 0, gen 0
[66619.294451] blk_update_request: I/O error, dev loop0, sector 3126720
[66619.294453] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19266, flush 1, corrupt 0, gen 0
[66619.294569] blk_update_request: I/O error, dev loop0, sector 1029568
[66619.294572] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19267, flush 1, corrupt 0, gen 0
[66619.294593] blk_update_request: I/O error, dev loop0, sector 3126720
[66619.294595] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19268, flush 1, corrupt 0, gen 0
[66619.294668] blk_update_request: I/O error, dev loop0, sector 1029568
[66619.294670] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19269, flush 1, corrupt 0, gen 0
[66619.294682] blk_update_request: I/O error, dev loop0, sector 3126720
[66619.294684] BTRFS error (device loop0): bdev /dev/loop0 errs: wr 1, rd 19270, flush 1, corrupt 0, gen 0
[66619.318066] aacraid 0000:08:0e.0: AAC0:aac_check_health: Host adapter dead -1
for the system, i'm using HP Microserver G8 with Adaptec 3405. disks are in Volume (JBOD) mode.
Please help
[SOLVED] LZMA Data Corrupt.
in General Support
Posted
Thanks a lot @trurl, this solved my problem.