June 11, 2014

Hi folks,

Hopefully someone will be able to help with, or confirm, my issue.

I'm running unRAID 5.0.5, and during large copies to one of my disks I get a system hang. It took me a while to work out what was causing the hang. I had only just started using CouchPotato, so I thought that was the problem and stopped using it. It turned out that a move operation performed by CouchPotato was triggering the hang, not CouchPotato itself.

Once I found that out (just yesterday), I was able to recreate it, and today I managed to catch the last dying events from my syslog:

Jun 11 10:35:37 Media emhttp: shcmd (41): /usr/local/sbin/emhttp_event disks_mounted
Jun 11 10:35:37 Media emhttp_event: disks_mounted
Jun 11 10:35:37 Media rc.swapfile[22620]: Swap file plugin update during array mount is disabled
Jun 11 10:35:37 Media rc.swapfile[22621]: Starting swap file during array mount ...
Jun 11 10:35:43 Media kernel: Adding 2097148k swap on /mnt/cache/.custom/swapfile/swapfile. Priority:-1 extents:563 across:2145304k
Jun 11 10:35:43 Media rc.swapfile[22627]: Swap file /mnt/cache/.custom/swapfile/swapfile re-used and started
Jun 11 10:35:43 Media emhttp: shcmd (42): :>/etc/samba/smb-shares.conf
Jun 11 10:35:43 Media avahi-daemon[22456]: Files changed, reloading.
Jun 11 10:35:43 Media emhttp: Restart SMB...
Jun 11 10:35:43 Media emhttp: shcmd (43): killall -HUP smbd
Jun 11 10:35:43 Media emhttp: shcmd (44): cp /etc/avahi/services/smb.service- /etc/avahi/services/smb.service
Jun 11 10:35:43 Media avahi-daemon[22456]: Files changed, reloading.
Jun 11 10:35:43 Media avahi-daemon[22456]: Service group file /services/smb.service changed, reloading.
Jun 11 10:35:43 Media emhttp: shcmd (45): ps axc | grep -q rpc.mountd
Jun 11 10:35:43 Media emhttp: _shcmd: shcmd (45): exit status: 1
Jun 11 10:35:43 Media emhttp: shcmd (46): /usr/local/sbin/emhttp_event svcs_restarted
Jun 11 10:35:43 Media emhttp_event: svcs_restarted
Jun 11 10:35:43 Media emhttp: shcmd (47): /usr/local/sbin/emhttp_event started
Jun 11 10:35:43 Media emhttp_event: started
Jun 11 10:35:44 Media avahi-daemon[22456]: Service "Media" (/services/smb.service) successfully established.
Jun 11 10:36:44 Media in.telnetd[22658]: connect from 192.168.1.24 (192.168.1.24)
Jun 11 10:36:48 Media login[22659]: ROOT LOGIN on '/dev/pts/1' from '192.168.1.24'
Jun 11 10:45:32 Media kernel: ata8.00: exception Emask 0x52 SAct 0x0 SErr 0xffffffff action 0xe frozen
Jun 11 10:45:32 Media kernel: ata8: SError: { RecovData RecovComm UnrecovData Persist Proto HostInt PHYRdyChg PHYInt CommWake 10B8B Dispar BadCRC Handshk LinkSeq TrStaTrns UnrecFIS DevExch }
Jun 11 10:45:32 Media kernel: ata8.00: failed command: READ DMA
Jun 11 10:45:32 Media kernel: ata8.00: cmd c8/00:00:6f:fe:44/00:00:00:00:00/e3 tag 0 dma 131072 in
Jun 11 10:45:32 Media kernel: res 40/00:00:01:4f:c2/00:00:00:00:00/00 Emask 0x56 (ATA bus error)
Jun 11 10:45:32 Media kernel: ata8.00: status: { DRDY }
Jun 11 10:45:32 Media kernel: ata8: hard resetting link

Note: I log in at 10:30 and kick off the copy (about 20-30 GB, from my cache drive to /mnt/disk5).

I swapped the disk yesterday for another 1TB WD EADS (known good), so I'm thinking it's the expansion card (StarTech PEXSATA22i). Funnily enough, I don't think I've experienced this on the other disk hanging off this card.

I ran memtest for 12 or so passes without error the first time it happened, so I'm happy with my memory.

Just wondering if anyone can make out anything from the syslog snippet above. I'll have another complete syslog from the following reboot shortly.

Thanks
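For anyone wanting to pull just the ATA errors out of a long syslog like the one above, here is a minimal Python sketch. It is only an illustration: the function name `summarize_ata_errors` and the regular expressions are my own, written against the line format shown in this thread, and the sample lines are copied from the excerpts posted here. It assumes Python 3 is available on whatever machine you read the log on.

```python
import re
from collections import defaultdict

# Matches kernel lines of the form seen in this thread, e.g.
# "... kernel: ata8.00: exception Emask 0x52 SAct 0x0 SErr 0xffffffff action 0xe frozen"
EXCEPTION_RE = re.compile(
    r"kernel: (?P<port>ata\d+)(?:\.\d+)?: exception "
    r"Emask (?P<emask>0x[0-9a-f]+) SAct (?P<sact>0x[0-9a-f]+) "
    r"SErr (?P<serr>0x[0-9a-f]+)"
)
# Matches "... kernel: ata8.00: failed command: READ DMA"
FAILED_CMD_RE = re.compile(
    r"kernel: (?P<port>ata\d+)(?:\.\d+)?: failed command: (?P<cmd>.+)$"
)


def summarize_ata_errors(lines):
    """Group libata exception lines by port (ata7, ata8, ...)."""
    summary = defaultdict(
        lambda: {"exceptions": 0, "serr": set(), "commands": set()}
    )
    for line in lines:
        line = line.rstrip("\n")
        match = EXCEPTION_RE.search(line)
        if match:
            entry = summary[match.group("port")]
            entry["exceptions"] += 1
            entry["serr"].add(match.group("serr"))
            continue
        match = FAILED_CMD_RE.search(line)
        if match:
            summary[match.group("port")]["commands"].add(match.group("cmd").strip())
    return dict(summary)


# Sample lines copied from the syslog excerpts in this thread.
SAMPLE = [
    "Jun 11 10:45:32 Media kernel: ata8.00: exception Emask 0x52 SAct 0x0 SErr 0xffffffff action 0xe frozen",
    "Jun 11 10:45:32 Media kernel: ata8.00: failed command: READ DMA",
    "Jun 12 00:46:04 Media kernel: ata7.00: exception Emask 0x52 SAct 0x0 SErr 0xffffffff action 0xe frozen",
    "Jun 12 00:46:04 Media kernel: ata7.00: failed command: WRITE DMA EXT",
    "Jun 12 00:46:04 Media kernel: ata8.00: exception Emask 0x52 SAct 0x0 SErr 0xffffffff action 0xe frozen",
]

if __name__ == "__main__":
    for port, info in sorted(summarize_ata_errors(SAMPLE).items()):
        print(port, info["exceptions"], sorted(info["serr"]), sorted(info["commands"]))
```

To run it on a real log, pass an open file instead of `SAMPLE`, for example `summarize_ata_errors(open("syslog"))`, where the file name is whatever your saved syslog is called.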
June 11, 2014 (Author)

Syslog attached. The array is not started yet (I set it not to autostart while I was poking around). I can start it and attach a new syslog if needed; just let me know.

Thanks

syslog-2014-06-11.zip
June 11, 2014 (Author)

New cable, and the problem seemingly persists:

Jun 12 00:32:52 Media login[32246]: ROOT LOGIN on '/dev/pts/1' from '192.168.1.24'
Jun 12 00:46:04 Media kernel: ata7.00: exception Emask 0x52 SAct 0x0 SErr 0xffffffff action 0xe frozen
Jun 12 00:46:04 Media kernel: ata7: SError: { RecovData RecovComm UnrecovData Persist Proto HostInt PHYRdyChg PHYInt CommWake 10B8B Dispar BadCRC Handshk LinkSeq TrStaTrns UnrecFIS DevExch }
Jun 12 00:46:04 Media kernel: ata7.00: failed command: WRITE DMA EXT
Jun 12 00:46:04 Media kernel: ata7.00: cmd 35/00:00:cf:dc:6b/00:04:17:00:00/e0 tag 0 dma 524288 out
Jun 12 00:46:04 Media kernel: res 40/00:00:01:4f:c2/00:00:00:00:00/00 Emask 0x56 (ATA bus error)
Jun 12 00:46:04 Media kernel: ata7.00: status: { DRDY }
Jun 12 00:46:04 Media kernel: ata7: hard resetting link
Jun 12 00:46:04 Media kernel: ata8.00: exception Emask 0x52 SAct 0x0 SErr 0xffffffff action 0xe frozen
June 12, 2014 (Author)

A quick Google search for the 'Emask 0x52' error turns up the following thread:

http://lime-technology.com/forum/index.php?topic=4345.0

It seemed to point to a problematic controller.

I found it strange that the new log snippet shows errors on both ata7 and ata8, which I presume are the two ports on that card. I didn't see that in the previous snippet.

Also, the errors only appear after about 10 minutes of constant heavy copying. It hasn't bombed out during a parity check at all (touch wood).
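As a side note on reading these lines: the `SErr` value is a bitmask, and the names the kernel prints inside `SError: { ... }` correspond to individual bits. The sketch below decodes a value into those names. The bit positions are my understanding of the SATA SError register layout as used by Linux libata, so treat the table as an assumption and check it against the kernel source before relying on it; `decode_serr` and `all_bits_set` are names I made up for the example.

```python
# Bit positions are an assumption based on the SATA SError register layout
# as used by Linux libata; the names are the ones printed in the syslog.
SERR_BITS = [
    (0, "RecovData"), (1, "RecovComm"),
    (8, "UnrecovData"), (9, "Persist"), (10, "Proto"), (11, "HostInt"),
    (16, "PHYRdyChg"), (17, "PHYInt"), (18, "CommWake"), (19, "10B8B"),
    (20, "Dispar"), (21, "BadCRC"), (22, "Handshk"), (23, "LinkSeq"),
    (24, "TrStaTrns"), (25, "UnrecFIS"), (26, "DevExch"),
]


def decode_serr(value):
    """Return the SError flag names whose bits are set in value."""
    return [name for bit, name in SERR_BITS if value & (1 << bit)]


def all_bits_set(value):
    """True when the 32-bit register read back as all ones."""
    return value == 0xFFFFFFFF


if __name__ == "__main__":
    serr = int("0xffffffff", 16)  # value taken from the syslog lines above
    print(decode_serr(serr))
    print(all_bits_set(serr))
```

With `0xffffffff` every flag is reported at once, which matches the full list in the log. A register that reads back as all ones is often a sign that the device stopped answering on the bus rather than that every individual error really occurred, which would be consistent with suspecting the controller, although the log alone does not prove it.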
June 13, 2014 (Author)

I needed more space and expandability anyway, so I have a new M1015 on the way from Amazon.

I might try a large continuous copy to the other drive on the StarTech card and see if the problem is there too. It's possible I've never done large continuous copies to it, only smaller 200-300 MB ones, and not large Blu-ray rips like the ones that have been triggering the crash.

If that works fine, and given that someone mentioned earlier that the cabling might be at fault, could it point to the physical port on the card being questionable?
June 13, 2014

Yes.
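The sustained copy test proposed above could be scripted roughly as follows, so that the same load can be repeated against each drive on the card. This is a sketch under assumptions: it needs Python 3 on the server, `sustained_write` is a hypothetical helper name, and the target path in the usage line is only an example that must be replaced with the mount point of the disk being tested. It writes one large file in fixed-size chunks and forces it to disk at the end.

```python
import os
import time


def sustained_write(path, total_bytes, chunk_bytes=4 * 1024 * 1024):
    """Write total_bytes of random data to path in chunk_bytes pieces.

    Returns (bytes_written, elapsed_seconds).
    """
    chunk = os.urandom(chunk_bytes)  # one random buffer, reused for every write
    written = 0
    start = time.monotonic()
    with open(path, "wb") as handle:
        while written < total_bytes:
            size = min(chunk_bytes, total_bytes - written)
            handle.write(chunk[:size])
            written += size
        handle.flush()
        os.fsync(handle.fileno())  # make sure the data actually reaches the disk
    return written, time.monotonic() - start


if __name__ == "__main__":
    # Hypothetical target: replace with a file on the disk under test.
    target = "/mnt/disk5/write_test.bin"
    size = 20 * 1024 ** 3  # roughly the 20-30 GB copy size mentioned in the thread
    done, seconds = sustained_write(target, size)
    print(f"wrote {done} bytes in {seconds:.1f} s")
```

Watching the syslog while this runs (and deleting the test file afterwards) should show whether the errors follow the port, the drive, or the card as a whole.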
June 16, 2014 (Author)

I hit a bit of a hurdle last night, sadly.

The M1015 arrived and I got it flashed to IT mode, but unfortunately my motherboard doesn't like it in its x16 slot. I think the board treats it as a graphics card: it just beeps (1 long beep, 2 short ones) and doesn't boot.

It's an MSI KA780GM, and I also flashed it to the latest BIOS last night.

Funnily enough, the x1 card (the one this M1015 was supposed to replace) seems to work in the x16 slot, so it might be something I missed in the BIOS, but I don't think so.

I'm at a bit of a loss as to what to do next. Cutting the back out of the x1 slot isn't an option because of how close the northbridge heatsink is.
June 16, 2014

Check for a BIOS setting "detect non-compliant PCIe card" for your x16 slot, then see if changing it makes a difference. But your motherboard may expect anything larger than PCIe x1 to be a graphics card.
June 18, 2014 (Author)

I had a good look through the BIOS, googling any acronym I didn't recognise, and sadly there's nothing there. It might just be as you say.

I actually found a good deal on an ASRock C224, so I'm thinking of retiring this server to backup status.

A crash during a parity check (no large copies) has not helped my confidence in the setup. I wanted to expand to a 12-drive setup, so this will just move my plans up a bit.

For anyone else reading: even if this does resolve it, don't follow my lead. Just try another x1 2-port SATA card first.
This topic is now archived and is closed to further replies.