Locking up when writing to certain Disks


Recommended Posts

Help Please :)

 

I am copying files from Vista to Unraid version: 4.2.1

 

When i copy to disk 4 or 5 (they are on completely different IDE cables, btw) the up to be copied (Vista shows "Calculating Time..) and copies over one folder then locks up.

 

It locks up the whole server and i have to hard reset it (shutting it down via web is FUBARd too.)

 

Sometime if I am lucky, I can reboot and copy to Disk 5. Disk 4 gives me no love at all.

 

I have attached the error log I received when on one of my bad copies to 4. Can someone give it a gander and see if anythings jumps out?

 

Much Appreciated,

 

Mark

 

Edit: NOT letting me attach the 14kb text log file so it is below:

 

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.791896]  [<c017755f>] reiserfs_read_bitmap_block+0xb0/0xba

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.792005]  [<c0175ea7>] scan_bitmap_block+0x63/0x22c

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.792113]  [<c01762d8>] scan_bitmap+0x1a2/0x1fb

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.792220]  [<c01772ed>] reiserfs_allocate_blocknrs+0x2cc/0x3d

2

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.792330]  [<c01846dc>] get_empty_nodes+0xc3/0x14e

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.792438]  [<c0283501>] ip_output+0x12a/0x204

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.792547]  [<c0283501>] ip_output+0x12a/0x204

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.792654]  [<c018620d>] fix_nodes+0x168/0x31a

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.792760]  [<c018ff14>] reiserfs_paste_into_item+0xc6/0x15a

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.792881]  [<c0283501>] ip_output+0x12a/0x204

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.792987]  [<c01371a3>] __kzalloc+0xb/0x32

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.793092]  [<c017df41>] reiserfs_get_block+0xe36/0xfe1

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.793203]  [<c0283501>] ip_output+0x12a/0x204

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.793310]  [<c02838cd>] ip_queue_xmit+0x2f2/0x32c

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.793417]  [<c0132e94>] __alloc_pages+0x49/0x295

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.793526]  [<c013b669>] find_mergeable_anon_vma+0x60/0xb4

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.793635]  [<c0139e8c>] do_anonymous_page+0xa8/0x110

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.793743]  [<c01c050e>] copy_to_user+0x27/0x2f

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.793853]  [<c01605c3>] __block_prepare_write+0x188/0x42c

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.793961]  [<c0132e2f>] get_page_from_freelist+0x8a/0xa6

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.794071]  [<c0160fda>] block_prepare_write+0x21/0x2e

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.794179]  [<c017d10b>] reiserfs_get_block+0x0/0xfe1

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.794285]  [<c0180340>] reiserfs_prepare_write+0xc0/0x117

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.794392]  [<c017d10b>] reiserfs_get_block+0x0/0xfe1

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.794498]  [<c012f775>] find_or_create_page+0x5c/0x7d

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.794606]  [<c0160c92>] __generic_cont_expand+0xa3/0xfe

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.794713]  [<c0160d1e>] generic_cont_expand+0x31/0x37

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.794820]  [<c0180904>] reiserfs_setattr+0x5d/0x16d

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.794926]  [<c01302f3>] filemap_nopage+0x153/0x250

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.795032]  [<c01c050e>] copy_to_user+0x27/0x2f

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.795139]  [<c01566f7>] notify_change+0xf4/0x209

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.795247]  [<c0145ac5>] do_truncate+0x58/0x6e

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.795355]  [<c0145dcc>] do_sys_ftruncate+0x14e/0x167

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.795463]  [<c0145e28>] sys_ftruncate64+0x19/0x1b

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.795569]  [<c0103a62>] syscall_call+0x7/0xb

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.795677]  =======================

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.795748] Code: 40 74 d2 b9 1f 00 00 00 0f a3 0a 19 c0 85 c0

75 0f 8d 04 0f 66 89 03 0f b7 43 02 40 66 89 43 02 49 79 e5 eb b0 66 83 3b 00 75

04 <0f> 0b eb fe 5b 5e 5f c3 55 89 d1 57 89 c7 56 53 83 ec 10 8b 58

 

Message from syslogd@Tower at Mon Jan  7 23:01:18 2008 ...

Tower kernel: [  397.797917] EIP: [<c01774a7>] reiserfs_cache_bitmap_metadata+0x

74/0x7c SS:ESP 0068:d4757844

Jan  7 23:01:18 Tower kernel: [  397.789321] ------------[ cut here ]-----------

-

Jan  7 23:01:18 Tower kernel: [  397.789401] kernel BUG at fs/reiserfs/bitmap.c:

1287!

Jan  7 23:01:18 Tower kernel: [  397.789476] invalid opcode: 0000 [#1]

Jan  7 23:01:18 Tower kernel: [  397.789549] Modules linked in: md_mod fuse ide_

disk ata_piix libata pdc202xx_new piix ide_core e1000

Jan  7 23:01:18 Tower kernel: [  397.789976] CPU:    0

Jan  7 23:01:18 Tower kernel: [  397.789977] EIP:    0060:[<c01774a7>]    Not ta

inted VLI

Jan  7 23:01:18 Tower kernel: [  397.789978] EFLAGS: 00010246  (2.6.22.5 #9)

Jan  7 23:01:18 Tower kernel: [  397.790199] EIP is at reiserfs_cache_bitmap_met

adata+0x74/0x7c

Jan  7 23:01:18 Tower kernel: [  397.790276] eax: d3323000  ebx: e09a4d84  ecx

: ffffffff  edx: d3322ffc

Jan  7 23:01:18 Tower kernel: [  397.790353] esi: d32fd6d8  edi: 00000000  ebp

: e09a4d84  esp: d4757844

Jan  7 23:01:18 Tower kernel: [  397.790430] ds: 007b  es: 007b  fs: 0000  gs:

0033  ss: 0068

Jan  7 23:01:18 Tower kernel: [  397.790506] Process smbd (pid: 1325, ti=d475600

0 task=c14f7070 task.ti=d4756000)

Jan  7 23:01:18 Tower kernel: [  397.790582] Stack: d32fd6d8 05b08000 dfc00a00 c

017755f 00001000 72000001 d47578f8 00000001

Jan  7 23:01:18 Tower kernel: [  397.790975]        00000b61 d47578d4 00000e8e d

fc00a00 c0175ea7 de665f64 00000000 04040404

Jan  7 23:01:18 Tower kernel: [  397.791367]        00000000 e09a4d84 00000b61 d

e5d11c0 542e1a94 3e846bff 00000b61 dfc00a00

Jan  7 23:01:18 Tower kernel: [  397.791759] Call Trace:

Jan  7 23:01:18 Tower kernel: [  397.791896]  [<c017755f>] reiserfs_read_bitmap_

block+0xb0/0xba

Jan  7 23:01:18 Tower kernel: [  397.792005]  [<c0175ea7>] scan_bitmap_block+0x6

3/0x22c

Jan  7 23:01:18 Tower kernel: [  397.792113]  [<c01762d8>] scan_bitmap+0x1a2/0x1

fb

Jan  7 23:01:18 Tower kernel: [  397.792220]  [<c01772ed>] reiserfs_allocate_blo

cknrs+0x2cc/0x3d2

Jan  7 23:01:18 Tower kernel: [  397.792330]  [<c01846dc>] get_empty_nodes+0xc3/

0x14e

Jan  7 23:01:18 Tower kernel: [  397.792438]  [<c0283501>] ip_output+0x12a/0x204

Jan  7 23:01:18 Tower kernel: [  397.792547]  [<c0283501>] ip_output+0x12a/0x204

Jan  7 23:01:18 Tower kernel: [  397.792654]  [<c018620d>] fix_nodes+0x168/0x31a

Jan  7 23:01:18 Tower kernel: [  397.792760]  [<c018ff14>] reiserfs_paste_into_i

tem+0xc6/0x15a

Jan  7 23:01:18 Tower kernel: [  397.792881]  [<c0283501>] ip_output+0x12a/0x204

Jan  7 23:01:18 Tower kernel: [  397.792987]  [<c01371a3>] __kzalloc+0xb/0x32

Jan  7 23:01:18 Tower kernel: [  397.793092]  [<c017df41>] reiserfs_get_block+0x

e36/0xfe1

Jan  7 23:01:18 Tower kernel: [  397.793203]  [<c0283501>] ip_output+0x12a/0x204

Jan  7 23:01:18 Tower kernel: [  397.793310]  [<c02838cd>] ip_queue_xmit+0x2f2/0

x32c

Jan  7 23:01:18 Tower kernel: [  397.793417]  [<c0132e94>] __alloc_pages+0x49/0x

295

Jan  7 23:01:18 Tower kernel: [  397.793526]  [<c013b669>] find_mergeable_anon_v

ma+0x60/0xb4

Jan  7 23:01:18 Tower kernel: [  397.793635]  [<c0139e8c>] do_anonymous_page+0xa

8/0x110

Jan  7 23:01:18 Tower kernel: [  397.793743]  [<c01c050e>] copy_to_user+0x27/0x2

f

Jan  7 23:01:18 Tower kernel: [  397.793853]  [<c01605c3>] __block_prepare_write

+0x188/0x42c

Jan  7 23:01:18 Tower kernel: [  397.793961]  [<c0132e2f>] get_page_from_freelis

t+0x8a/0xa6

Jan  7 23:01:18 Tower kernel: [  397.794071]  [<c0160fda>] block_prepare_write+0

x21/0x2e

Jan  7 23:01:18 Tower kernel: [  397.794179]  [<c017d10b>] reiserfs_get_block+0x

0/0xfe1

Jan  7 23:01:18 Tower kernel: [  397.794285]  [<c0180340>] reiserfs_prepare_writ

e+0xc0/0x117

Jan  7 23:01:18 Tower kernel: [  397.794392]  [<c017d10b>] reiserfs_get_block+0x

0/0xfe1

Jan  7 23:01:18 Tower kernel: [  397.794498]  [<c012f775>] find_or_create_page+0

x5c/0x7d

Jan  7 23:01:18 Tower kernel: [  397.794606]  [<c0160c92>] __generic_cont_expand

+0xa3/0xfe

Jan  7 23:01:18 Tower kernel: [  397.794713]  [<c0160d1e>] generic_cont_expand+0

x31/0x37

Jan  7 23:01:18 Tower kernel: [  397.794820]  [<c0180904>] reiserfs_setattr+0x5d

/0x16d

Jan  7 23:01:18 Tower kernel: [  397.794926]  [<c01302f3>] filemap_nopage+0x153/

0x250

Jan  7 23:01:18 Tower kernel: [  397.795032]  [<c01c050e>] copy_to_user+0x27/0x2

f

Jan  7 23:01:18 Tower kernel: [  397.795139]  [<c01566f7>] notify_change+0xf4/0x

209

Jan  7 23:01:18 Tower kernel: [  397.795247]  [<c0145ac5>] do_truncate+0x58/0x6e

Jan  7 23:01:18 Tower kernel: [  397.795355]  [<c0145dcc>] do_sys_ftruncate+0x14

e/0x167

Jan  7 23:01:18 Tower kernel: [  397.795463]  [<c0145e28>] sys_ftruncate64+0x19/

0x1b

Jan  7 23:01:18 Tower kernel: [  397.795569]  [<c0103a62>] syscall_call+0x7/0xb

Jan  7 23:01:18 Tower kernel: [  397.795677]  =======================

Jan  7 23:01:18 Tower kernel: [  397.795748] Code: 40 74 d2 b9 1f 00 00 00 0f a3

0a 19 c0 85 c0 75 0f 8d 04 0f 66 89 03 0f b7 43 02 40 66 89 43 02 49 79 e5 eb b

0 66 83 3b 00 75 04 <0f> 0b eb fe 5b 5e 5f c3 55 89 d1 57 89 c7 56 53 83 ec 10 8

b 58

Jan  7 23:01:18 Tower kernel: [  397.797917] EIP: [<c01774a7>] reiserfs_cache_bi

tmap_metadata+0x74/0x7c SS:ESP 0068:d4757844

Jan  7 23:01:18 Tower kernel: [  397.798104] WARNING: at kernel/exit.c:869 do_ex

it()

Jan  7 23:01:18 Tower kernel: [  397.798178]  [<c011592c>] do_exit+0x41/0x300

Jan  7 23:01:18 Tower kernel: [  397.798285]  [<c010434b>] die+0x188/0x190

Jan  7 23:01:18 Tower kernel: [  397.798390]  [<c01045c5>] do_invalid_op+0x0/0x8

a

Jan  7 23:01:18 Tower kernel: [  397.798495]  [<c0104646>] do_invalid_op+0x81/0x

8a

Jan  7 23:01:18 Tower kernel: [  397.798601]  [<c01774a7>] reiserfs_cache_bitmap

_metadata+0x74/0x7c

Jan  7 23:01:18 Tower kernel: [  397.798710]  [<c02b4c16>] io_schedule+0xe/0x16

Jan  7 23:01:18 Tower kernel: [  397.798817]  [<c02b4d3a>] __wait_on_bit+0x4a/0x

51

Jan  7 23:01:18 Tower kernel: [  397.798923]  [<c02b4daf>] out_of_line_wait_on_b

it+0x6e/0x76

Jan  7 23:01:18 Tower kernel: [  397.799030]  [<c015f04b>] sync_buffer+0x0/0x2e

Jan  7 23:01:18 Tower kernel: [  397.799135]  [<c01215e5>] wake_bit_function+0x0

/0x3c

Jan  7 23:01:18 Tower kernel: [  397.799243]  [<c02b5532>] error_code+0x6a/0x70

Jan  7 23:01:18 Tower kernel: [  397.799350]  [<c01774a7>] reiserfs_cache_bitmap

_metadata+0x74/0x7c

Jan  7 23:01:18 Tower kernel: [  397.799458]  [<c017755f>] reiserfs_read_bitmap_

block+0xb0/0xba

Jan  7 23:01:18 Tower kernel: [  397.799567]  [<c0175ea7>] scan_bitmap_block+0x6

3/0x22c

Jan  7 23:01:18 Tower kernel: [  397.799675]  [<c01762d8>] scan_bitmap+0x1a2/0x1

fb

Jan  7 23:01:18 Tower kernel: [  397.799782]  [<c01772ed>] reiserfs_allocate_blo

cknrs+0x2cc/0x3d2

Jan  7 23:01:18 Tower kernel: [  397.799892]  [<c01846dc>] get_empty_nodes+0xc3/

0x14e

Jan  7 23:01:18 Tower kernel: [  397.799999]  [<c0283501>] ip_output+0x12a/0x204

Jan  7 23:01:18 Tower kernel: [  397.800106]  [<c0283501>] ip_output+0x12a/0x204

Jan  7 23:01:18 Tower kernel: [  397.800213]  [<c018620d>] fix_nodes+0x168/0x31a

Jan  7 23:01:18 Tower kernel: [  397.800320]  [<c018ff14>] reiserfs_paste_into_i

tem+0xc6/0x15a

Jan  7 23:01:18 Tower kernel: [  397.800441]  [<c0283501>] ip_output+0x12a/0x204

Jan  7 23:01:18 Tower kernel: [  397.800552]  [<c01371a3>] __kzalloc+0xb/0x32

Jan  7 23:01:18 Tower kernel: [  397.800658]  [<c017df41>] reiserfs_get_block+0x

e36/0xfe1

Jan  7 23:01:18 Tower kernel: [  397.800769]  [<c0283501>] ip_output+0x12a/0x204

Jan  7 23:01:18 Tower kernel: [  397.800876]  [<c02838cd>] ip_queue_xmit+0x2f2/0

x32c

Jan  7 23:01:18 Tower kernel: [  397.800983]  [<c0132e94>] __alloc_pages+0x49/0x

295

Jan  7 23:01:18 Tower kernel: [  397.801090]  [<c013b669>] find_mergeable_anon_v

ma+0x60/0xb4

Jan  7 23:01:18 Tower kernel: [  397.801199]  [<c0139e8c>] do_anonymous_page+0xa

8/0x110

Jan  7 23:01:18 Tower kernel: [  397.801309]  [<c01c050e>] copy_to_user+0x27/0x2

f

Jan  7 23:01:18 Tower kernel: [  397.801418]  [<c01605c3>] __block_prepare_write

+0x188/0x42c

Jan  7 23:01:18 Tower kernel: [  397.801525]  [<c0132e2f>] get_page_from_freelis

t+0x8a/0xa6

Jan  7 23:01:18 Tower kernel: [  397.801634]  [<c0160fda>] block_prepare_write+0

x21/0x2e

Jan  7 23:01:18 Tower kernel: [  397.801739]  [<c017d10b>] reiserfs_get_block+0x

0/0xfe1

Jan  7 23:01:18 Tower kernel: [  397.801846]  [<c0180340>] reiserfs_prepare_writ

e+0xc0/0x117

Jan  7 23:01:18 Tower kernel: [  397.801952]  [<c017d10b>] reiserfs_get_block+0x

0/0xfe1

Jan  7 23:01:18 Tower kernel: [  397.802058]  [<c012f775>] find_or_create_page+0

x5c/0x7d

Jan  7 23:01:18 Tower kernel: [  397.802165]  [<c0160c92>] __generic_cont_expand

+0xa3/0xfe

Jan  7 23:01:18 Tower kernel: [  397.802272]  [<c0160d1e>] generic_cont_expand+0

x31/0x37

Jan  7 23:01:18 Tower kernel: [  397.802378]  [<c0180904>] reiserfs_setattr+0x5d

/0x16d

Jan  7 23:01:18 Tower kernel: [  397.802483]  [<c01302f3>] filemap_nopage+0x153/

0x250

Jan  7 23:01:18 Tower kernel: [  397.802589]  [<c01c050e>] copy_to_user+0x27/0x2

f

Jan  7 23:01:18 Tower kernel: [  397.802695]  [<c01566f7>] notify_change+0xf4/0x

209

Jan  7 23:01:18 Tower kernel: [  397.802802]  [<c0145ac5>] do_truncate+0x58/0x6e

Jan  7 23:01:18 Tower kernel: [  397.802909]  [<c0145dcc>] do_sys_ftruncate+0x14

e/0x167

Jan  7 23:01:18 Tower kernel: [  397.803016]  [<c0145e28>] sys_ftruncate64+0x19/

0x1b

Jan  7 23:01:18 Tower kernel: [  397.803123]  [<c0103a62>] syscall_call+0x7/0xb

Jan  7 23:01:18 Tower kernel: [  397.803229]  =======================

 

Link to comment

daquint - do you remember which version of unRAID was used to initially format those disks?  I think this problem can only potentially happen on a disk which was formatted with unRAID release prior to 4.0-beta1, and then system is shut down/crashes without doing a Stop first.

 

This problem is not "supposed" to happen - the whole point of using a journaling file system, such as reiserfs, is to avoid this kind of problem.  But I think what happened is that some change was made to the reiserfs file system between linux 2.4 kernel and 2.6 kernel, which introduced this problem.  unRAID OS moved to linux 2.6 kernel starting with 4.0-beta1.

 

I can't yet prove this is the case, but the original reiserfs developer may not have been able to sign-off these changes, due to being in jail on suspicion of murder  :o  (Or perhaps was just distracted??)

Link to comment

This particuliar disk (disk4) - the original 400GB i had in there had been formatted with Unraid 3.0

 

When the 400GB (disk4) started having issues, I upgraded to a 500GB drive. Trying different things, I may have upgraded (to 4.0) right before or right after the drive switch - I can't remember...

 

When the server reformatted the new drive and built from parity, could the new 500GB inherited the issue from the 400GB?

 

Ultimately the reiserfsck fixed it, so far so good.

 

Now I am wondering if that 400GB that I thought was bad is perfectly fine. I will have to put it in and see if it works.

 

Thanks!

Link to comment
  • 3 weeks later...
Ultimately the reiserfsck fixed it, so far so good.

 

Now I am wondering if that 400GB that I thought was bad is perfectly fine. I will have to put it in and see if it works.

 

Let us know if the 400 gig drive worked. Would be nice to know if the file system was the problem.

 

Phil

Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.