February 23, 2011
Before upgrading, I had two drives: a 500GB Seagate IDE and a 1.5TB Western Digital Green EADS, with no parity. Recently I installed a third drive, a Western Digital 1TB Caviar Black. Everything worked fine with just the two drives. Since installing the third, I'm suddenly getting a bunch of errors on the 1.5TB Green drive. I can still navigate through its folder structure and see all the files, but I can't open any of them or write anything new without Windows yelling at me about network problems. The other drive and the new one both work just fine.

My hardware:
Intel P4 2.53GHz
1GB RAM (I think)
PCI SATA card with SIL3124 controller
NVidia GeForce4 MX 440

Syslog is attached (syslog.txt). Any ideas?
February 23, 2011
It's likely that you loosened one of the cables to the EADS drive. Re-seat the cables and try again.
February 24, 2011 (Author)
Funnily enough, when I swapped the SATA cable for a brand new one, the disk was not recognized by anything at all (not even the BIOS screen). I put the old cable back in, and it was the same as before. I switched it to a different SATA port on the card, and unRAID booted up a lot faster, but the same problem is still happening.
February 24, 2011
Try reiserfsck --check /dev/mdX, where X = 1, 2, 3, ..., after stopping Samba and unmounting the disk being checked.
February 25, 2011 (Author)
That command does not want to run for some reason. It outputs:

reiserfs_open: the reiserfs superblock cannot be found on /dev/sdb.
Failed to open the filesystem.

It does this for any disk I try to check, not just the one acting up. This is very odd.
February 25, 2011
There will NEVER be a superblock on /dev/sdb. You typed the wrong device name. The reiserfs is on the FIRST partition on the disk. To check it you would typically access it through the /dev/mdX device (where md1 = disk1 in your array, md2 = disk2, etc.). To access the raw disk directly, without regard for parity, you must use the device name with a trailing "1", which refers to the first partition:

reiserfsck --check /dev/sdb1

Joe L.
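The naming rule in the reply above can be sketched as two small helpers. The function names md_device and raw_partition are made up for illustration and are not part of unRAID or reiserfsprogs; the sketch only builds device paths and does not run any check.

```shell
#!/bin/sh
# Hypothetical helper: map an array slot number to the parity-aware
# md device (disk2 -> /dev/md2).
md_device() {
    printf '/dev/md%s\n' "$1"
}

# Hypothetical helper: map a whole-disk name (sdb or /dev/sdb) to its
# first partition, which is where the ReiserFS filesystem lives.
# Checking this path bypasses parity.
raw_partition() {
    printf '/dev/%s1\n' "${1#/dev/}"
}

md_device 2          # prints /dev/md2
raw_partition sdb    # prints /dev/sdb1

# The check itself would then look like, for example:
#   reiserfsck --check "$(md_device 2)"
```

With a parity drive assigned, the md path is the one to use; the raw partition path only makes sense on an array without parity, as in this thread.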
February 26, 2011 (Author)
Thanks for the help. I ran the check, and it looks like it fixed a few very minor things. But the problem I was having before remains.
February 26, 2011 (Author)
I should have clarified. I ran the check, and then applied the fix that was recommended in the output. After rebooting and running the check again, I get the following.

########### reiserfsck --check started at Fri Feb 25 17:01:42 2011 ###########
Replaying journal..
Trans replayed: mountid 33, transid 43321, desc 3904, len 1, commit 3906, next trans offset 3889
Trans replayed: mountid 33, transid 43322, desc 3907, len 1, commit 3909, next trans offset 3892
Trans replayed: mountid 33, transid 43323, desc 3910, len 11, commit 3922, next trans offset 3905
Trans replayed: mountid 33, transid 43324, desc 3923, len 6, commit 3930, next trans offset 3913
Trans replayed: mountid 33, transid 43325, desc 3931, len 10, commit 3942, next trans offset 3925
Trans replayed: mountid 33, transid 43326, desc 3943, len 7, commit 3951, next trans offset 3934
Trans replayed: mountid 33, transid 43327, desc 3952, len 125, commit 4078, next trans offset 4061
Trans replayed: mountid 33, transid 43328, desc 4079, len 125, commit 4205, next trans offset 4188
Trans replayed: mountid 33, transid 43329, desc 4206, len 116, commit 4323, next trans offset 4306
Reiserfs journal '/dev/sdb1' in blocks [18..8211]: 9 transactions replayed
Checking internal tree..finished
Comparing bitmaps..finished
Checking Semantic tree: finished
No corruptions found
There are on the filesystem:
    Leaves 362239
    Internal nodes 2147
    Directories 4
    Other files 4
    Data block pointers 366584348 (4656796 of them are zero)
    Safe links 0
########### reiserfsck finished at Fri Feb 25 17:05:34 2011 ###########
February 26, 2011
The journal is replaying. This happens after a dirty shutdown. How are you rebooting?
February 26, 2011
How are you rebooting? Via telnet? The powerdown add-in needs to be installed to shut down cleanly via telnet, unless you enter about 6 commands by hand.
February 26, 2011 (Author)
Via telnet I just used the "reboot" command. I've only ever done that once though, which did happen to be just before I ran that check. Every other reboot has been via the webui. Running the check after rebooting via the webui gets me this:

reiserfsck --check started at Sat Feb 26 13:36:55 2011 ###########
Replaying journal..
Trans replayed: mountid 34, transid 43336, desc 4342, len 1, commit 4344, next trans offset 4327
Trans replayed: mountid 34, transid 43337, desc 4345, len 1, commit 4347, next trans offset 4330
Reiserfs journal '/dev/sdb1' in blocks [18..8211]: 2 transactions replayed
Checking internal tree..finished
Comparing bitmaps..finished
Checking Semantic tree: finished
No corruptions found
There are on the filesystem:
    Leaves 362239
    Internal nodes 2147
    Directories 4
    Other files 4
    Data block pointers 366584348 (4656796 of them are zero)
    Safe links 0
########### reiserfsck finished at Sat Feb 26 13:40:46 2011 ###########
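Since journal replay was pointed out earlier in the thread as a sign of a dirty shutdown, it can help to count the replayed transactions in a saved reiserfsck log. The following is a minimal sketch that assumes the log uses the "Trans replayed:" line format shown in the output above; count_replayed is a made-up helper name.

```shell
#!/bin/sh
# Hypothetical helper: count "Trans replayed:" lines in reiserfsck
# output read from stdin. grep -c exits non-zero when the count is 0,
# so "|| true" keeps the function from reporting a failure in that case.
count_replayed() {
    grep -c 'Trans replayed:' || true
}

# Sample input copied from the second check in this thread.
count_replayed <<'EOF'
Replaying journal..
Trans replayed: mountid 34, transid 43336, desc 4342, len 1, commit 4344, next trans offset 4327
Trans replayed: mountid 34, transid 43337, desc 4345, len 1, commit 4347, next trans offset 4330
Reiserfs journal '/dev/sdb1' in blocks [18..8211]: 2 transactions replayed
EOF
# prints 2
```

The summary line ("2 transactions replayed") is deliberately not matched, so the count reflects only the individual replay entries.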
February 27, 2011
Don't just enter reboot on the terminal. Search the wiki for how to do a clean reboot via telnet. Did you do the reiserfsck corrections on sdX1? If so, your parity is out of sync and you need to rebuild parity. You should have used mdX, which would have kept parity in sync.

EDIT: Wait for Joe L. or another mod to comment before proceeding.
February 27, 2011 (Author)
I understand that now about rebooting via telnet. Again, I don't normally do it that way. I have no parity at the moment, so using the raw partition instead of mdX is not a concern. I will use mdX in the future though, noting this.
February 27, 2011
OK. Does the reiserfsck report clean file systems on all drives? Post SMART reports for the drives.
February 27, 2011
If you just type reboot via telnet without stopping the array first, you'll always face a file-system check upon restart and risk corruption of your data. You must stop the array first. Then you can type reboot.
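The "stop the array first" rule could be turned into a small guard to run before typing reboot. This is only a sketch: it assumes that a started array shows up as /dev/mdN entries in the mounts table and that a stopped array does not, and array_mounted is a made-up function name. It does not stop the array or reboot anything.

```shell
#!/bin/sh
# Hypothetical guard: succeeds (exit 0) if any /dev/mdN device appears
# in the given mounts table. Defaults to /proc/mounts when no file is
# passed. Assumption: a stopped array leaves no /dev/mdN mounted.
array_mounted() {
    grep -q '^/dev/md[0-9]' "${1:-/proc/mounts}"
}

# Demonstration against a made-up mounts table, not a live system.
tmp=$(mktemp)
printf '%s\n' '/dev/md1 /mnt/disk1 reiserfs rw 0 0' > "$tmp"
if array_mounted "$tmp"; then
    echo "array still mounted: stop it before rebooting"
else
    echo "no md devices mounted"
fi
rm -f "$tmp"
```

On a live server the function would be called without an argument so that it reads /proc/mounts.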
February 27, 2011 (Author)
The one time I did reboot via telnet like that was after the array was stopped anyway, so I don't think that is the problem.

SMART reports for the drives, as requested:

=== START OF INFORMATION SECTION ===
Model Family:     Seagate Barracuda 7200.10 family
Device Model:     ST3500630A
Serial Number:    9QG3JJ9F
Firmware Version: 3.AAF
User Capacity:    500,107,862,016 bytes
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   7
ATA Standard is:  Exact ATA specification draft version not indicated
Local Time is:    Sat Feb 26 21:32:09 2011 GMT+8
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
See vendor-specific Attribute list for marginal Attributes.

General SMART Values:
Offline data collection status: (0x82) Offline data collection activity was completed without error. Auto Offline Data Collection: Enabled.
Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run.
Total time to complete Offline data collection: ( 430) seconds.
Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported. General Purpose Logging supported.
Short self-test routine recommended polling time: ( 1) minutes.
Extended self-test routine recommended polling time: ( 163) minutes.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG   VALUE WORST THRESH TYPE     UPDATED WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f 113   096   006    Pre-fail Always  -           58077059
  3 Spin_Up_Time            0x0003 093   093   000    Pre-fail Always  -           0
  4 Start_Stop_Count        0x0032 100   100   020    Old_age  Always  -           222
  5 Reallocated_Sector_Ct   0x0033 100   100   036    Pre-fail Always  -           0
  7 Seek_Error_Rate         0x000f 060   048   030    Pre-fail Always  -           2745004395352
  9 Power_On_Hours          0x0032 084   084   000    Old_age  Always  -           14705
 10 Spin_Retry_Count        0x0013 100   100   097    Pre-fail Always  -           0
 12 Power_Cycle_Count       0x0032 100   100   020    Old_age  Always  -           126
187 Reported_Uncorrect      0x0032 100   100   000    Old_age  Always  -           0
189 High_Fly_Writes         0x003a 100   100   000    Old_age  Always  -           0
190 Airflow_Temperature_Cel 0x0022 077   037   045    Old_age  Always  In_the_past 23 (Lifetime Min/Max 23/23)
194 Temperature_Celsius     0x0022 023   063   000    Old_age  Always  -           23 (0 17 0 0)
195 Hardware_ECC_Recovered  0x001a 102   058   000    Old_age  Always  -           96258917
197 Current_Pending_Sector  0x0012 100   100   000    Old_age  Always  -           0
198 Offline_Uncorrectable   0x0010 100   100   000    Old_age  Offline -           0
199 UDMA_CRC_Error_Count    0x003e 200   200   000    Old_age  Always  -           0
200 Multi_Zone_Error_Rate   0x0000 100   253   000    Old_age  Offline -           0
202 TA_Increase_Count       0x0032 100   253   000    Old_age  Always  -           0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

=== START OF INFORMATION SECTION ===
Device Model:     WDC WD15EADS-00P8B0
Serial Number:    WD-WMAVU0585310
Firmware Version: 01.00A01
User Capacity:    1,500,301,910,016 bytes
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   8
ATA Standard is:  Exact ATA specification draft version not indicated
Local Time is:    Sat Feb 26 21:34:12 2011 GMT+8
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status: (0x84) Offline data collection activity was suspended by an interrupting command from host. Auto Offline Data Collection: Enabled.
Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run.
Total time to complete Offline data collection: (33000) seconds.
Offline data collection capabilities: (0x7b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. Conveyance Self-test supported. Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported. General Purpose Logging supported.
Short self-test routine recommended polling time: ( 2) minutes.
Extended self-test routine recommended polling time: ( 255) minutes.
Conveyance self-test routine recommended polling time: ( 5) minutes.
SCT capabilities: (0x3037) SCT Status supported. SCT Feature Control supported. SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG   VALUE WORST THRESH TYPE     UPDATED WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f 200   200   051    Pre-fail Always  -           0
  3 Spin_Up_Time            0x0027 178   177   021    Pre-fail Always  -           6100
  4 Start_Stop_Count        0x0032 100   100   000    Old_age  Always  -           159
  5 Reallocated_Sector_Ct   0x0033 200   200   140    Pre-fail Always  -           0
  7 Seek_Error_Rate         0x002e 200   200   000    Old_age  Always  -           0
  9 Power_On_Hours          0x0032 099   099   000    Old_age  Always  -           1057
 10 Spin_Retry_Count        0x0032 100   100   000    Old_age  Always  -           0
 11 Calibration_Retry_Count 0x0032 100   253   000    Old_age  Always  -           0
 12 Power_Cycle_Count       0x0032 100   100   000    Old_age  Always  -           41
192 Power-Off_Retract_Count 0x0032 200   200   000    Old_age  Always  -           24
193 Load_Cycle_Count        0x0032 200   200   000    Old_age  Always  -           1685
194 Temperature_Celsius     0x0022 128   119   000    Old_age  Always  -           22
196 Reallocated_Event_Count 0x0032 200   200   000    Old_age  Always  -           0
197 Current_Pending_Sector  0x0032 200   200   000    Old_age  Always  -           0
198 Offline_Uncorrectable   0x0030 200   200   000    Old_age  Offline -           0
199 UDMA_CRC_Error_Count    0x0032 200   200   000    Old_age  Always  -           6
200 Multi_Zone_Error_Rate   0x0008 200   200   000    Old_age  Offline -           0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
No self-tests have been logged. [To run self-tests, use: smartctl -t]

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

=== START OF INFORMATION SECTION ===
Device Model:     WDC WD1001FALS-00J7B1
Serial Number:    WD-WMATV2093574
Firmware Version: 05.00K05
User Capacity:    1,000,204,886,016 bytes
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   8
ATA Standard is:  Exact ATA specification draft version not indicated
Local Time is:    Sat Feb 26 21:35:49 2011 GMT+8
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status: (0x84) Offline data collection activity was suspended by an interrupting command from host. Auto Offline Data Collection: Enabled.
Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run.
Total time to complete Offline data collection: (18600) seconds.
Offline data collection capabilities: (0x7b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. Conveyance Self-test supported. Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported. General Purpose Logging supported.
Short self-test routine recommended polling time: ( 2) minutes.
Extended self-test routine recommended polling time: ( 214) minutes.
Conveyance self-test routine recommended polling time: ( 5) minutes.
SCT capabilities: (0x303f) SCT Status supported. SCT Feature Control supported. SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG   VALUE WORST THRESH TYPE     UPDATED WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f 200   200   051    Pre-fail Always  -           0
  3 Spin_Up_Time            0x0027 228   224   021    Pre-fail Always  -           8591
  4 Start_Stop_Count        0x0032 100   100   000    Old_age  Always  -           976
  5 Reallocated_Sector_Ct   0x0033 200   200   140    Pre-fail Always  -           0
  7 Seek_Error_Rate         0x002e 200   200   000    Old_age  Always  -           0
  9 Power_On_Hours          0x0032 084   084   000    Old_age  Always  -           12394
 10 Spin_Retry_Count        0x0032 100   100   000    Old_age  Always  -           0
 11 Calibration_Retry_Count 0x0032 100   100   000    Old_age  Always  -           0
 12 Power_Cycle_Count       0x0032 100   100   000    Old_age  Always  -           203
192 Power-Off_Retract_Count 0x0032 200   200   000    Old_age  Always  -           70
193 Load_Cycle_Count        0x0032 200   200   000    Old_age  Always  -           976
194 Temperature_Celsius     0x0022 128   098   000    Old_age  Always  -           22
196 Reallocated_Event_Count 0x0032 200   200   000    Old_age  Always  -           0
197 Current_Pending_Sector  0x0032 200   200   000    Old_age  Always  -           0
198 Offline_Uncorrectable   0x0030 200   200   000    Old_age  Offline -           0
199 UDMA_CRC_Error_Count    0x0032 200   200   000    Old_age  Always  -           0
200 Multi_Zone_Error_Rate   0x0008 200   200   000    Old_age  Offline -           0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
No self-tests have been logged. [To run self-tests, use: smartctl -t]

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

Reiserfsck reports the drives clean.
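One value that differs between the three reports is attribute 199, UDMA_CRC_Error_Count: its raw value is 6 on the WD15EADS and 0 on the other two drives. CRC errors of this kind are usually associated with the link between drive and controller (cable or port) rather than with the platters, though that is a general rule of thumb and not a diagnosis of this system. A minimal sketch for pulling that value out of attribute-table text follows; crc_errors is a made-up helper name, and the sample rows are copied from the WD15EADS report.

```shell
#!/bin/sh
# Hypothetical helper: print the RAW_VALUE (10th whitespace-separated
# column) of SMART attribute 199 from attribute-table text on stdin.
crc_errors() {
    awk '$1 == 199 { print $10 }'
}

# Sample rows taken from the WD15EADS report in this thread.
crc_errors <<'EOF'
197 Current_Pending_Sector  0x0032 200   200   000    Old_age  Always  -           0
199 UDMA_CRC_Error_Count    0x0032 200   200   000    Old_age  Always  -           6
200 Multi_Zone_Error_Rate   0x0008 200   200   000    Old_age  Offline -           0
EOF
# prints 6

# On a live system the input would come from something like:
#   smartctl -A /dev/sdX | crc_errors
```

Comparing the value before and after a file copy to the affected drive would show whether the count is still climbing.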