August 22, 2010

So today, out of the blue, one of my disks was disabled. I ran a reiserfsck command on the disk and got the following message:

Do you want to run this program?[N/Yes] (note need to type Yes if you do):Y
reiserfs_open: the reiserfs superblock cannot be found on /dev/sdi.
Failed to open the filesystem.

If the partition table has not been changed, and the partition is valid and it really contains a reiserfs partition, then the superblock is corrupted and you need to run this utility with --rebuild-sb.

I'm a little confused as to what to do next. Should I run the "--rebuild-sb" command? Am I going to lose data here?

Any help from the experts would be greatly appreciated.

Kent

Here is a piece of the syslog:

Aug 22 11:16:51 Tower kernel: REISERFS error (device md4): zam-7001 reiserfs_find_entry: io error
Aug 22 11:16:51 Tower kernel: REISERFS error (device md4): vs-13070 reiserfs_read_locked_inode: i/o failure occurred trying to find stat data of [49730 50098 0x0 SD]
Aug 22 11:17:04 Tower last message repeated 83031 times
Aug 22 11:17:04 Tower emhttp: disk_temperature: ioctl (smart_enable): Input/output error
Aug 22 11:17:04 Tower kernel: REISERFS error (device md4): vs-13070 reiserfs_read_locked_inode: i/o failure occurred trying to find stat data of [49730 50098 0x0 SD]
Aug 22 11:17:04 Tower last message repeated 256 times
Aug 22 11:17:04 Tower emhttp: disk_temperature: ioctl (smart_enable): Input/output error
Aug 22 11:17:04 Tower kernel: REISERFS error (device md4): vs-13070 reiserfs_read_locked_inode: i/o failure occurred trying to find stat data of [49730 50098 0x0 SD]
Aug 22 11:17:25 Tower last message repeated 140004 times
Aug 22 11:17:25 Tower emhttp: disk_temperature: ioctl (smart_enable): Input/output error
Aug 22 11:17:25 Tower kernel: REISERFS error (device md4): vs-13070 reiserfs_read_locked_inode: i/o failure occurred trying to find stat data of [49730 50098 0x0 SD]
Aug 22 11:17:26 Tower last message repeated 255 times
Aug 22 11:17:26 Tower emhttp: disk_temperature: ioctl (smart_enable): Input/output error
Aug 22 11:17:26 Tower kernel: REISERFS error (device md4): vs-13070 reiserfs_read_locked_inode: i/o failure occurred trying to find stat data of [49730 50098 0x0 SD]
Aug 22 11:17:57 Tower last message repeated 192138 times
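Editor's note: the "last message repeated N times" lines in the syslog excerpt above hide how many I/O errors were actually logged. The following is a minimal Python sketch, not part of the original post, that totals REISERFS errors per device from such an excerpt. The function name `count_reiserfs_errors` and the regular expressions are illustrative assumptions, and the sample text is just the first four lines of the excerpt. It assumes a repeat line always refers to the line immediately before it.

```python
import re

# First four lines of the syslog excerpt quoted in the post.
SYSLOG = """\
Aug 22 11:16:51 Tower kernel: REISERFS error (device md4): zam-7001 reiserfs_find_entry: io error
Aug 22 11:16:51 Tower kernel: REISERFS error (device md4): vs-13070 reiserfs_read_locked_inode: i/o failure occurred trying to find stat data of [49730 50098 0x0 SD]
Aug 22 11:17:04 Tower last message repeated 83031 times
Aug 22 11:17:04 Tower emhttp: disk_temperature: ioctl (smart_enable): Input/output error
"""

# Hypothetical patterns written for this excerpt only.
REISERFS_RE = re.compile(r"REISERFS error \(device (\w+)\)")
REPEAT_RE = re.compile(r"last message repeated (\d+) times")


def count_reiserfs_errors(text):
    """Return {device: total REISERFS error count}, expanding repeat lines."""
    counts = {}
    last_device = None  # device of the previous line, if it was a REISERFS error
    for line in text.splitlines():
        error = REISERFS_RE.search(line)
        if error:
            last_device = error.group(1)
            counts[last_device] = counts.get(last_device, 0) + 1
            continue
        repeat = REPEAT_RE.search(line)
        if repeat and last_device is not None:
            # The repeat line stands in for N more copies of the previous error.
            counts[last_device] += int(repeat.group(1))
            continue
        # Any other line (for example emhttp) breaks the repeat chain.
        last_device = None
    return counts


if __name__ == "__main__":
    print(count_reiserfs_errors(SYSLOG))
```

Running the same function over the full excerpt would give the total number of errors logged against md4 in that one-minute window.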
August 22, 2010 (Author)

Here is the SMART report for this drive; it looks OK, I think:

smartctl -a -d ata /dev/sdi (disk4)

smartctl version 5.38 [i486-slackware-linux-gnu] Copyright © 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF INFORMATION SECTION ===
Device Model:     WDC WD1001FALS-00J7B0
Serial Number:    WD-WMATV0331864
Firmware Version: 05.00K05
User Capacity:    1,000,204,886,016 bytes
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   8
ATA Standard is:  Exact ATA specification draft version not indicated
Local Time is:    Sun Aug 22 13:07:31 2010 MDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status: (0x84)
    Offline data collection activity was suspended by an interrupting command from host.
    Auto Offline Data Collection: Enabled.
Self-test execution status: ( 0)
    The previous self-test routine completed without error or no self-test has ever been run.
Total time to complete Offline data collection: (19200) seconds.
Offline data collection capabilities: (0x7b)
    SMART execute Offline immediate.
    Auto Offline data collection on/off support.
    Suspend Offline collection upon new command.
    Offline surface scan supported.
    Self-test supported.
    Conveyance Self-test supported.
    Selective Self-test supported.
SMART capabilities: (0x0003)
    Saves SMART data before entering power-saving mode.
    Supports SMART auto save timer.
Error logging capability: (0x01)
    Error logging supported.
    General Purpose Logging supported.
Short self-test routine recommended polling time: ( 2) minutes.
Extended self-test routine recommended polling time: ( 221) minutes.
Conveyance self-test routine recommended polling time: ( 5) minutes.
SCT capabilities: (0x303f)
    SCT Status supported.
    SCT Feature Control supported.
    SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG   VALUE WORST THRESH TYPE     UPDATED WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f 200   200   051    Pre-fail Always  -           0
  3 Spin_Up_Time            0x0027 253   253   021    Pre-fail Always  -           4433
  4 Start_Stop_Count        0x0032 098   098   000    Old_age  Always  -           2218
  5 Reallocated_Sector_Ct   0x0033 200   200   140    Pre-fail Always  -           0
  7 Seek_Error_Rate         0x002e 200   200   051    Old_age  Always  -           0
  9 Power_On_Hours          0x0032 082   082   000    Old_age  Always  -           13373
 10 Spin_Retry_Count        0x0032 100   100   051    Old_age  Always  -           0
 11 Calibration_Retry_Count 0x0032 100   100   051    Old_age  Always  -           0
 12 Power_Cycle_Count       0x0032 100   100   000    Old_age  Always  -           105
192 Power-Off_Retract_Count 0x0032 200   200   000    Old_age  Always  -           67
193 Load_Cycle_Count        0x0032 200   200   000    Old_age  Always  -           2218
194 Temperature_Celsius     0x0022 122   111   000    Old_age  Always  -           28
196 Reallocated_Event_Count 0x0032 200   200   000    Old_age  Always  -           0
197 Current_Pending_Sector  0x0032 200   200   000    Old_age  Always  -           0
198 Offline_Uncorrectable   0x0030 200   200   000    Old_age  Offline -           0
199 UDMA_CRC_Error_Count    0x0032 200   200   000    Old_age  Always  -           0
200 Multi_Zone_Error_Rate   0x0008 200   200   051    Old_age  Offline -           0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num Test_Description Status                  Remaining LifeTime(hours) LBA_of_first_error
# 1 Short offline    Completed without error 00%       13371           -

SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
   1       0       0 Not_testing
   2       0       0 Not_testing
   3       0       0 Not_testing
   4       0       0 Not_testing
   5       0       0 Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
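Editor's note: to back up the "looks OK" reading of the report above, here is a small Python sketch, not part of the original post, that parses attribute rows in the smartctl table layout and flags the ones that commonly indicate media or cabling trouble. The names `parse_rows`, `flag_attributes`, and the `WATCHED` set are hypothetical choices made for this illustration; the rows are copied from the report. It flags a watched attribute when its raw count is nonzero, and any attribute whose normalized VALUE has fallen to or below a nonzero THRESH.

```python
# Rows copied from the smartctl attribute table in the post.
SMART_ROWS = """\
  5 Reallocated_Sector_Ct   0x0033 200 200 140 Pre-fail Always  - 0
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age  Always  - 0
197 Current_Pending_Sector  0x0032 200 200 000 Old_age  Always  - 0
198 Offline_Uncorrectable   0x0030 200 200 000 Old_age  Offline - 0
199 UDMA_CRC_Error_Count    0x0032 200 200 000 Old_age  Always  - 0
"""

# Attributes whose raw count is expected to stay at zero on a healthy drive
# (an editorial selection, not something smartctl itself defines).
WATCHED = {
    "Reallocated_Sector_Ct",
    "Reallocated_Event_Count",
    "Current_Pending_Sector",
    "Offline_Uncorrectable",
    "UDMA_CRC_Error_Count",
}


def parse_rows(text):
    """Parse attribute rows into {name: {"value", "worst", "thresh", "raw"}}."""
    rows = {}
    for line in text.splitlines():
        fields = line.split()
        # A data row has at least 10 columns and starts with a numeric ID.
        if len(fields) < 10 or not fields[0].isdigit():
            continue
        rows[fields[1]] = {
            "value": int(fields[3]),
            "worst": int(fields[4]),
            "thresh": int(fields[5]),
            "raw": int(fields[9]),
        }
    return rows


def flag_attributes(rows):
    """Return names of attributes that look suspicious, in table order."""
    flagged = []
    for name, row in rows.items():
        nonzero_raw = name in WATCHED and row["raw"] != 0
        below_thresh = row["thresh"] > 0 and row["value"] <= row["thresh"]
        if nonzero_raw or below_thresh:
            flagged.append(name)
    return flagged


if __name__ == "__main__":
    print(flag_attributes(parse_rows(SMART_ROWS)))
```

For the rows in this report nothing is flagged, which matches the PASSED self-assessment and the empty error log.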
August 23, 2010 (Author)

Well, I tried again and was able to complete a ReiserFS check.

Do you want to run this program?[N/Yes] (note need to type Yes if you do):Yes
###########
reiserfsck --check started at Sun Aug 22 20:40:56 2010
###########
Replaying journal: Done.
Reiserfs journal '/dev/md4' in blocks [18..8211]: 0 transactions replayed
Checking internal tree.. finished
Comparing bitmaps..finished
Checking Semantic tree: finished
No corruptions found
There are on the filesystem:
    Leaves 240262
    Internal nodes 1518
    Directories 10281
    Other files 109577
    Data block pointers 230984973 (630 of them are zero)
    Safe links 0
###########
reiserfsck finished at Sun Aug 22 21:59:38 2010
###########

The drive passed a smartctl test and is now being rebuilt. Fingers crossed.
August 23, 2010

Quote: "The drive passed a smartctl test and is now being rebuilt. Fingers crossed."

Do you still have the syslog? If there were no SATA errors along with those I/O errors, I would run memtest to make sure memory is not an issue. The original I/O error looks like it happened when unRAID tried to read the disk temperature through ioctl while reading inode info.
August 24, 2010 (Author)

GK20,

Thanks for taking the time. memtest showed no issues and the drive rebuilt without problems. Looks like all is good again.

Kent