Mike Bruns
Hi all,
Could someone help me interpret this zpool error after a scrub? The pool is functional but reports a corruption error in one non-critical file. It's a new FreeNAS install.
My config: Dell PowerEdge T110 II server, 16GB ECC RAM, 5x 6TB drives, RAIDZ2, current 9.3.1 stable software.
Note: One of the drives (ada4) shows a smartctl "In_the_past" failure flag on Raw_Read_Error_Rate but appears fine now. I'm waiting to get the RMA replacement and will replace the questionable drive after burn-in. Is it better to run a RAIDZ2 with one questionable drive, or to remove the questionable drive and run a RAIDZ1?
Code:
########## ZPool status report summary for all pools ##########
+--------------+--------+------+------+------+----+--------+------+-----+
|Pool Name     |Status  |Read  |Write |Cksum |Used|Scrub   |Scrub |Last |
|              |        |Errors|Errors|Errors|    |Repaired|Errors|Scrub|
|              |        |      |      |      |    |Bytes   |      |Age  |
+--------------+--------+------+------+------+----+--------+------+-----+
|freenas-boot  |ONLINE  |     0|     0|     0|  7%|       0|     0|    1|
|fullvolume   !|ONLINE  |     0|     0|    28| 30%|     84K|     7|    0|
+--------------+--------+------+------+------+----+--------+------+-----+

########## ZPool status report for freenas-boot ##########
  pool: freenas-boot
 state: ONLINE
  scan: scrub repaired 0 in 0h1m with 0 errors on Wed Dec 30 02:24:26 2015
config:

        NAME                                            STATE     READ WRITE CKSUM
        freenas-boot                                    ONLINE       0     0     0
          mirror-0                                      ONLINE       0     0     0
            gptid/c97814ea-a44e-11e5-b1d0-f8db88ffc155  ONLINE       0     0     0
            gptid/c9a28995-a44e-11e5-b1d0-f8db88ffc155  ONLINE       0     0     0

errors: No known data errors

########## ZPool status report for fullvolume ##########
  pool: fullvolume
 state: ONLINE
status: One or more devices has experienced an error resulting in data
        corruption. Applications may be affected.
action: Restore the file in question if possible. Otherwise restore the
        entire pool from backup.
   see: http://illumos.org/msg/ZFS-8000-8A
  scan: scrub repaired 84K in 6h47m with 7 errors on Wed Dec 30 08:48:23 2015
config:

        NAME                                            STATE     READ WRITE CKSUM
        fullvolume                                      ONLINE       0     0     7
          raidz2-0                                      ONLINE       0     0    14
            gptid/89d13f2b-aba0-11e5-be4a-f8db88ffc155  ONLINE       0     0     1
            gptid/8a83b206-aba0-11e5-be4a-f8db88ffc155  ONLINE       0     0     0
            gptid/8b271ed3-aba0-11e5-be4a-f8db88ffc155  ONLINE       0     0     1
            gptid/8bd76287-aba0-11e5-be4a-f8db88ffc155  ONLINE       0     0     3
            gptid/8c8582df-aba0-11e5-be4a-f8db88ffc155  ONLINE       0     0     2

errors: Permanent errors have been detected in the following files:

        /var/db/system/cores/python2.7.core

=========================

########## SMART status report summary for all drives ##########
+------+---------------+----+-----+-----+-----+-------+-------+--------+------+------+------+-------+----+
|Device|Serial         |Temp|Power|Start|Spin |ReAlloc|Current|Offline |UDMA  |Seek  |High  |Command|Last|
|      |               |    |On   |Stop |Retry|Sectors|Pending|Uncorrec|CRC   |Errors|Fly   |Timeout|Test|
|      |               |    |Hours|Count|Count|       |Sectors|Sectors |Errors|      |Writes|Count  |Age |
+------+---------------+----+-----+-----+-----+-------+-------+--------+------+------+------+-------+----+
|ada0 ?|WOL240326574   | 35 |  141|    8|    0|      0|      0|       0|     0|   N/A|   N/A|    N/A|   5|
|ada1 ?|WOL240327198   | 35 |  284|   12|    0|      0|      0|       0|     0|   N/A|   N/A|    N/A|   5|
|ada2 ?|WOL240327200   | 35 |  284|   12|    0|      0|      0|       0|     0|   N/A|   N/A|    N/A|   5|
|ada3 ?|WOL240327207   | 34 |  284|   12|    0|      0|      0|       0|     0|   N/A|   N/A|    N/A|   5|
|ada4 ?|WOL240327210   | 36 |  265|   14|    0|      0|      0|       0|     0|   N/A|   N/A|    N/A|   5|
+------+---------------+----+-----+-----+-----+-------+-------+--------+------+------+------+-------+----+

########## SMART status report for ada0 drive (: WOL240326574) ##########
SMART overall-health self-assessment test result: PASSED

ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0027   201   197   021    Pre-fail  Always       -       8941
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       8
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   100   253   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       141
 10 Spin_Retry_Count        0x0032   100   253   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       8
 16 Unknown_Attribute       0x0022   000   200   000    Old_age   Always       -       17197295533
183 Runtime_Bad_Block       0x0032   100   100   000    Old_age   Always       -       0
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       4
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       3
194 Temperature_Celsius     0x0022   117   112   000    Old_age   Always       -       35
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0

No Errors Logged

Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
Extended offline    Completed without error       00%        12         -

########## SMART status report for ada1 drive (: WOL240327198) ##########
SMART overall-health self-assessment test result: PASSED

ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0027   211   207   021    Pre-fail  Always       -       8433
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       12
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   100   253   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       284
 10 Spin_Retry_Count        0x0032   100   253   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       12
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       7
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       63
194 Temperature_Celsius     0x0022   117   111   000    Old_age   Always       -       35
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0

No Errors Logged

Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
Conveyance offline  Completed without error       00%       164         -

########## SMART status report for ada2 drive (: WOL240327200) ##########
SMART overall-health self-assessment test result: PASSED

ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0027   210   206   021    Pre-fail  Always       -       8483
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       12
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   100   253   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       284
 10 Spin_Retry_Count        0x0032   100   253   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       12
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       7
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       61
194 Temperature_Celsius     0x0022   117   109   000    Old_age   Always       -       35
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0

No Errors Logged

Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
Conveyance offline  Completed without error       00%       164         -

########## SMART status report for ada3 drive (: WOL240327207) ##########
SMART overall-health self-assessment test result: PASSED

ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0027   209   206   021    Pre-fail  Always       -       8508
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       12
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   100   253   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       284
 10 Spin_Retry_Count        0x0032   100   253   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       12
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       7
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       62
194 Temperature_Celsius     0x0022   118   112   000    Old_age   Always       -       34
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0

No Errors Logged

Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
Conveyance offline  Completed without error       00%       164         -

########## SMART status report for ada4 drive (: WOL240327210) ##########
SMART overall-health self-assessment test result: PASSED
See vendor-specific Attribute list for marginal Attributes.

ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   001   051    Pre-fail  Always   In_the_past 0
  3 Spin_Up_Time            0x0027   209   206   021    Pre-fail  Always       -       8533
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       14
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   100   253   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       265
 10 Spin_Retry_Count        0x0032   100   253   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       10
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       6
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       21
194 Temperature_Celsius     0x0022   116   110   000    Old_age   Always       -       36
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   198   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   100   253   000    Old_age   Offline      -       0

No Errors Logged

Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
Short offline       Completed without error       00%       147         -
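In case it helps anyone read the report faster, here is a rough Python sketch that pulls the non-zero CKSUM counts out of the config section of `zpool status` output. The function name `cksum_errors` and the `SAMPLE` text are my own illustration (the rows are copied from the fullvolume section of the report), and the pattern assumes the counters are plain integers; it is not part of FreeNAS or the report script.

```python
import re

# Device rows copied from the fullvolume config section of the report.
SAMPLE = """\
        NAME                                            STATE     READ WRITE CKSUM
        fullvolume                                      ONLINE       0     0     7
          raidz2-0                                      ONLINE       0     0    14
            gptid/89d13f2b-aba0-11e5-be4a-f8db88ffc155  ONLINE       0     0     1
            gptid/8a83b206-aba0-11e5-be4a-f8db88ffc155  ONLINE       0     0     0
            gptid/8b271ed3-aba0-11e5-be4a-f8db88ffc155  ONLINE       0     0     1
            gptid/8bd76287-aba0-11e5-be4a-f8db88ffc155  ONLINE       0     0     3
            gptid/8c8582df-aba0-11e5-be4a-f8db88ffc155  ONLINE       0     0     2
"""

# Assumption: each row is "name state read write cksum" with integer counters.
# Abbreviated counters (for example "1.2K") would not match and are skipped.
ROW = re.compile(
    r"^\s*(\S+)\s+(ONLINE|DEGRADED|FAULTED|OFFLINE|UNAVAIL|REMOVED)"
    r"\s+(\d+)\s+(\d+)\s+(\d+)\s*$"
)


def cksum_errors(status_text):
    """Return {name: checksum_count} for every row with a non-zero CKSUM."""
    found = {}
    for line in status_text.splitlines():
        match = ROW.match(line)
        if match and int(match.group(5)) > 0:
            found[match.group(1)] = int(match.group(5))
    return found


if __name__ == "__main__":
    for name, count in cksum_errors(SAMPLE).items():
        print(f"{name}: {count} checksum errors")
```

Run against the rows above, this lists the pool, the raidz2-0 vdev, and four of the five member disks, so the checksum errors are spread across several drives rather than confined to the one with the SMART flag.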