Hello everyone,
I have been getting a warning that my pool is degraded. It does say scrub have repaired 531M. Now my question is should I be worried or replace the disk "Right Now". I have a fairly good backup system in place with continuous backup to external disk and offsite backup via Crashplan. So, in case I lose the pool I can still restore from my backup.
My Configs are as follows:
HP Gen8 Microserver with E31265L CPU
16GB ECC RAM
4 X 3tb HDD in RAIDZ1 (Two WD Red, One HGST NAS, One Seagate NAS) bought at different time.
To be honest, I am pretty lost on the smart data. When does smart gives an error saying smart have failed ? As I can see with the Seagate drive there are a lot of read errors still smart report says these have passed. Am I missing something. I would really appreciate your input.
I have been getting a warning that my pool is degraded. It does say scrub have repaired 531M. Now my question is should I be worried or replace the disk "Right Now". I have a fairly good backup system in place with continuous backup to external disk and offsite backup via Crashplan. So, in case I lose the pool I can still restore from my backup.
My Configs are as follows:
HP Gen8 Microserver with E31265L CPU
16GB ECC RAM
4 X 3tb HDD in RAIDZ1 (Two WD Red, One HGST NAS, One Seagate NAS) bought at different time.
Code:
[root@freenas] ~# zpool status pool: HomeStorage state: DEGRADED status: One or more devices has experienced an unrecoverable error. An attempt was made to correct the error. Applications are unaffected. action: Determine if the device needs to be replaced, and clear the errors using 'zpool clear' or replace the device with 'zpool replace'. see: http://illumos.org/msg/ZFS-8000-9P scan: scrub repaired 531M in 7h22m with 0 errors on Fri Sep 2 03:34:08 2016 config: NAME STATE READ WRITE CKSUM HomeStorage DEGRADED 0 0 0 raidz1-0 DEGRADED 0 0 0 gptid/db896513-1f57-11e6-a31c-d0bf9c467bac ONLINE 0 0 0 gptid/dbec92cb-1f57-11e6-a31c-d0bf9c467bac ONLINE 0 0 0 gptid/dc4e1836-1f57-11e6-a31c-d0bf9c467bac ONLINE 0 0 0 gptid/dca39d2a-1f57-11e6-a31c-d0bf9c467bac DEGRADED 0 0 13.0K too many errors errors: No known data errors pool: SSD state: ONLINE scan: scrub repaired 0 in 0h5m with 0 errors on Mon Aug 29 23:44:46 2016 config: NAME STATE READ WRITE CKSUM SSD ONLINE 0 0 0 gptid/f5ea8d9f-1f57-11e6-a31c-d0bf9c467bac ONLINE 0 0 0 errors: No known data errors pool: freenas-boot state: ONLINE scan: scrub repaired 0 in 0h6m with 0 errors on Wed Aug 31 03:51:31 2016 config: NAME STATE READ WRITE CKSUM freenas-boot ONLINE 0 0 0 mirror-0 ONLINE 0 0 0 da0p2 ONLINE 0 0 0 da1p2 ONLINE 0 0 0 errors: No known data errors [root@freenas] ~#
Code:
[root@freenas] ~# camcontrol devlist <ST3000VN000-1HJ166 SC60> at scbus0 target 0 lun 0 (pass0,ada0) <Hitachi HDS723030ALA640 MKAOAA10> at scbus1 target 0 lun 0 (pass1,ada1) <ST3000VN000-1H4167 SC44> at scbus2 target 0 lun 0 (pass2,ada2) <WDC WD30EFRX-68EUZN0 82.00A82> at scbus3 target 0 lun 0 (pass3,ada3) <KINGSTON SV300S37A120G 603ABBF0> at scbus4 target 0 lun 0 (pass4,ada4) < Patriot Memory PMAP> at scbus7 target 0 lun 0 (pass5,da0) <Kingston DataTraveler 3.0 PMAP> at scbus8 target 0 lun 0 (pass6,da1) [root@freenas]
Code:
[root@freenas] ~# smartctl -A /dev/ada0 smartctl 6.5 2016-05-07 r4318 [FreeBSD 10.3-STABLE amd64] (local build) Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org === START OF READ SMART DATA SECTION === SMART Attributes Data Structure revision number: 10 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000f 114 099 006 Pre-fail Always - 61596504 3 Spin_Up_Time 0x0003 094 094 000 Pre-fail Always - 0 4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 46 5 Reallocated_Sector_Ct 0x0033 100 100 010 Pre-fail Always - 0 7 Seek_Error_Rate 0x000f 083 060 030 Pre-fail Always - 217889256 9 Power_On_Hours 0x0032 091 091 000 Old_age Always - 8606 10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 46 184 End-to-End_Error 0x0032 100 100 099 Old_age Always - 0 187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always - 0 188 Command_Timeout 0x0032 100 100 000 Old_age Always - 0 189 High_Fly_Writes 0x003a 001 001 000 Old_age Always - 122 190 Airflow_Temperature_Cel 0x0022 078 062 045 Old_age Always - 22 (Min/Max 22/29) 191 G-Sense_Error_Rate 0x0032 100 100 000 Old_age Always - 0 192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 34 193 Load_Cycle_Count 0x0032 100 100 000 Old_age Always - 385 194 Temperature_Celsius 0x0022 022 040 000 Old_age Always - 22 (0 10 0 0 0) 197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 0 [root@freenas] ~# smartctl -A /dev/ada1 smartctl 6.5 2016-05-07 r4318 [FreeBSD 10.3-STABLE amd64] (local build) Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org === START OF READ SMART DATA SECTION === SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000b 100 100 016 Pre-fail Always - 0 2 Throughput_Performance 0x0005 136 136 054 Pre-fail Offline - 81 3 Spin_Up_Time 0x0007 156 156 024 Pre-fail Always - 468 (Average 525) 4 Start_Stop_Count 0x0012 099 099 000 Old_age Always - 4919 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail Always - 0 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail Always - 0 8 Seek_Time_Performance 0x0005 123 123 020 Pre-fail Offline - 31 9 Power_On_Hours 0x0012 096 096 000 Old_age Always - 29715 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 134 192 Power-Off_Retract_Count 0x0032 081 081 000 Old_age Always - 22831 193 Load_Cycle_Count 0x0012 081 081 000 Old_age Always - 22831 194 Temperature_Celsius 0x0002 222 222 000 Old_age Always - 27 (Min/Max 13/58) 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 0 197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age Always - 0 [root@freenas] ~# smartctl -A /dev/ada2 smartctl 6.5 2016-05-07 r4318 [FreeBSD 10.3-STABLE amd64] (local build) Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org === START OF READ SMART DATA SECTION === SMART Attributes Data Structure revision number: 10 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000f 117 099 006 Pre-fail Always - 148047224 3 Spin_Up_Time 0x0003 092 092 000 Pre-fail Always - 0 4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 380 5 Reallocated_Sector_Ct 0x0033 100 100 010 Pre-fail Always - 0 7 Seek_Error_Rate 0x000f 084 060 030 Pre-fail Always - 268329967 9 Power_On_Hours 0x0032 080 080 000 Old_age Always - 18055 10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 77 184 End-to-End_Error 0x0032 100 100 099 Old_age Always - 0 187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always - 0 188 Command_Timeout 0x0032 100 100 000 Old_age Always - 0 189 High_Fly_Writes 0x003a 100 100 000 Old_age Always - 0 190 Airflow_Temperature_Cel 0x0022 078 050 045 Old_age Always - 22 (Min/Max 21/29) 191 G-Sense_Error_Rate 0x0032 100 100 000 Old_age Always - 0 192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 35 193 Load_Cycle_Count 0x0032 100 100 000 Old_age Always - 382 194 Temperature_Celsius 0x0022 022 050 000 Old_age Always - 22 (0 13 0 0 0) 197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 0 [root@freenas] ~# smartctl -A /dev/ada3 smartctl 6.5 2016-05-07 r4318 [FreeBSD 10.3-STABLE amd64] (local build) Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org === START OF READ SMART DATA SECTION === SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0 3 Spin_Up_Time 0x0027 100 253 021 Pre-fail Always - 0 4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 4 5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0 7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0 9 Power_On_Hours 0x0032 097 097 000 Old_age Always - 2516 10 Spin_Retry_Count 0x0032 100 253 000 Old_age Always - 0 11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 4 192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 1 193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 2849 194 Temperature_Celsius 0x0022 129 121 000 Old_age Always - 21 196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0 197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0030 100 253 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0 200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 0 [root@freenas] ~# smartctl -A /dev/ada4 smartctl 6.5 2016-05-07 r4318 [FreeBSD 10.3-STABLE amd64] (local build) Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org === START OF READ SMART DATA SECTION === SMART Attributes Data Structure revision number: 10 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x0032 095 095 050 Old_age Always - 0/3321286 5 Retired_Block_Count 0x0033 100 100 003 Pre-fail Always - 0 9 Power_On_Hours_and_Msec 0x0032 098 098 000 Old_age Always - 2517h+09m+50.380s 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 7 171 Program_Fail_Count 0x000a 100 100 000 Old_age Always - 0 172 Erase_Fail_Count 0x0032 100 100 000 Old_age Always - 0 174 Unexpect_Power_Loss_Ct 0x0030 000 000 000 Old_age Offline - 7 177 Wear_Range_Delta 0x0000 000 000 000 Old_age Offline - 1 181 Program_Fail_Count 0x000a 100 100 000 Old_age Always - 0 182 Erase_Fail_Count 0x0032 100 100 000 Old_age Always - 0 187 Reported_Uncorrect 0x0012 100 100 000 Old_age Always - 0 189 Airflow_Temperature_Cel 0x0000 017 028 000 Old_age Offline - 17 (Min/Max 9/28) 194 Temperature_Celsius 0x0022 017 028 000 Old_age Always - 17 (Min/Max 9/28) 195 ECC_Uncorr_Error_Count 0x001c 120 120 000 Old_age Offline - 0/3321286 196 Reallocated_Event_Count 0x0033 100 100 003 Pre-fail Always - 0 201 Unc_Soft_Read_Err_Rate 0x001c 120 120 000 Old_age Offline - 0/3321286 204 Soft_ECC_Correct_Rate 0x001c 120 120 000 Old_age Offline - 0/3321286 230 Life_Curve_Status 0x0013 100 100 000 Pre-fail Always - 100 231 SSD_Life_Left 0x0000 100 100 011 Old_age Offline - 4294967296 233 SandForce_Internal 0x0032 000 000 000 Old_age Always - 6782 234 SandForce_Internal 0x0032 000 000 000 Old_age Always - 6260 241 Lifetime_Writes_GiB 0x0032 000 000 000 Old_age Always - 6260 242 Lifetime_Reads_GiB 0x0032 000 000 000 Old_age Always - 1333 244 Unknown_Attribute 0x0000 098 098 010 Old_age Offline - 4980800 [root@freenas] ~#
Code:
[root@freenas] ~# glabel status Name Status Components gptid/db896513-1f57-11e6-a31c-d0bf9c467bac N/A ada0p2 gptid/dbec92cb-1f57-11e6-a31c-d0bf9c467bac N/A ada1p2 gptid/dc4e1836-1f57-11e6-a31c-d0bf9c467bac N/A ada2p2 gptid/dca39d2a-1f57-11e6-a31c-d0bf9c467bac N/A ada3p2 gptid/f5ea8d9f-1f57-11e6-a31c-d0bf9c467bac N/A ada4p2 gptid/d720c229-1f3f-11e6-8269-d0bf9c467bac N/A da0p1 gptid/ac55d980-4cd4-11e6-adb2-d0bf9c467bac N/A da1p1 [root@freenas] ~#
To be honest, I am pretty lost on the smart data. When does smart gives an error saying smart have failed ? As I can see with the Seagate drive there are a lot of read errors still smart report says these have passed. Am I missing something. I would really appreciate your input.
Last edited: