HDD Failed?

Status
Not open for further replies.

28061

Contributor
Joined
Oct 13, 2014
Messages
120
Evening all,

My FreeNas has been working great for about a year, then the other day I received a critical notification by email, saying my pool had been degraded. Here's what it says...
DiskStatus.jpg

I've done an extended SMART test and here's the output:
Code:

=== START OF INFORMATION SECTION ===
Model Family:  Western Digital Red
Device Model:  WDC WD30EFRX-68EUZN0
Serial Number:  WD-WCC4N3VP7J2P
LU WWN Device Id: 5 0014ee 20b09d03b
Firmware Version: 82.00A82
User Capacity:  3,000,592,982,016 bytes [3.00 TB]
Sector Sizes:  512 bytes logical, 4096 bytes physical
Rotation Rate:  5400 rpm
Device is:  In smartctl database [for details use: -P show]
ATA Version is:  ACS-2 (minor revision not indicated)
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:  Mon Feb  6 21:19:49 2017 GMT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
  was never started.
  Auto Offline Data Collection: Disabled.
Self-test execution status:  (  0) The previous self-test routine completed
  without error or no self-test has ever
  been run.
Total time to complete Offline
data collection:  (39540) seconds.
Offline data collection
capabilities:  (0x7b) SMART execute Offline immediate.
  Auto Offline data collection on/off support.
  Suspend Offline collection upon new
  command.
  Offline surface scan supported.
  Self-test supported.
  Conveyance Self-test supported.
  Selective Self-test supported.
SMART capabilities:  (0x0003) Saves SMART data before entering
  power-saving mode.
  Supports SMART auto save timer.
Error logging capability:  (0x01) Error logging supported.
  General Purpose Logging supported.
Short self-test routine
recommended polling time:  (  2) minutes.
Extended self-test routine
recommended polling time:  ( 397) minutes.
Conveyance self-test routine
recommended polling time:  (  5) minutes.
SCT capabilities:  (0x703d) SCT Status supported.
  SCT Error Recovery Control supported.
  SCT Feature Control supported.
  SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAG  VALUE WORST THRESH TYPE  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate  0x002f  200  200  051  Pre-fail  Always  -  0
  3 Spin_Up_Time  0x0027  178  177  021  Pre-fail  Always  -  6075
  4 Start_Stop_Count  0x0032  100  100  000  Old_age  Always  -  93
  5 Reallocated_Sector_Ct  0x0033  200  200  140  Pre-fail  Always  -  0
  7 Seek_Error_Rate  0x002e  200  200  000  Old_age  Always  -  0
  9 Power_On_Hours  0x0032  077  077  000  Old_age  Always  -  16970
 10 Spin_Retry_Count  0x0032  100  253  000  Old_age  Always  -  0
 11 Calibration_Retry_Count 0x0032  100  253  000  Old_age  Always  -  0
 12 Power_Cycle_Count  0x0032  100  100  000  Old_age  Always  -  93
192 Power-Off_Retract_Count 0x0032  200  200  000  Old_age  Always  -  91
193 Load_Cycle_Count  0x0032  200  200  000  Old_age  Always  -  164
194 Temperature_Celsius  0x0022  114  102  000  Old_age  Always  -  36
196 Reallocated_Event_Count 0x0032  200  200  000  Old_age  Always  -  0
197 Current_Pending_Sector  0x0032  200  200  000  Old_age  Always  -  0
198 Offline_Uncorrectable  0x0030  100  253  000  Old_age  Offline  -  0
199 UDMA_CRC_Error_Count  0x0032  200  200  000  Old_age  Always  -  0
200 Multi_Zone_Error_Rate  0x0008  200  200  000  Old_age  Offline  -  0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description  Status  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline  Completed without error  00%  16970  -
# 2  Short offline  Completed without error  00%  5244  -
# 3  Short offline  Completed without error  00%  5220  -
# 4  Short offline  Completed without error  00%  5196  -
# 5  Short offline  Completed without error  00%  5172  -
# 6  Short offline  Completed without error  00%  5148  -
# 7  Short offline  Interrupted (host reset)  90%  5124  -
# 8  Short offline  Completed without error  00%  5100  -
# 9  Short offline  Completed without error  00%  5076  -
#10  Short offline  Completed without error  00%  5057  -
#11  Short offline  Completed without error  00%  5033  -
#12  Short offline  Completed without error  00%  5009  -
#13  Short offline  Completed without error  00%  4985  -
#14  Short offline  Completed without error  00%  4961  -
#15  Short offline  Completed without error  00%  4937  -
#16  Short offline  Completed without error  00%  4913  -
#17  Short offline  Completed without error  00%  4889  -
#18  Extended offline  Completed without error  00%  4874  -
#19  Short offline  Completed without error  00%  4865  -
#20  Short offline  Completed without error  00%  4841  -
#21  Short offline  Interrupted (host reset)  90%  4818  -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
  1  0  0  Not_testing
  2  0  0  Not_testing
  3  0  0  Not_testing
  4  0  0  Not_testing
  5  0  0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

It's the correct serial number, ID's 5, 197 & 198 all score 0 and I've replaced the SATA cable with nil difference.

Can anyone help with what I do next?

Thank you very much
 

Ericloewe

Server Wrangler
Moderator
Joined
Feb 15, 2014
Messages
20,194
That specific drive is fine, could be the HBA causing trouble...
 

28061

Contributor
Joined
Oct 13, 2014
Messages
120
That specific drive is fine, could be the HBA causing trouble...
Well that's suck. It's integrated into the motherboard and it's only ?18months old. Can I remove the degraded drive and reinstate it?


Sent from my iPhone using Tapatalk
 

Ericloewe

Server Wrangler
Moderator
Joined
Feb 15, 2014
Messages
20,194
Well, let's approach this carefully. First of all, post your hardware specs so that everyone can see them (mobile users won't see signatures without jumping through hoops).
Next, grab us the output of smartctl -x /dev/daX for all drives.
We could also use the output of sas2flash -listall
 

28061

Contributor
Joined
Oct 13, 2014
Messages
120
Thank you

Hardware specs:
FreeNAS-9.10.2-U1 (86c7ef5)
Supermicro X10SL7-F with i3 4160 Haswell
6 x 3TB Western Digital Red's on RAIDZ2
16GB ECC Unbuffered RAM: Crucial CT4484984
3 x SanDisk Cruzer Fit 16GB mini mirrored boot

Output of Sas2Flash -ListAll
Code:
LSI Corporation SAS2 Flash Utility
Version 16.00.00.00 (2013.03.01)
Copyright (c) 2008-2013 LSI Corporation. All rights reserved

  Adapter Selected is a LSI SAS: SAS2308_1(D1)

Num  Ctlr  FW Ver  NVDATA  x86-BIOS  PCI Addr
----------------------------------------------------------------------------

0  SAS2308_1(D1)  20.00.02.00  14.01.00.14  07.39.00.00  00:02:00:00

  Finished Processing Commands Successfully.
  Exiting SAS2Flash.



da0
Code:
=== START OF INFORMATION SECTION ===
Model Family:  Western Digital Red
Device Model:  WDC WD30EFRX-68EUZN0
Serial Number:  WD-WCC4N6LTTNJH
LU WWN Device Id: 5 0014ee 2b5b4d60f
Firmware Version: 82.00A82
User Capacity:  3,000,592,982,016 bytes [3.00 TB]
Sector Sizes:  512 bytes logical, 4096 bytes physical
Rotation Rate:  5400 rpm
Device is:  In smartctl database [for details use: -P show]
ATA Version is:  ACS-2 (minor revision not indicated)
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:  Tue Feb  7 11:00:14 2017 GMT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
AAM feature is:  Unavailable
APM feature is:  Unavailable
Rd look-ahead is: Enabled
Write cache is:  Enabled
ATA Security is:  Disabled, NOT FROZEN [SEC1]
Wt Cache Reorder: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
  was never started.
  Auto Offline Data Collection: Disabled.
Self-test execution status:  (  0) The previous self-test routine completed
  without error or no self-test has ever
  been run.
Total time to complete Offline
data collection:  (38580) seconds.
Offline data collection
capabilities:  (0x7b) SMART execute Offline immediate.
  Auto Offline data collection on/off support.
  Suspend Offline collection upon new
  command.
  Offline surface scan supported.
  Self-test supported.
  Conveyance Self-test supported.
  Selective Self-test supported.
SMART capabilities:  (0x0003) Saves SMART data before entering
  power-saving mode.
  Supports SMART auto save timer.
Error logging capability:  (0x01) Error logging supported.
  General Purpose Logging supported.
Short self-test routine
recommended polling time:  (  2) minutes.
Extended self-test routine
recommended polling time:  ( 387) minutes.
Conveyance self-test routine
recommended polling time:  (  5) minutes.
SCT capabilities:  (0x703d) SCT Status supported.
  SCT Error Recovery Control supported.
  SCT Feature Control supported.
  SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAGS  VALUE WORST THRESH FAIL RAW_VALUE
  1 Raw_Read_Error_Rate  POSR-K  200  200  051  -  0
  3 Spin_Up_Time  POS--K  178  178  021  -  6058
  4 Start_Stop_Count  -O--CK  100  100  000  -  93
  5 Reallocated_Sector_Ct  PO--CK  200  200  140  -  0
  7 Seek_Error_Rate  -OSR-K  200  200  000  -  0
  9 Power_On_Hours  -O--CK  077  077  000  -  16984
 10 Spin_Retry_Count  -O--CK  100  253  000  -  0
 11 Calibration_Retry_Count -O--CK  100  253  000  -  0
 12 Power_Cycle_Count  -O--CK  100  100  000  -  93
192 Power-Off_Retract_Count -O--CK  200  200  000  -  91
193 Load_Cycle_Count  -O--CK  200  200  000  -  145
194 Temperature_Celsius  -O---K  114  101  000  -  36
196 Reallocated_Event_Count -O--CK  200  200  000  -  0
197 Current_Pending_Sector  -O--CK  200  200  000  -  0
198 Offline_Uncorrectable  ----CK  100  253  000  -  0
199 UDMA_CRC_Error_Count  -O--CK  200  200  000  -  0
200 Multi_Zone_Error_Rate  ---R--  200  200  000  -  0
  ||||||_ K auto-keep
  |||||__ C event count
  ||||___ R error rate
  |||____ S speed/performance
  ||_____ O updated online
  |______ P prefailure warning

General Purpose Log Directory Version 1
SMART  Log Directory Version 1 [multi-sector log support]
Address  Access  R/W  Size  Description
0x00  GPL,SL  R/O  1  Log Directory
0x01  SL  R/O  1  Summary SMART error log
0x02  SL  R/O  5  Comprehensive SMART error log
0x03  GPL  R/O  6  Ext. Comprehensive SMART error log
0x06  SL  R/O  1  SMART self-test log
0x07  GPL  R/O  1  Extended self-test log
0x09  SL  R/W  1  Selective self-test log
0x10  GPL  R/O  1  SATA NCQ Queued Error log
0x11  GPL  R/O  1  SATA Phy Event Counters log
0x21  GPL  R/O  1  Write stream error log
0x22  GPL  R/O  1  Read stream error log
0x80-0x9f  GPL,SL  R/W  16  Host vendor specific log
0xa0-0xa7  GPL,SL  VS  16  Device vendor specific log
0xa8-0xb7  GPL,SL  VS  1  Device vendor specific log
0xbd  GPL,SL  VS  1  Device vendor specific log
0xc0  GPL,SL  VS  1  Device vendor specific log
0xc1  GPL  VS  93  Device vendor specific log
0xe0  GPL,SL  R/W  1  SCT Command/Status
0xe1  GPL,SL  R/W  1  SCT Data Transfer

SMART Extended Comprehensive Error Log Version: 1 (6 sectors)
No Errors Logged

SMART Extended Self-test Log Version: 1 (1 sectors)
Num  Test_Description  Status  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline  Completed without error  00%  5244  -
# 2  Short offline  Completed without error  00%  5220  -
# 3  Short offline  Completed without error  00%  5196  -
# 4  Short offline  Completed without error  00%  5172  -
# 5  Short offline  Completed without error  00%  5148  -
# 6  Short offline  Completed without error  00%  5124  -
# 7  Short offline  Completed without error  00%  5100  -
# 8  Short offline  Completed without error  00%  5076  -
# 9  Short offline  Completed without error  00%  5057  -
#10  Short offline  Completed without error  00%  5033  -
#11  Short offline  Completed without error  00%  5009  -
#12  Short offline  Completed without error  00%  4985  -
#13  Short offline  Completed without error  00%  4961  -
#14  Short offline  Completed without error  00%  4937  -
#15  Short offline  Completed without error  00%  4913  -
#16  Short offline  Completed without error  00%  4889  -
#17  Extended offline  Completed without error  00%  4873  -
#18  Short offline  Completed without error  00%  4865  -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
  1  0  0  Not_testing
  2  0  0  Not_testing
  3  0  0  Not_testing
  4  0  0  Not_testing
  5  0  0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

SCT Status Version:  3
SCT Version (vendor specific):  258 (0x0102)
SCT Support Level:  1
Device State:  Active (0)
Current Temperature:  36 Celsius
Power Cycle Min/Max Temperature:  22/39 Celsius
Lifetime  Min/Max Temperature:  2/49 Celsius
Under/Over Temperature Limit Count:  0/0
Vendor specific:
01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00

SCT Temperature History Version:  2
Temperature Sampling Period:  1 minute
Temperature Logging Interval:  1 minute
Min/Max recommended Temperature:  0/60 Celsius
Min/Max Temperature Limit:  -41/85 Celsius
Temperature History Size (Index):  478 (35)

Index  Estimated Time  Temperature Celsius
  36  2017-02-07 03:03  36  *****************
 ...  ..(476 skipped).  ..  *****************
  35  2017-02-07 11:00  36  *****************

SCT Error Recovery Control:
  Read:  70 (7.0 seconds)
  Write:  70 (7.0 seconds)

Device Statistics (GP/SMART Log 0x04) not supported

SATA Phy Event Counters (GP Log 0x11)
ID  Size  Value  Description
0x0001  2  0  Command failed due to ICRC error
0x0002  2  0  R_ERR response for data FIS
0x0003  2  0  R_ERR response for device-to-host data FIS
0x0004  2  0  R_ERR response for host-to-device data FIS
0x0005  2  0  R_ERR response for non-data FIS
0x0006  2  0  R_ERR response for device-to-host non-data FIS
0x0007  2  0  R_ERR response for host-to-device non-data FIS
0x0008  2  0  Device-to-host non-data FIS retries
0x0009  2  7  Transition from drive PhyRdy to drive PhyNRdy
0x000a  2  8  Device-to-host register FISes sent due to a COMRESET
0x000b  2  0  CRC errors within host-to-device FIS
0x000f  2  0  R_ERR response for host-to-device data FIS, CRC
0x0012  2  0  R_ERR response for host-to-device non-data FIS, CRC
0x8000  4  1889253  Vendor specific



Output of SmartCtl -x /dev/da1
Code:
=== START OF INFORMATION SECTION ===
Model Family:  Western Digital Red
Device Model:  WDC WD30EFRX-68EUZN0
Serial Number:  WD-WCC4N3VP7D78
LU WWN Device Id: 5 0014ee 2605ef747
Firmware Version: 82.00A82
User Capacity:  3,000,592,982,016 bytes [3.00 TB]
Sector Sizes:  512 bytes logical, 4096 bytes physical
Rotation Rate:  5400 rpm
Device is:  In smartctl database [for details use: -P show]
ATA Version is:  ACS-2 (minor revision not indicated)
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:  Tue Feb  7 10:57:03 2017 GMT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
AAM feature is:  Unavailable
APM feature is:  Unavailable
Rd look-ahead is: Enabled
Write cache is:  Enabled
ATA Security is:  Disabled, NOT FROZEN [SEC1]
Wt Cache Reorder: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
  was never started.
  Auto Offline Data Collection: Disabled.
Self-test execution status:  (  0) The previous self-test routine completed
  without error or no self-test has ever
  been run.
Total time to complete Offline
data collection:  (41460) seconds.
Offline data collection
capabilities:  (0x7b) SMART execute Offline immediate.
  Auto Offline data collection on/off support.
  Suspend Offline collection upon new
  command.
  Offline surface scan supported.
  Self-test supported.
  Conveyance Self-test supported.
  Selective Self-test supported.
SMART capabilities:  (0x0003) Saves SMART data before entering
  power-saving mode.
  Supports SMART auto save timer.
Error logging capability:  (0x01) Error logging supported.
  General Purpose Logging supported.
Short self-test routine
recommended polling time:  (  2) minutes.
Extended self-test routine
recommended polling time:  ( 416) minutes.
Conveyance self-test routine
recommended polling time:  (  5) minutes.
SCT capabilities:  (0x703d) SCT Status supported.
  SCT Error Recovery Control supported.
  SCT Feature Control supported.
  SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAGS  VALUE WORST THRESH FAIL RAW_VALUE
  1 Raw_Read_Error_Rate  POSR-K  200  200  051  -  0
  3 Spin_Up_Time  POS--K  174  173  021  -  6258
  4 Start_Stop_Count  -O--CK  100  100  000  -  93
  5 Reallocated_Sector_Ct  PO--CK  200  200  140  -  0
  7 Seek_Error_Rate  -OSR-K  200  200  000  -  0
  9 Power_On_Hours  -O--CK  077  077  000  -  16984
 10 Spin_Retry_Count  -O--CK  100  253  000  -  0
 11 Calibration_Retry_Count -O--CK  100  253  000  -  0
 12 Power_Cycle_Count  -O--CK  100  100  000  -  93
192 Power-Off_Retract_Count -O--CK  200  200  000  -  91
193 Load_Cycle_Count  -O--CK  200  200  000  -  142
194 Temperature_Celsius  -O---K  113  099  000  -  37
196 Reallocated_Event_Count -O--CK  200  200  000  -  0
197 Current_Pending_Sector  -O--CK  200  200  000  -  0
198 Offline_Uncorrectable  ----CK  100  253  000  -  0
199 UDMA_CRC_Error_Count  -O--CK  200  200  000  -  0
200 Multi_Zone_Error_Rate  ---R--  200  200  000  -  0
  ||||||_ K auto-keep
  |||||__ C event count
  ||||___ R error rate
  |||____ S speed/performance
  ||_____ O updated online
  |______ P prefailure warning

General Purpose Log Directory Version 1
SMART  Log Directory Version 1 [multi-sector log support]
Address  Access  R/W  Size  Description
0x00  GPL,SL  R/O  1  Log Directory
0x01  SL  R/O  1  Summary SMART error log
0x02  SL  R/O  5  Comprehensive SMART error log
0x03  GPL  R/O  6  Ext. Comprehensive SMART error log
0x06  SL  R/O  1  SMART self-test log
0x07  GPL  R/O  1  Extended self-test log
0x09  SL  R/W  1  Selective self-test log
0x10  GPL  R/O  1  SATA NCQ Queued Error log
0x11  GPL  R/O  1  SATA Phy Event Counters log
0x21  GPL  R/O  1  Write stream error log
0x22  GPL  R/O  1  Read stream error log
0x80-0x9f  GPL,SL  R/W  16  Host vendor specific log
0xa0-0xa7  GPL,SL  VS  16  Device vendor specific log
0xa8-0xb7  GPL,SL  VS  1  Device vendor specific log
0xbd  GPL,SL  VS  1  Device vendor specific log
0xc0  GPL,SL  VS  1  Device vendor specific log
0xc1  GPL  VS  93  Device vendor specific log
0xe0  GPL,SL  R/W  1  SCT Command/Status
0xe1  GPL,SL  R/W  1  SCT Data Transfer

SMART Extended Comprehensive Error Log Version: 1 (6 sectors)
No Errors Logged

SMART Extended Self-test Log Version: 1 (1 sectors)
Num  Test_Description  Status  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline  Completed without error  00%  5244  -
# 2  Short offline  Completed without error  00%  5220  -
# 3  Short offline  Completed without error  00%  5196  -
# 4  Short offline  Completed without error  00%  5172  -
# 5  Short offline  Completed without error  00%  5148  -
# 6  Short offline  Completed without error  00%  5124  -
# 7  Short offline  Completed without error  00%  5100  -
# 8  Short offline  Completed without error  00%  5076  -
# 9  Short offline  Completed without error  00%  5057  -
#10  Short offline  Completed without error  00%  5033  -
#11  Short offline  Completed without error  00%  5009  -
#12  Short offline  Completed without error  00%  4985  -
#13  Short offline  Completed without error  00%  4961  -
#14  Short offline  Completed without error  00%  4937  -
#15  Short offline  Completed without error  00%  4913  -
#16  Short offline  Completed without error  00%  4889  -
#17  Extended offline  Completed without error  00%  4874  -
#18  Short offline  Completed without error  00%  4865  -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
  1  0  0  Not_testing
  2  0  0  Not_testing
  3  0  0  Not_testing
  4  0  0  Not_testing
  5  0  0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

SCT Status Version:  3
SCT Version (vendor specific):  258 (0x0102)
SCT Support Level:  1
Device State:  Active (0)
Current Temperature:  37 Celsius
Power Cycle Min/Max Temperature:  22/41 Celsius
Lifetime  Min/Max Temperature:  2/51 Celsius
Under/Over Temperature Limit Count:  0/0
Vendor specific:
01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00

SCT Temperature History Version:  2
Temperature Sampling Period:  1 minute
Temperature Logging Interval:  1 minute
Min/Max recommended Temperature:  0/60 Celsius
Min/Max Temperature Limit:  -41/85 Celsius
Temperature History Size (Index):  478 (29)

Index  Estimated Time  Temperature Celsius
  30  2017-02-07 03:00  37  ******************
 ...  ..(476 skipped).  ..  ******************
  29  2017-02-07 10:57  37  ******************

SCT Error Recovery Control:
  Read:  70 (7.0 seconds)
  Write:  70 (7.0 seconds)

Device Statistics (GP/SMART Log 0x04) not supported

SATA Phy Event Counters (GP Log 0x11)
ID  Size  Value  Description
0x0001  2  0  Command failed due to ICRC error
0x0002  2  0  R_ERR response for data FIS
0x0003  2  0  R_ERR response for device-to-host data FIS
0x0004  2  0  R_ERR response for host-to-device data FIS
0x0005  2  0  R_ERR response for non-data FIS
0x0006  2  0  R_ERR response for device-to-host non-data FIS
0x0007  2  0  R_ERR response for host-to-device non-data FIS
0x0008  2  0  Device-to-host non-data FIS retries
0x0009  2  7  Transition from drive PhyRdy to drive PhyNRdy
0x000a  2  8  Device-to-host register FISes sent due to a COMRESET
0x000b  2  0  CRC errors within host-to-device FIS
0x000f  2  0  R_ERR response for host-to-device data FIS, CRC
0x0012  2  0  R_ERR response for host-to-device non-data FIS, CRC
0x8000  4  1889053  Vendor specific


da2
Code:
=== START OF INFORMATION SECTION ===
Model Family:  Western Digital Red
Device Model:  WDC WD30EFRX-68EUZN0
Serial Number:  WD-WCC4N6ZY0SDF
LU WWN Device Id: 5 0014ee 2b5b4b610
Firmware Version: 82.00A82
User Capacity:  3,000,592,982,016 bytes [3.00 TB]
Sector Sizes:  512 bytes logical, 4096 bytes physical
Rotation Rate:  5400 rpm
Device is:  In smartctl database [for details use: -P show]
ATA Version is:  ACS-2 (minor revision not indicated)
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:  Tue Feb  7 10:57:56 2017 GMT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
AAM feature is:  Unavailable
APM feature is:  Unavailable
Rd look-ahead is: Enabled
Write cache is:  Enabled
ATA Security is:  Disabled, NOT FROZEN [SEC1]
Wt Cache Reorder: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
  was never started.
  Auto Offline Data Collection: Disabled.
Self-test execution status:  (  0) The previous self-test routine completed
  without error or no self-test has ever
  been run.
Total time to complete Offline
data collection:  (40020) seconds.
Offline data collection
capabilities:  (0x7b) SMART execute Offline immediate.
  Auto Offline data collection on/off support.
  Suspend Offline collection upon new
  command.
  Offline surface scan supported.
  Self-test supported.
  Conveyance Self-test supported.
  Selective Self-test supported.
SMART capabilities:  (0x0003) Saves SMART data before entering
  power-saving mode.
  Supports SMART auto save timer.
Error logging capability:  (0x01) Error logging supported.
  General Purpose Logging supported.
Short self-test routine
recommended polling time:  (  2) minutes.
Extended self-test routine
recommended polling time:  ( 401) minutes.
Conveyance self-test routine
recommended polling time:  (  5) minutes.
SCT capabilities:  (0x703d) SCT Status supported.
  SCT Error Recovery Control supported.
  SCT Feature Control supported.
  SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAGS  VALUE WORST THRESH FAIL RAW_VALUE
  1 Raw_Read_Error_Rate  POSR-K  200  200  051  -  0
  3 Spin_Up_Time  POS--K  176  176  021  -  6175
  4 Start_Stop_Count  -O--CK  100  100  000  -  93
  5 Reallocated_Sector_Ct  PO--CK  200  200  140  -  0
  7 Seek_Error_Rate  -OSR-K  200  200  000  -  0
  9 Power_On_Hours  -O--CK  077  077  000  -  16984
 10 Spin_Retry_Count  -O--CK  100  253  000  -  0
 11 Calibration_Retry_Count -O--CK  100  253  000  -  0
 12 Power_Cycle_Count  -O--CK  100  100  000  -  93
192 Power-Off_Retract_Count -O--CK  200  200  000  -  91
193 Load_Cycle_Count  -O--CK  200  200  000  -  142
194 Temperature_Celsius  -O---K  117  103  000  -  33
196 Reallocated_Event_Count -O--CK  200  200  000  -  0
197 Current_Pending_Sector  -O--CK  200  200  000  -  0
198 Offline_Uncorrectable  ----CK  100  253  000  -  0
199 UDMA_CRC_Error_Count  -O--CK  200  200  000  -  0
200 Multi_Zone_Error_Rate  ---R--  200  200  000  -  0
  ||||||_ K auto-keep
  |||||__ C event count
  ||||___ R error rate
  |||____ S speed/performance
  ||_____ O updated online
  |______ P prefailure warning

General Purpose Log Directory Version 1
SMART  Log Directory Version 1 [multi-sector log support]
Address  Access  R/W  Size  Description
0x00  GPL,SL  R/O  1  Log Directory
0x01  SL  R/O  1  Summary SMART error log
0x02  SL  R/O  5  Comprehensive SMART error log
0x03  GPL  R/O  6  Ext. Comprehensive SMART error log
0x06  SL  R/O  1  SMART self-test log
0x07  GPL  R/O  1  Extended self-test log
0x09  SL  R/W  1  Selective self-test log
0x10  GPL  R/O  1  SATA NCQ Queued Error log
0x11  GPL  R/O  1  SATA Phy Event Counters log
0x21  GPL  R/O  1  Write stream error log
0x22  GPL  R/O  1  Read stream error log
0x80-0x9f  GPL,SL  R/W  16  Host vendor specific log
0xa0-0xa7  GPL,SL  VS  16  Device vendor specific log
0xa8-0xb7  GPL,SL  VS  1  Device vendor specific log
0xbd  GPL,SL  VS  1  Device vendor specific log
0xc0  GPL,SL  VS  1  Device vendor specific log
0xc1  GPL  VS  93  Device vendor specific log
0xe0  GPL,SL  R/W  1  SCT Command/Status
0xe1  GPL,SL  R/W  1  SCT Data Transfer

SMART Extended Comprehensive Error Log Version: 1 (6 sectors)
No Errors Logged

SMART Extended Self-test Log Version: 1 (1 sectors)
Num  Test_Description  Status  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline  Completed without error  00%  5244  -
# 2  Short offline  Completed without error  00%  5220  -
# 3  Short offline  Completed without error  00%  5196  -
# 4  Short offline  Completed without error  00%  5172  -
# 5  Short offline  Completed without error  00%  5148  -
# 6  Short offline  Completed without error  00%  5124  -
# 7  Short offline  Completed without error  00%  5100  -
# 8  Short offline  Completed without error  00%  5076  -
# 9  Short offline  Completed without error  00%  5057  -
#10  Short offline  Completed without error  00%  5033  -
#11  Short offline  Completed without error  00%  5009  -
#12  Short offline  Completed without error  00%  4985  -
#13  Short offline  Completed without error  00%  4961  -
#14  Short offline  Completed without error  00%  4937  -
#15  Short offline  Completed without error  00%  4913  -
#16  Short offline  Completed without error  00%  4889  -
#17  Extended offline  Completed without error  00%  4874  -
#18  Short offline  Completed without error  00%  4865  -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
  1  0  0  Not_testing
  2  0  0  Not_testing
  3  0  0  Not_testing
  4  0  0  Not_testing
  5  0  0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

SCT Status Version:  3
SCT Version (vendor specific):  258 (0x0102)
SCT Support Level:  1
Device State:  Active (0)
Current Temperature:  33 Celsius
Power Cycle Min/Max Temperature:  21/36 Celsius
Lifetime  Min/Max Temperature:  2/47 Celsius
Under/Over Temperature Limit Count:  0/0
Vendor specific:
01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00

SCT Temperature History Version:  2
Temperature Sampling Period:  1 minute
Temperature Logging Interval:  1 minute
Min/Max recommended Temperature:  0/60 Celsius
Min/Max Temperature Limit:  -41/85 Celsius
Temperature History Size (Index):  478 (33)

Index  Estimated Time  Temperature Celsius
  34  2017-02-07 03:00  33  **************
 ...  ..( 45 skipped).  ..  **************
  80  2017-02-07 03:46  33  **************
  81  2017-02-07 03:47  32  *************
 ...  ..( 54 skipped).  ..  *************
 136  2017-02-07 04:42  32  *************
 137  2017-02-07 04:43  33  **************
 ...  ..(  5 skipped).  ..  **************
 143  2017-02-07 04:49  33  **************
 144  2017-02-07 04:50  32  *************
 ...  ..( 56 skipped).  ..  *************
 201  2017-02-07 05:47  32  *************
 202  2017-02-07 05:48  33  **************
 203  2017-02-07 05:49  32  *************
 ...  ..( 51 skipped).  ..  *************
 255  2017-02-07 06:41  32  *************
 256  2017-02-07 06:42  33  **************
 ...  ..(254 skipped).  ..  **************
  33  2017-02-07 10:57  33  **************

SCT Error Recovery Control:
  Read:  70 (7.0 seconds)
  Write:  70 (7.0 seconds)

Device Statistics (GP/SMART Log 0x04) not supported

SATA Phy Event Counters (GP Log 0x11)
ID  Size  Value  Description
0x0001  2  0  Command failed due to ICRC error
0x0002  2  0  R_ERR response for data FIS
0x0003  2  0  R_ERR response for device-to-host data FIS
0x0004  2  0  R_ERR response for host-to-device data FIS
0x0005  2  0  R_ERR response for non-data FIS
0x0006  2  0  R_ERR response for device-to-host non-data FIS
0x0007  2  0  R_ERR response for host-to-device non-data FIS
0x0008  2  0  Device-to-host non-data FIS retries
0x0009  2  7  Transition from drive PhyRdy to drive PhyNRdy
0x000a  2  8  Device-to-host register FISes sent due to a COMRESET
0x000b  2  0  CRC errors within host-to-device FIS
0x000f  2  0  R_ERR response for host-to-device data FIS, CRC
0x0012  2  0  R_ERR response for host-to-device non-data FIS, CRC
0x8000  4  1889113  Vendor specific

 

28061

Contributor
Joined
Oct 13, 2014
Messages
120
da3
Code:
=== START OF INFORMATION SECTION ===
Model Family:  Western Digital Red
Device Model:  WDC WD30EFRX-68EUZN0
Serial Number:  WD-WCC4N3VP716T
LU WWN Device Id: 5 0014ee 2605ef34c
Firmware Version: 82.00A82
User Capacity:  3,000,592,982,016 bytes [3.00 TB]
Sector Sizes:  512 bytes logical, 4096 bytes physical
Rotation Rate:  5400 rpm
Device is:  In smartctl database [for details use: -P show]
ATA Version is:  ACS-2 (minor revision not indicated)
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:  Tue Feb  7 10:58:39 2017 GMT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
AAM feature is:  Unavailable
APM feature is:  Unavailable
Rd look-ahead is: Enabled
Write cache is:  Enabled
ATA Security is:  Disabled, NOT FROZEN [SEC1]
Wt Cache Reorder: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
  was never started.
  Auto Offline Data Collection: Disabled.
Self-test execution status:  (  0) The previous self-test routine completed
  without error or no self-test has ever
  been run.
Total time to complete Offline
data collection:  (41160) seconds.
Offline data collection
capabilities:  (0x7b) SMART execute Offline immediate.
  Auto Offline data collection on/off support.
  Suspend Offline collection upon new
  command.
  Offline surface scan supported.
  Self-test supported.
  Conveyance Self-test supported.
  Selective Self-test supported.
SMART capabilities:  (0x0003) Saves SMART data before entering
  power-saving mode.
  Supports SMART auto save timer.
Error logging capability:  (0x01) Error logging supported.
  General Purpose Logging supported.
Short self-test routine
recommended polling time:  (  2) minutes.
Extended self-test routine
recommended polling time:  ( 413) minutes.
Conveyance self-test routine
recommended polling time:  (  5) minutes.
SCT capabilities:  (0x703d) SCT Status supported.
  SCT Error Recovery Control supported.
  SCT Feature Control supported.
  SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAGS  VALUE WORST THRESH FAIL RAW_VALUE
  1 Raw_Read_Error_Rate  POSR-K  200  200  051  -  0
  3 Spin_Up_Time  POS--K  178  177  021  -  6091
  4 Start_Stop_Count  -O--CK  100  100  000  -  93
  5 Reallocated_Sector_Ct  PO--CK  200  200  140  -  0
  7 Seek_Error_Rate  -OSR-K  200  200  000  -  0
  9 Power_On_Hours  -O--CK  077  077  000  -  16984
 10 Spin_Retry_Count  -O--CK  100  253  000  -  0
 11 Calibration_Retry_Count -O--CK  100  253  000  -  0
 12 Power_Cycle_Count  -O--CK  100  100  000  -  93
192 Power-Off_Retract_Count -O--CK  200  200  000  -  91
193 Load_Cycle_Count  -O--CK  200  200  000  -  141
194 Temperature_Celsius  -O---K  115  102  000  -  35
196 Reallocated_Event_Count -O--CK  200  200  000  -  0
197 Current_Pending_Sector  -O--CK  200  200  000  -  0
198 Offline_Uncorrectable  ----CK  100  253  000  -  0
199 UDMA_CRC_Error_Count  -O--CK  200  200  000  -  0
200 Multi_Zone_Error_Rate  ---R--  200  200  000  -  0
  ||||||_ K auto-keep
  |||||__ C event count
  ||||___ R error rate
  |||____ S speed/performance
  ||_____ O updated online
  |______ P prefailure warning

General Purpose Log Directory Version 1
SMART  Log Directory Version 1 [multi-sector log support]
Address  Access  R/W  Size  Description
0x00  GPL,SL  R/O  1  Log Directory
0x01  SL  R/O  1  Summary SMART error log
0x02  SL  R/O  5  Comprehensive SMART error log
0x03  GPL  R/O  6  Ext. Comprehensive SMART error log
0x06  SL  R/O  1  SMART self-test log
0x07  GPL  R/O  1  Extended self-test log
0x09  SL  R/W  1  Selective self-test log
0x10  GPL  R/O  1  SATA NCQ Queued Error log
0x11  GPL  R/O  1  SATA Phy Event Counters log
0x21  GPL  R/O  1  Write stream error log
0x22  GPL  R/O  1  Read stream error log
0x80-0x9f  GPL,SL  R/W  16  Host vendor specific log
0xa0-0xa7  GPL,SL  VS  16  Device vendor specific log
0xa8-0xb7  GPL,SL  VS  1  Device vendor specific log
0xbd  GPL,SL  VS  1  Device vendor specific log
0xc0  GPL,SL  VS  1  Device vendor specific log
0xc1  GPL  VS  93  Device vendor specific log
0xe0  GPL,SL  R/W  1  SCT Command/Status
0xe1  GPL,SL  R/W  1  SCT Data Transfer

SMART Extended Comprehensive Error Log Version: 1 (6 sectors)
No Errors Logged

SMART Extended Self-test Log Version: 1 (1 sectors)
Num  Test_Description  Status  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline  Completed without error  00%  5244  -
# 2  Short offline  Completed without error  00%  5220  -
# 3  Short offline  Completed without error  00%  5196  -
# 4  Short offline  Completed without error  00%  5172  -
# 5  Short offline  Completed without error  00%  5148  -
# 6  Short offline  Completed without error  00%  5124  -
# 7  Short offline  Completed without error  00%  5100  -
# 8  Short offline  Completed without error  00%  5076  -
# 9  Short offline  Completed without error  00%  5057  -
#10  Short offline  Completed without error  00%  5033  -
#11  Short offline  Completed without error  00%  5009  -
#12  Short offline  Completed without error  00%  4985  -
#13  Short offline  Completed without error  00%  4961  -
#14  Short offline  Completed without error  00%  4937  -
#15  Short offline  Completed without error  00%  4913  -
#16  Short offline  Completed without error  00%  4889  -
#17  Extended offline  Completed without error  00%  4874  -
#18  Short offline  Completed without error  00%  4865  -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
  1  0  0  Not_testing
  2  0  0  Not_testing
  3  0  0  Not_testing
  4  0  0  Not_testing
  5  0  0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

SCT Status Version:  3
SCT Version (vendor specific):  258 (0x0102)
SCT Support Level:  1
Device State:  Active (0)
Current Temperature:  35 Celsius
Power Cycle Min/Max Temperature:  22/38 Celsius
Lifetime  Min/Max Temperature:  2/48 Celsius
Under/Over Temperature Limit Count:  0/0
Vendor specific:
01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00

SCT Temperature History Version:  2
Temperature Sampling Period:  1 minute
Temperature Logging Interval:  1 minute
Min/Max recommended Temperature:  0/60 Celsius
Min/Max Temperature Limit:  -41/85 Celsius
Temperature History Size (Index):  478 (32)

Index  Estimated Time  Temperature Celsius
  33  2017-02-07 03:01  36  *****************
 ...  ..( 15 skipped).  ..  *****************
  49  2017-02-07 03:17  36  *****************
  50  2017-02-07 03:18  35  ****************
 ...  ..(253 skipped).  ..  ****************
 304  2017-02-07 07:32  35  ****************
 305  2017-02-07 07:33  36  *****************
 ...  ..(204 skipped).  ..  *****************
  32  2017-02-07 10:58  36  *****************

SCT Error Recovery Control:
  Read:  70 (7.0 seconds)
  Write:  70 (7.0 seconds)

Device Statistics (GP/SMART Log 0x04) not supported

SATA Phy Event Counters (GP Log 0x11)
ID  Size  Value  Description
0x0001  2  0  Command failed due to ICRC error
0x0002  2  0  R_ERR response for data FIS
0x0003  2  0  R_ERR response for device-to-host data FIS
0x0004  2  0  R_ERR response for host-to-device data FIS
0x0005  2  0  R_ERR response for non-data FIS
0x0006  2  0  R_ERR response for device-to-host non-data FIS
0x0007  2  0  R_ERR response for host-to-device non-data FIS
0x0008  2  0  Device-to-host non-data FIS retries
0x0009  2  3  Transition from drive PhyRdy to drive PhyNRdy
0x000a  2  4  Device-to-host register FISes sent due to a COMRESET
0x000b  2  0  CRC errors within host-to-device FIS
0x000f  2  0  R_ERR response for host-to-device data FIS, CRC
0x0012  2  0  R_ERR response for host-to-device non-data FIS, CRC
0x8000  4  1889151  Vendor specific



da4
Code:
=== START OF INFORMATION SECTION ===
Model Family:  Western Digital Red
Device Model:  WDC WD30EFRX-68EUZN0
Serial Number:  WD-WCC4N3VP7J2P
LU WWN Device Id: 5 0014ee 20b09d03b
Firmware Version: 82.00A82
User Capacity:  3,000,592,982,016 bytes [3.00 TB]
Sector Sizes:  512 bytes logical, 4096 bytes physical
Rotation Rate:  5400 rpm
Device is:  In smartctl database [for details use: -P show]
ATA Version is:  ACS-2 (minor revision not indicated)
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:  Tue Feb  7 10:59:06 2017 GMT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
AAM feature is:  Unavailable
APM feature is:  Unavailable
Rd look-ahead is: Enabled
Write cache is:  Enabled
ATA Security is:  Disabled, NOT FROZEN [SEC1]
Wt Cache Reorder: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
  was never started.
  Auto Offline Data Collection: Disabled.
Self-test execution status:  (  0) The previous self-test routine completed
  without error or no self-test has ever
  been run.
Total time to complete Offline
data collection:  (39540) seconds.
Offline data collection
capabilities:  (0x7b) SMART execute Offline immediate.
  Auto Offline data collection on/off support.
  Suspend Offline collection upon new
  command.
  Offline surface scan supported.
  Self-test supported.
  Conveyance Self-test supported.
  Selective Self-test supported.
SMART capabilities:  (0x0003) Saves SMART data before entering
  power-saving mode.
  Supports SMART auto save timer.
Error logging capability:  (0x01) Error logging supported.
  General Purpose Logging supported.
Short self-test routine
recommended polling time:  (  2) minutes.
Extended self-test routine
recommended polling time:  ( 397) minutes.
Conveyance self-test routine
recommended polling time:  (  5) minutes.
SCT capabilities:  (0x703d) SCT Status supported.
  SCT Error Recovery Control supported.
  SCT Feature Control supported.
  SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAGS  VALUE WORST THRESH FAIL RAW_VALUE
  1 Raw_Read_Error_Rate  POSR-K  200  200  051  -  0
  3 Spin_Up_Time  POS--K  178  177  021  -  6075
  4 Start_Stop_Count  -O--CK  100  100  000  -  93
  5 Reallocated_Sector_Ct  PO--CK  200  200  140  -  0
  7 Seek_Error_Rate  -OSR-K  200  200  000  -  0
  9 Power_On_Hours  -O--CK  077  077  000  -  16984
 10 Spin_Retry_Count  -O--CK  100  253  000  -  0
 11 Calibration_Retry_Count -O--CK  100  253  000  -  0
 12 Power_Cycle_Count  -O--CK  100  100  000  -  93
192 Power-Off_Retract_Count -O--CK  200  200  000  -  91
193 Load_Cycle_Count  -O--CK  200  200  000  -  178
194 Temperature_Celsius  -O---K  118  102  000  -  32
196 Reallocated_Event_Count -O--CK  200  200  000  -  0
197 Current_Pending_Sector  -O--CK  200  200  000  -  0
198 Offline_Uncorrectable  ----CK  100  253  000  -  0
199 UDMA_CRC_Error_Count  -O--CK  200  200  000  -  0
200 Multi_Zone_Error_Rate  ---R--  200  200  000  -  0
  ||||||_ K auto-keep
  |||||__ C event count
  ||||___ R error rate
  |||____ S speed/performance
  ||_____ O updated online
  |______ P prefailure warning

General Purpose Log Directory Version 1
SMART  Log Directory Version 1 [multi-sector log support]
Address  Access  R/W  Size  Description
0x00  GPL,SL  R/O  1  Log Directory
0x01  SL  R/O  1  Summary SMART error log
0x02  SL  R/O  5  Comprehensive SMART error log
0x03  GPL  R/O  6  Ext. Comprehensive SMART error log
0x06  SL  R/O  1  SMART self-test log
0x07  GPL  R/O  1  Extended self-test log
0x09  SL  R/W  1  Selective self-test log
0x10  GPL  R/O  1  SATA NCQ Queued Error log
0x11  GPL  R/O  1  SATA Phy Event Counters log
0x21  GPL  R/O  1  Write stream error log
0x22  GPL  R/O  1  Read stream error log
0x80-0x9f  GPL,SL  R/W  16  Host vendor specific log
0xa0-0xa7  GPL,SL  VS  16  Device vendor specific log
0xa8-0xb7  GPL,SL  VS  1  Device vendor specific log
0xbd  GPL,SL  VS  1  Device vendor specific log
0xc0  GPL,SL  VS  1  Device vendor specific log
0xc1  GPL  VS  93  Device vendor specific log
0xe0  GPL,SL  R/W  1  SCT Command/Status
0xe1  GPL,SL  R/W  1  SCT Data Transfer

SMART Extended Comprehensive Error Log Version: 1 (6 sectors)
No Errors Logged

SMART Extended Self-test Log Version: 1 (1 sectors)
Num  Test_Description  Status  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline  Completed without error  00%  16970  -
# 2  Short offline  Completed without error  00%  5244  -
# 3  Short offline  Completed without error  00%  5220  -
# 4  Short offline  Completed without error  00%  5196  -
# 5  Short offline  Completed without error  00%  5172  -
# 6  Short offline  Completed without error  00%  5148  -
# 7  Short offline  Interrupted (host reset)  90%  5124  -
# 8  Short offline  Completed without error  00%  5100  -
# 9  Short offline  Completed without error  00%  5076  -
#10  Short offline  Completed without error  00%  5057  -
#11  Short offline  Completed without error  00%  5033  -
#12  Short offline  Completed without error  00%  5009  -
#13  Short offline  Completed without error  00%  4985  -
#14  Short offline  Completed without error  00%  4961  -
#15  Short offline  Completed without error  00%  4937  -
#16  Short offline  Completed without error  00%  4913  -
#17  Short offline  Completed without error  00%  4889  -
#18  Extended offline  Completed without error  00%  4874  -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
  1  0  0  Not_testing
  2  0  0  Not_testing
  3  0  0  Not_testing
  4  0  0  Not_testing
  5  0  0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

SCT Status Version:  3
SCT Version (vendor specific):  258 (0x0102)
SCT Support Level:  1
Device State:  Active (0)
Current Temperature:  32 Celsius
Power Cycle Min/Max Temperature:  22/37 Celsius
Lifetime  Min/Max Temperature:  2/48 Celsius
Under/Over Temperature Limit Count:  0/0
Vendor specific:
01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00

SCT Temperature History Version:  2
Temperature Sampling Period:  1 minute
Temperature Logging Interval:  1 minute
Min/Max recommended Temperature:  0/60 Celsius
Min/Max Temperature Limit:  -41/85 Celsius
Temperature History Size (Index):  478 (251)

Index  Estimated Time  Temperature Celsius
 252  2017-02-07 03:02  33  **************
 ...  ..( 18 skipped).  ..  **************
 271  2017-02-07 03:21  33  **************
 272  2017-02-07 03:22  32  *************
 ...  ..( 24 skipped).  ..  *************
 297  2017-02-07 03:47  32  *************
 298  2017-02-07 03:48  33  **************
 ...  ..( 14 skipped).  ..  **************
 313  2017-02-07 04:03  33  **************
 314  2017-02-07 04:04  32  *************
 ...  ..( 43 skipped).  ..  *************
 358  2017-02-07 04:48  32  *************
 359  2017-02-07 04:49  33  **************
 ...  ..(  9 skipped).  ..  **************
 369  2017-02-07 04:59  33  **************
 370  2017-02-07 05:00  32  *************
 ...  ..( 46 skipped).  ..  *************
 417  2017-02-07 05:47  32  *************
 418  2017-02-07 05:48  33  **************
 ...  ..( 10 skipped).  ..  **************
 429  2017-02-07 05:59  33  **************
 430  2017-02-07 06:00  32  *************
 ...  ..( 46 skipped).  ..  *************
 477  2017-02-07 06:47  32  *************
  0  2017-02-07 06:48  33  **************
 ...  ..( 28 skipped).  ..  **************
  29  2017-02-07 07:17  33  **************
  30  2017-02-07 07:18  32  *************
 ...  ..( 14 skipped).  ..  *************
  45  2017-02-07 07:33  32  *************
  46  2017-02-07 07:34  33  **************
 ...  ..(204 skipped).  ..  **************
 251  2017-02-07 10:59  33  **************

SCT Error Recovery Control:
  Read:  70 (7.0 seconds)
  Write:  70 (7.0 seconds)

Device Statistics (GP/SMART Log 0x04) not supported

SATA Phy Event Counters (GP Log 0x11)
ID  Size  Value  Description
0x0001  2  0  Command failed due to ICRC error
0x0002  2  9972  R_ERR response for data FIS
0x0003  2  0  R_ERR response for device-to-host data FIS
0x0004  2  9972  R_ERR response for host-to-device data FIS
0x0005  2  158  R_ERR response for non-data FIS
0x0006  2  0  R_ERR response for device-to-host non-data FIS
0x0007  2  158  R_ERR response for host-to-device non-data FIS
0x0008  2  0  Device-to-host non-data FIS retries
0x0009  2  4768  Transition from drive PhyRdy to drive PhyNRdy
0x000a  2  4769  Device-to-host register FISes sent due to a COMRESET
0x000b  2  3242  CRC errors within host-to-device FIS
0x000f  2  3174  R_ERR response for host-to-device data FIS, CRC
0x0012  2  68  R_ERR response for host-to-device non-data FIS, CRC
0x8000  4  1889178  Vendor specific



da5
Code:
=== START OF INFORMATION SECTION ===
Model Family:  Western Digital Red
Device Model:  WDC WD30EFRX-68EUZN0
Serial Number:  WD-WCC4N5HK535K
LU WWN Device Id: 5 0014ee 2b5b4cff8
Firmware Version: 82.00A82
User Capacity:  3,000,592,982,016 bytes [3.00 TB]
Sector Sizes:  512 bytes logical, 4096 bytes physical
Rotation Rate:  5400 rpm
Device is:  In smartctl database [for details use: -P show]
ATA Version is:  ACS-2 (minor revision not indicated)
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:  Tue Feb  7 10:59:45 2017 GMT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
AAM feature is:  Unavailable
APM feature is:  Unavailable
Rd look-ahead is: Enabled
Write cache is:  Enabled
ATA Security is:  Disabled, NOT FROZEN [SEC1]
Wt Cache Reorder: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
  was never started.
  Auto Offline Data Collection: Disabled.
Self-test execution status:  (  0) The previous self-test routine completed
  without error or no self-test has ever
  been run.
Total time to complete Offline
data collection:  (39840) seconds.
Offline data collection
capabilities:  (0x7b) SMART execute Offline immediate.
  Auto Offline data collection on/off support.
  Suspend Offline collection upon new
  command.
  Offline surface scan supported.
  Self-test supported.
  Conveyance Self-test supported.
  Selective Self-test supported.
SMART capabilities:  (0x0003) Saves SMART data before entering
  power-saving mode.
  Supports SMART auto save timer.
Error logging capability:  (0x01) Error logging supported.
  General Purpose Logging supported.
Short self-test routine
recommended polling time:  (  2) minutes.
Extended self-test routine
recommended polling time:  ( 399) minutes.
Conveyance self-test routine
recommended polling time:  (  5) minutes.
SCT capabilities:  (0x703d) SCT Status supported.
  SCT Error Recovery Control supported.
  SCT Feature Control supported.
  SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAGS  VALUE WORST THRESH FAIL RAW_VALUE
  1 Raw_Read_Error_Rate  POSR-K  200  200  051  -  0
  3 Spin_Up_Time  POS--K  176  175  021  -  6175
  4 Start_Stop_Count  -O--CK  100  100  000  -  93
  5 Reallocated_Sector_Ct  PO--CK  200  200  140  -  0
  7 Seek_Error_Rate  -OSR-K  200  200  000  -  0
  9 Power_On_Hours  -O--CK  077  077  000  -  16984
 10 Spin_Retry_Count  -O--CK  100  253  000  -  0
 11 Calibration_Retry_Count -O--CK  100  253  000  -  0
 12 Power_Cycle_Count  -O--CK  100  100  000  -  93
192 Power-Off_Retract_Count -O--CK  200  200  000  -  91
193 Load_Cycle_Count  -O--CK  200  200  000  -  145
194 Temperature_Celsius  -O---K  115  102  000  -  35
196 Reallocated_Event_Count -O--CK  200  200  000  -  0
197 Current_Pending_Sector  -O--CK  200  200  000  -  0
198 Offline_Uncorrectable  ----CK  100  253  000  -  0
199 UDMA_CRC_Error_Count  -O--CK  200  200  000  -  0
200 Multi_Zone_Error_Rate  ---R--  200  200  000  -  0
  ||||||_ K auto-keep
  |||||__ C event count
  ||||___ R error rate
  |||____ S speed/performance
  ||_____ O updated online
  |______ P prefailure warning

General Purpose Log Directory Version 1
SMART  Log Directory Version 1 [multi-sector log support]
Address  Access  R/W  Size  Description
0x00  GPL,SL  R/O  1  Log Directory
0x01  SL  R/O  1  Summary SMART error log
0x02  SL  R/O  5  Comprehensive SMART error log
0x03  GPL  R/O  6  Ext. Comprehensive SMART error log
0x06  SL  R/O  1  SMART self-test log
0x07  GPL  R/O  1  Extended self-test log
0x09  SL  R/W  1  Selective self-test log
0x10  GPL  R/O  1  SATA NCQ Queued Error log
0x11  GPL  R/O  1  SATA Phy Event Counters log
0x21  GPL  R/O  1  Write stream error log
0x22  GPL  R/O  1  Read stream error log
0x80-0x9f  GPL,SL  R/W  16  Host vendor specific log
0xa0-0xa7  GPL,SL  VS  16  Device vendor specific log
0xa8-0xb7  GPL,SL  VS  1  Device vendor specific log
0xbd  GPL,SL  VS  1  Device vendor specific log
0xc0  GPL,SL  VS  1  Device vendor specific log
0xc1  GPL  VS  93  Device vendor specific log
0xe0  GPL,SL  R/W  1  SCT Command/Status
0xe1  GPL,SL  R/W  1  SCT Data Transfer

SMART Extended Comprehensive Error Log Version: 1 (6 sectors)
No Errors Logged

SMART Extended Self-test Log Version: 1 (1 sectors)
Num  Test_Description  Status  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline  Completed without error  00%  5244  -
# 2  Short offline  Completed without error  00%  5220  -
# 3  Short offline  Completed without error  00%  5196  -
# 4  Short offline  Completed without error  00%  5172  -
# 5  Short offline  Completed without error  00%  5148  -
# 6  Short offline  Completed without error  00%  5124  -
# 7  Short offline  Completed without error  00%  5100  -
# 8  Short offline  Completed without error  00%  5076  -
# 9  Short offline  Completed without error  00%  5057  -
#10  Short offline  Completed without error  00%  5033  -
#11  Short offline  Completed without error  00%  5009  -
#12  Short offline  Completed without error  00%  4985  -
#13  Short offline  Completed without error  00%  4961  -
#14  Short offline  Completed without error  00%  4937  -
#15  Short offline  Completed without error  00%  4913  -
#16  Short offline  Completed without error  00%  4889  -
#17  Extended offline  Completed without error  00%  4874  -
#18  Short offline  Completed without error  00%  4865  -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
  1  0  0  Not_testing
  2  0  0  Not_testing
  3  0  0  Not_testing
  4  0  0  Not_testing
  5  0  0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

SCT Status Version:  3
SCT Version (vendor specific):  258 (0x0102)
SCT Support Level:  1
Device State:  Active (0)
Current Temperature:  35 Celsius
Power Cycle Min/Max Temperature:  22/39 Celsius
Lifetime  Min/Max Temperature:  2/48 Celsius
Under/Over Temperature Limit Count:  0/0
Vendor specific:
01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00

SCT Temperature History Version:  2
Temperature Sampling Period:  1 minute
Temperature Logging Interval:  1 minute
Min/Max recommended Temperature:  0/60 Celsius
Min/Max Temperature Limit:  -41/85 Celsius
Temperature History Size (Index):  478 (37)

Index  Estimated Time  Temperature Celsius
  38  2017-02-07 03:02  35  ****************
 ...  ..(476 skipped).  ..  ****************
  37  2017-02-07 10:59  35  ****************

SCT Error Recovery Control:
  Read:  70 (7.0 seconds)
  Write:  70 (7.0 seconds)

Device Statistics (GP/SMART Log 0x04) not supported

SATA Phy Event Counters (GP Log 0x11)
ID  Size  Value  Description
0x0001  2  0  Command failed due to ICRC error
0x0002  2  0  R_ERR response for data FIS
0x0003  2  0  R_ERR response for device-to-host data FIS
0x0004  2  0  R_ERR response for host-to-device data FIS
0x0005  2  0  R_ERR response for non-data FIS
0x0006  2  0  R_ERR response for device-to-host non-data FIS
0x0007  2  0  R_ERR response for host-to-device non-data FIS
0x0008  2  0  Device-to-host non-data FIS retries
0x0009  2  7  Transition from drive PhyRdy to drive PhyNRdy
0x000a  2  8  Device-to-host register FISes sent due to a COMRESET
0x000b  2  0  CRC errors within host-to-device FIS
0x000f  2  0  R_ERR response for host-to-device data FIS, CRC
0x0012  2  0  R_ERR response for host-to-device non-data FIS, CRC
0x8000  4  1889227  Vendor specific

 

Ericloewe

Server Wrangler
Moderator
Joined
Feb 15, 2014
Messages
20,194
The first thing that concerns me is the HBA firmware. Yes, it's P20, but P20 saw a number of revisions that fixed some nasty bugs.
I recommend you start by updating to 20.00.07
 

28061

Contributor
Joined
Oct 13, 2014
Messages
120
The first thing that concerns me is the HBA firmware. Yes, it's P20, but P20 saw a number of revisions that fixed some nasty bugs.
I recommend you start by updating to 20.00.07

Done. What next?

Code:
        Adapter Selected is a LSI SAS: SAS2308_1(D1)                          
                                                                              
Num   Ctlr            FW Ver        NVDATA        x86-BIOS         PCI Addr    
----------------------------------------------------------------------------                                                                                
0  SAS2308_1(D1)   20.00.07.00    14.01.30.16    07.39.02.00     00:02:00:00  
                                                                              
        Finished Processing Commands Successfully.                            
        Exiting SAS2Flash.        
 

Ericloewe

Server Wrangler
Moderator
Joined
Feb 15, 2014
Messages
20,194
Replace the drive, possibly with itself. If you have a spare (which is always a very good idea), use that one and thoroughly test the old drive.

If you don't have a spare, wipe the drive and resilver it and keep a close eye on it - and get a spare.
 

28061

Contributor
Joined
Oct 13, 2014
Messages
120
I only have a spare WD Green. I know they're not recommended for NAS, and I didn't want to mix & match.
So I removed the old drive, wiped it, and replaced it with itself. The HDD activity blitzed away, so I assume they were redistributing the data. Did originally have a green light, and no errors... But...

Now I have this:
DiskStatus.jpg

And this output from zpool status.
Code:
[root@Freenas ~]# zpool status                                                                                                     
  pool: Pool1                                                                                                                       
state: ONLINE                                                                                                                     
status: One or more devices has experienced an unrecoverable error.  An                                                             
        attempt was made to correct the error.  Applications are unaffected.                                                       
action: Determine if the device needs to be replaced, and clear the errors                                                         
        using 'zpool clear' or replace the device with 'zpool replace'.                                                             
   see: http://illumos.org/msg/ZFS-8000-9P                                                                                         
  scan: scrub in progress since Tue Feb  7 18:27:25 2017                                                                           
        86.5G scanned out of 11.7T at 142M/s, 23h54m to go                                                                         
        8K repaired, 0.72% done                                                                                                     
config:                                                                                                                             
                                                                                                                                   
        NAME                                            STATE     READ WRITE CKSUM                                                 
        Pool1                                           ONLINE       0     0     0                                                 
          raidz2-0                                      ONLINE       0     0     0                                                 
            gptid/d00a3c6d-ad4b-11e4-ac94-0cc47a31405a  ONLINE       0     0     0                                                 
            gptid/d06bc743-ad4b-11e4-ac94-0cc47a31405a  ONLINE       0     0     0                                                 
            gptid/d0ce9963-ad4b-11e4-ac94-0cc47a31405a  ONLINE       0     0     0                                                 
            gptid/d13871d1-ad4b-11e4-ac94-0cc47a31405a  ONLINE       0     0     0                                                 
            da4p2                                       ONLINE       0     0     2  (repairing)                                     
            gptid/d201eeae-ad4b-11e4-ac94-0cc47a31405a  ONLINE       0     0     0                                                 
                                                                                                                                   
errors: No known data errors             


Is it likely to be ok when resilvered?
Please tell me it'll be ok!
 

28061

Contributor
Joined
Oct 13, 2014
Messages
120
Updated Zpool status output

Code:
[root@Freenas ~]# zpool status                                                                                                     
  pool: Pool1                                                                                                                       
state: DEGRADED                                                                                                                   
status: One or more devices has experienced an unrecoverable error.  An                                                             
        attempt was made to correct the error.  Applications are unaffected.                                                       
action: Determine if the device needs to be replaced, and clear the errors                                                         
        using 'zpool clear' or replace the device with 'zpool replace'.                                                             
   see: http://illumos.org/msg/ZFS-8000-9P                                                                                         
  scan: scrub in progress since Tue Feb  7 18:27:25 2017                                                                           
        1.37T scanned out of 11.7T at 350M/s, 8h36m to go                                                                           
        93.2M repaired, 11.74% done                                                                                                 
config:                                                                                                                             
                                                                                                                                   
        NAME                                            STATE     READ WRITE CKSUM                                                 
        Pool1                                           DEGRADED     0     0     0                                                 
          raidz2-0                                      DEGRADED     0     0     0                                                 
            gptid/d00a3c6d-ad4b-11e4-ac94-0cc47a31405a  ONLINE       0     0     0                                                 
            gptid/d06bc743-ad4b-11e4-ac94-0cc47a31405a  ONLINE       0     0     0                                                 
            gptid/d0ce9963-ad4b-11e4-ac94-0cc47a31405a  ONLINE       0     0     0                                                 
            gptid/d13871d1-ad4b-11e4-ac94-0cc47a31405a  ONLINE       0     0     0                                                 
            da4p2                                       DEGRADED     0     0 2.94K  too many errors  (repairing)                   
            gptid/d201eeae-ad4b-11e4-ac94-0cc47a31405a  ONLINE       0     0     0                                                 
                                                                                                                                   
errors: No known data errors                   
 

Ericloewe

Server Wrangler
Moderator
Joined
Feb 15, 2014
Messages
20,194
Ok, so we're probably dealing with a bad drive that isn't cooperating in SMART or a bad cable with an unusual lack of error messages.

Of course, the cable is the easiest to replace, so start with that. I don't have high hopes for it to work, though.
 

28061

Contributor
Joined
Oct 13, 2014
Messages
120
Any suggestion as to which software to use? I've had a look, but most ones with decent visual output seem to cost for the write tests..

In the meantime, what do you think of this? :eek:
Diag.jpg

After the resilvering, the errors persisted in FreeNas. SMART still reports no errors. But it's obviously repeatedly failing real world application.
 
Last edited:

rs225

Guru
Joined
Jun 28, 2014
Messages
878
Was the system hard power cycled after the firmware update?

Keep in mind, the actual error may be only on read. There may be nothing wrong with the data written on that drive. And the corruption may be occurring in the HBA.

It might be a good test to swap two drives, and see if the error follows the drive or stays with the port, or disappears.

And if you have warranty, contacting the manufacturer may be a good idea.
 

28061

Contributor
Joined
Oct 13, 2014
Messages
120
Yes, the system was hard power cycled post firmware.

I've got no other spare drives, and I don't want to swap two drives, because that'd mean having to format and lose both of my redundancies for the rebuild.

There's another ten months of warrantee on my drives & WD have offered to RMA it today, so I'll be returning it shortly. More info to follow in due course.
 

Glorious1

Guru
Joined
Nov 23, 2014
Messages
1,211
Could I ask how you find the firmware? sas2flash says my HBA is SAS2008. When I go to this page
https://www.broadcom.com/support/download-search/
and search for that, there is no firmware for it. When I search for Megaraid 9240-8i, which I wrote down from the box, there are a number of firmwares, but even in the archive tab, I can't find a version number that low.
 

rs225

Guru
Joined
Jun 28, 2014
Messages
878
Unless I am missing something, there is no reason to require losing redundancy. You would shut down the power, swap two drives, power up. That alone does not require a rebuild of either disk.
 

Ericloewe

Server Wrangler
Moderator
Joined
Feb 15, 2014
Messages
20,194
Could I ask how you find the firmware? sas2flash says my HBA is SAS2008. When I go to this page
https://www.broadcom.com/support/download-search/
and search for that, there is no firmware for it. When I search for Megaraid 9240-8i, which I wrote down from the box, there are a number of firmwares, but even in the archive tab, I can't find a version number that low.
No, the controller on your HBA is an SAS2008. The card would be an SAS 9211, for instance.
 
Status
Not open for further replies.
Top