I think I bought a bad drive....

TK-24601

Dabbler
Joined
Sep 27, 2018
Messages
13
I am setting up my system and have run the Short and Long tests prior to starting the badblocks test. Using FreeNAS is new to me and I haven't spent much time in this world, but I think based on the current screen and alerts I am seeing on 1 HDD, I think I have a bad drive out of the box.

I am testing 6 HDDs @ 4TB each. All are HGST Deskstar. In the SSH console screen I am watching the test for this drive scroll up with the following lines nonstop:
FreeNAS screen shot.jpg

I have had 2 alerts pop up over the last two days. The first one:
Device: /dev/da1 [SAT], ATA error count increased from 0 to 300

Second Yesterday:
Device: /dev/da1 [SAT], ATA error count increased from 53748 to 57683

Should I assume my drive is crap and stop the badblocks test now to send it back for replacement? Any input on the screen shot would be greatly appreciated!
 

danb35

Hall of Famer
Joined
Aug 16, 2011
Messages
15,504
Should I assume my drive is crap and stop the badblocks test now to send it back for replacement?
Probably. I'd stop badblocks and run a long smart test. If it passes that, we'll have a strange situation--my guess is it won't.
 

TK-24601

Dabbler
Joined
Sep 27, 2018
Messages
13
I will stop it and run a long test and report back.
 

TK-24601

Dabbler
Joined
Sep 27, 2018
Messages
13
Well The badblocks test finished for the other 5 drives without issue. All 5 reported no errors. Here are the results of a long smart test of the previous mentioned drive:
Code:
=== START OF INFORMATION SECTION ===
Model Family:     HGST Deskstar NAS
Device Model:     HGST HDN726040ALE614
Serial Number:    K7J8YDYL
LU WWN Device Id: 5 000cca 269e04068
Firmware Version: APGNW7JH
User Capacity:    4,000,787,030,016 bytes [4.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    7200 rpm
Form Factor:      3.5 inches
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-2, ATA8-ACS T13/1699-D revision 4
SATA Version is:  SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Sun Jan 13 16:15:22 2019 MST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
                                        was completed without error.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (  113) seconds.
Offline data collection
capabilities:                    (0x5b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        No Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        ( 571) minutes.
SCT capabilities:              (0x003d) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_    FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000b   100   100   016    Pre-fail  Always       -    0
  2 Throughput_Performance  0x0005   137   137   054    Pre-fail  Offline      -    104
  3 Spin_Up_Time            0x0007   100   100   024    Pre-fail  Always       -    0
  4 Start_Stop_Count        0x0012   100   100   000    Old_age   Always       -    10
  5 Reallocated_Sector_Ct   0x0033   100   100   005    Pre-fail  Always       -    0
  7 Seek_Error_Rate         0x000b   100   100   067    Pre-fail  Always       -    0
  8 Seek_Time_Performance   0x0005   128   128   020    Pre-fail  Offline      -    18
  9 Power_On_Hours          0x0012   100   100   000    Old_age   Always       -    96
 10 Spin_Retry_Count        0x0013   100   100   060    Pre-fail  Always       -    0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -    10
192 Power-Off_Retract_Count 0x0032   100   100   000    Old_age   Always       -    11
193 Load_Cycle_Count        0x0012   100   100   000    Old_age   Always       -    11
194 Temperature_Celsius     0x0002   176   176   000    Old_age   Always       -    34 (Min/Max 20/38)
196 Reallocated_Event_Count 0x0032   100   100   000    Old_age   Always       -    0
197 Current_Pending_Sector  0x0022   100   100   000    Old_age   Always       -    0
198 Offline_Uncorrectable   0x0008   100   100   000    Old_age   Offline      -    0
199 UDMA_CRC_Error_Count    0x000a   191   191   000    Old_age   Always       -    769624

SMART Error Log Version: 1
ATA Error Count: 65535 (device log contains only the most recent five errors)
        CR = Command Register [HEX]
        FR = Features Register [HEX]
        SC = Sector Count Register [HEX]
        SN = Sector Number Register [HEX]
        CL = Cylinder Low Register [HEX]
        CH = Cylinder High Register [HEX]
        DH = Device/Head Register [HEX]
        DC = Device Command Register [HEX]
        ER = Error register [HEX]
        ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 65535 occurred at disk power-on lifetime: 72 hours (3 days + 0 hours)
  When the command that caused the error occurred, the device was doing SMART Of
fline or Self-test.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  84 41 00 00 00 00 00  Error: ICRC, ABRT at LBA = 0x00000000 = 0

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  60 00 00 00 e4 8c 40 00   2d+23:21:58.020  READ FPDMA QUEUED
  60 00 00 00 e3 8c 40 00   2d+23:21:58.019  READ FPDMA QUEUED
  60 00 00 00 e2 8c 40 00   2d+23:21:58.019  READ FPDMA QUEUED
  60 00 00 00 e1 8c 40 00   2d+23:21:58.018  READ FPDMA QUEUED
  60 00 00 00 e0 8c 40 00   2d+23:21:58.010  READ FPDMA QUEUED

Error 65534 occurred at disk power-on lifetime: 72 hours (3 days + 0 hours)
  When the command that caused the error occurred, the device was doing SMART Of
fline or Self-test.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  84 41 00 00 00 00 00  Error: ICRC, ABRT at LBA = 0x00000000 = 0

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  60 00 00 00 e0 8c 40 00   2d+23:21:58.010  READ FPDMA QUEUED
  60 00 00 00 df 8c 40 00   2d+23:21:58.008  READ FPDMA QUEUED
  60 00 00 00 de 8c 40 00   2d+23:21:58.008  READ FPDMA QUEUED
  60 00 00 00 dd 8c 40 00   2d+23:21:58.000  READ FPDMA QUEUED
  2f 00 01 10 00 00 00 00   2d+23:21:57.999  READ LOG EXT

Error 65533 occurred at disk power-on lifetime: 72 hours (3 days + 0 hours)
  When the command that caused the error occurred, the device was doing SMART Of
fline or Self-test.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  84 41 00 00 00 00 00  Error: ICRC, ABRT at LBA = 0x00000000 = 0

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  60 00 00 00 dd 8c 40 00   2d+23:21:57.999  READ FPDMA QUEUED
  60 00 00 00 dc 8c 40 00   2d+23:21:57.998  READ FPDMA QUEUED
  60 00 00 00 db 8c 40 00   2d+23:21:57.998  READ FPDMA QUEUED
  60 00 00 00 da 8c 40 00   2d+23:21:57.997  READ FPDMA QUEUED
  60 00 00 00 d9 8c 40 00   2d+23:21:57.996  READ FPDMA QUEUED

Error 65532 occurred at disk power-on lifetime: 72 hours (3 days + 0 hours)
  When the command that caused the error occurred, the device was doing SMART Of
fline or Self-test.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  84 41 00 00 00 00 00  Error: ICRC, ABRT at LBA = 0x00000000 = 0

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  60 00 00 00 d4 8c 40 00   2d+23:21:57.985  READ FPDMA QUEUED
  60 00 00 00 d3 8c 40 00   2d+23:21:57.985  READ FPDMA QUEUED
  60 00 00 00 d2 8c 40 00   2d+23:21:57.984  READ FPDMA QUEUED
  60 00 00 00 d1 8c 40 00   2d+23:21:57.976  READ FPDMA QUEUED
  2f 00 01 10 00 00 00 00   2d+23:21:57.975  READ LOG EXT

Error 65531 occurred at disk power-on lifetime: 72 hours (3 days + 0 hours)
  When the command that caused the error occurred, the device was doing SMART Of
fline or Self-test.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  84 41 00 00 00 00 00  Error: ICRC, ABRT at LBA = 0x00000000 = 0

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  60 00 00 00 d1 8c 40 00   2d+23:21:57.975  READ FPDMA QUEUED
  60 00 00 00 d0 8c 40 00   2d+23:21:57.974  READ FPDMA QUEUED
  60 00 00 00 cf 8c 40 00   2d+23:21:57.973  READ FPDMA QUEUED
  60 00 00 00 ce 8c 40 00   2d+23:21:57.973  READ FPDMA QUEUED
  60 00 00 00 cd 8c 40 00   2d+23:21:57.972  READ FPDMA QUEUED

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA
_of_first_error
# 1  Extended offline    Completed without error       00%        84         -
# 2  Extended offline    Completed without error       00%        13         -
# 3  Short offline       Completed without error       00%         4         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
 
Last edited:

danb35

Hall of Famer
Joined
Aug 16, 2011
Messages
15,504
Here are the results of a long smart test of the previous mentioned drive:
Try again, all the attributes on the right are cut off. Though it did (surprisingly) pass the SMART self-test.
 

TK-24601

Dabbler
Joined
Sep 27, 2018
Messages
13
Try again, all the attributes on the right are cut off. Though it did (surprisingly) pass the SMART self-test.
Corrected. The Failed Raw_Value column is now visible.
 

danb35

Hall of Famer
Joined
Aug 16, 2011
Messages
15,504
OK, so the only error showing is the CRC errors, which often indicate cabling problems rather than disk errors. Check your connections, and maybe try replacing the SATA cable.
 

TK-24601

Dabbler
Joined
Sep 27, 2018
Messages
13
OK, so the only error showing is the CRC errors, which often indicate cabling problems rather than disk errors. Check your connections, and maybe try replacing the SATA cable.
That is good to know. I will check the cable first and then run the badblocks test again. It is apart of a mini SAS to 4-SATA, so if this 2nd go round fails me, I'll to replace the SAS cable.

I greatly appreciate the help! I was concerned I got a bad drive straight from the box.
 
Top