getting self-test errors on several (relatively new) WD Red Plus drives

Joined
Mar 5, 2022
Messages
224
Log says:
Jul 3 05:21:37 phong 1 2022-07-03T05:21:37.060403-04:00 phong.thompco.com smartd 1332 - - Device: /dev/ada0, previous self-test completed with error (read test element)
Jul 3 05:21:37 phong 1 2022-07-03T05:21:37.060689-04:00 phong.thompco.com smartd 1332 - - Device: /dev/ada0, new Self-Test Log error at hour timestamp 1634
Jul 3 05:21:37 phong 1 2022-07-03T05:21:37.155403-04:00 phong.thompco.com smartd 1332 - - Device: /dev/ada0, previous self-test completed with error (read test element)
Jul 3 05:21:37 phong 1 2022-07-03T05:21:37.155686-04:00 phong.thompco.com smartd 1332 - - Device: /dev/ada0, new Self-Test Log error at hour timestamp 1634

Jul 3 06:21:35 phong 1 2022-07-03T06:21:35.487169-04:00 phong.thompco.com smartd 1332 - - Device: /dev/ada1, previous self-test completed with error (read test element)
Jul 3 06:21:35 phong 1 2022-07-03T06:21:35.487506-04:00 phong.thompco.com smartd 1332 - - Device: /dev/ada1, new Self-Test Log error at hour timestamp 1968
Jul 3 06:21:35 phong 1 2022-07-03T06:21:35.553747-04:00 phong.thompco.com smartd 1332 - - Device: /dev/ada1, previous self-test completed with error (read test element)
Jul 3 06:21:35 phong 1 2022-07-03T06:21:35.554120-04:00 phong.thompco.com smartd 1332 - - Device: /dev/ada1, new Self-Test Log error at hour timestamp 1968

Jul 3 05:21:36 phong 1 2022-07-03T05:21:36.934779-04:00 phong.thompco.com smartd 1332 - - Device: /dev/ada2, previous self-test completed with error (read test element)
Jul 3 05:21:36 phong 1 2022-07-03T05:21:36.935063-04:00 phong.thompco.com smartd 1332 - - Device: /dev/ada2, new Self-Test Log error at hour timestamp 1663
Jul 3 05:21:37 phong 1 2022-07-03T05:21:37.019230-04:00 phong.thompco.com smartd 1332 - - Device: /dev/ada2, previous self-test completed with error (read test element)
Jul 3 05:21:37 phong 1 2022-07-03T05:21:37.019511-04:00 phong.thompco.com smartd 1332 - - Device: /dev/ada2, new Self-Test Log error at hour timestamp 1663

Jul 3 05:21:37 phong 1 2022-07-03T05:21:37.250777-04:00 phong.thompco.com smartd 1332 - - Device: /dev/ada3, previous self-test completed with error (read test element)
Jul 3 05:21:37 phong 1 2022-07-03T05:21:37.275287-04:00 phong.thompco.com smartd 1332 - - Device: /dev/ada3, new Self-Test Log error at hour timestamp 1841
Jul 3 05:21:37 phong 1 2022-07-03T05:21:37.341636-04:00 phong.thompco.com smartd 1332 - - Device: /dev/ada3, previous self-test completed with error (read test element)
Jul 3 05:21:37 phong 1 2022-07-03T05:21:37.359469-04:00 phong.thompco.com smartd 1332 - - Device: /dev/ada3, new Self-Test Log error at hour timestamp 1841
I ran the smartctl command for all four drives (with # replaced by 0 through 3) and, from what I can tell, there are no errors (the tests all look like they passed).
sudo smartctl -a /dev/ada# > /tmp/ada#.txt
What am I missing?
The output files are attached... Can I ignore these warnings? If so, can I suppress them?
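For later readers: the self-test log is easy to miss in a full smartctl -a dump. Here is a minimal sketch that prints only the self-test rows reporting a failure. The function names failed_selftests and check_drives are made up for illustration, and the pattern assumes the row layout shown later in this thread (a leading # and number, with the word "failure" in the status column).

```shell
#!/bin/sh
# Print only the self-test log rows whose status mentions a failure.
# Reads smartctl output on stdin.
failed_selftests() {
    grep -E '^# *[0-9]+ .*failure'
}

# Hypothetical helper: show the failing rows for each device given as an argument.
# Needs root, because smartctl talks to the raw device.
check_drives() {
    for dev in "$@"; do
        echo "== $dev"
        smartctl -l selftest "$dev" | failed_selftests
    done
}
```

Run as root, something like check_drives /dev/ada0 /dev/ada1 /dev/ada2 /dev/ada3 would cover the four drives from the log. A drive with no failing rows prints only its header line.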
 

Attachments: ada0.txt, ada1.txt, ada2.txt, ada3.txt (5.6 KB each)
Joined
Jan 18, 2017
Messages
525
"there are no errors (the tests all look like they passed.)"
You'd better go look at the results of the extended tests again...

Code:
ADA0
SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed: read failure       10%      1634         1198863456
ADA1
SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed: read failure       10%      1968         1622205128


Are these drives still under warranty?
 
Joined
Mar 5, 2022
Messages
224
Thankfully(?) they are new and still under warranty. This is a new NAS that I recently put together. It's a bit disturbing that so many of these drives (I have 6) are bad!

OK, I found the errors in the output files you were talking about (duh):
Device Model: WDC WD20EFZX-68AWUN0
SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed: read failure       10%      1968         1622205128
# 8  Extended offline    Completed: read failure       60%      1795         1622205128
#15  Extended offline    Completed: read failure       60%      1628         1622205128

Device Model: WDC WD20EFZX-68AWUN0
SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed: read failure       10%      1634         1198863456
# 8  Extended offline    Completed: read failure       70%      1462         1198863456
#15  Extended offline    Completed: read failure       70%      1294         1198863456

Device Model: WDC WD20EFZX-68AWUN0
SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed: read failure       10%      1663         1198863456
# 8  Extended offline    Completed: read failure       70%      1491         1198863456
#15  Extended offline    Completed: read failure       70%      1324         1198863456

Device Model: WDC WD20EFZX-68AWUN0
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed: read failure       10%      1841         1173868576
# 8  Extended offline    Completed: read failure       70%      1669         1173868576
#15  Extended offline    Completed: read failure       70%      1502         1173868576
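
Reading the rows above: on each drive the test stops at the same LBA on every run, and the failing runs are roughly a week of power-on time apart. Here is a rough sketch for pulling that out of a saved smartctl output. The function name selftest_gaps is made up for illustration, and it assumes the last two columns are LifeTime(hours) and LBA_of_first_error, with the newest run listed first, as in the logs above.

```shell
#!/bin/sh
# For each failing self-test row (newest first), print the power-on hour of the
# latest failure, then how many hours before the previous row each older one ran.
# Reads smartctl output on stdin.
selftest_gaps() {
    awk '/^# *[0-9]+ .*failure/ {
        hours = $(NF-1); lba = $NF
        if (seen) printf "%d hours earlier, LBA %s\n", prev - hours, lba
        else      printf "latest failure at hour %d, LBA %s\n", hours, lba
        prev = hours; seen = 1
    }'
}
```

For example, selftest_gaps < /tmp/ada1.txt on the saved output for the drive with the 1968-hour failure would report gaps of 173 and 167 hours, all at LBA 1622205128.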
 

Redcoat

MVP
Joined
Feb 18, 2014
Messages
2,925
Are those 4 drives all on the Marvell chipset SATA ports, by chance? Or is there some other logical grouping that might point at a controller issue affecting the 4 drives?

And did you burn in those drives before putting them into service?
 
Joined
Jan 18, 2017
Messages
525
lol Redcoat you beat me to the punch about burn-in

@jordanthompson if you can safely move one of those drives to another computer and run Western Digital Data Lifeguard Diagnostic, it would be interesting to see what it reports

oh, and obviously make SURE your data is properly backed up ASAP
 
Joined
Mar 5, 2022
Messages
224
@Redcoat you hit the nail on the head! They ARE all on the same controller card (su-sa3034a)...

I see recommendations for LSI SAS cards, but unfortunately I only have PCIe 2.0 slots available...
Any suggestions for a replacement?

I know nothing about burning in drives, so no, I didn't.
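
For anyone else new to burn-in: the idea is to stress a new, empty drive before trusting it with data. The sketch below only prints a commonly used sequence and does not run anything. The function name burn_in_plan is made up, and the choice and order of steps is an assumption, not something this thread prescribes. The badblocks write pass erases the drive, so it is only suitable for a disk with nothing on it.

```shell
#!/bin/sh
# Dry run: print a typical burn-in sequence for one NEW, EMPTY drive.
# Nothing is executed; copy the lines out once you are sure of the device name.
burn_in_plan() {
    dev=$1
    echo "smartctl -t short $dev      # quick self-test"
    echo "smartctl -t long $dev       # full-surface read test, takes hours"
    echo "badblocks -b 4096 -ws $dev  # DESTRUCTIVE write/read pass, erases the drive"
    echo "smartctl -t long $dev       # repeat the long test after the write pass"
    echo "smartctl -a $dev            # review attributes and the self-test log"
}
```

Calling burn_in_plan /dev/ada0 prints the five commands for that device. Each self-test has to finish before the next step is started.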
 
Joined
Mar 5, 2022
Messages
224
Found (made) a spare PCIe x16 slot...
I know nothing about SAS.
Here's what I just ordered:
1x
LSI Logic Controller Card LSI00301 SAS 9207-8i 8Port Internal SAS/SATA 6Gb/s PCI Express Single Retail https://a.co/d/e2akm8E

2x
ANNNWZZD Internal Mini SAS to SATA Cable,(SFF-8643 to 4 SATA Forward Breakout) Mini SAS- 4X SATA Cable (1.5FT/0.5M) https://a.co/d/05zffI5

Hopefully this will all play nicely together
 

runningdeer

Cadet
Joined
Sep 24, 2023
Messages
2
Hi there. I found this thread through google and was wondering if the new HBA resolved your problems. Thanks!
 
Joined
Mar 5, 2022
Messages
224
Yes! I currently have a power supply problem (unrelated) and recently had to replace the grease between my CPU and heat sink, but the controller works great (although it did die and I was able to replace it under warranty).
 

runningdeer

Cadet
Joined
Sep 24, 2023
Messages
2
That's great and thanks for letting me know!
 