SOLVED CAM status: SCSI Status Error

Status
Not open for further replies.

encbox

Dabbler
Joined
Mar 27, 2017
Messages
25
Build FreeNAS-9.10.2-U2 (e1497f2)

The daily security run output email contained the following kernel message. Do I need to be concerned?
Code:
(da1:mps0:0:0:0): READ(10). CDB: 28 00 02 61 9e 30 00 01 00 00
(da1:mps0:0:0:0): CAM status: SCSI Status Error
(da1:mps0:0:0:0): SCSI status: Check Condition
(da1:mps0:0:0:0): SCSI sense: MEDIUM ERROR asc:11,0 (Unrecovered read error)
(da1:mps0:0:0:0): Info: 0x2619e60
(da1:mps0:0:0:0): Error 5, Unretryable error
GEOM_ELI: g_eli_read_done() failed (error=5) gptid/90e18512-8339-11e5-9251-000c2997875a.eli[READ(offset=18307833856, length=131072)]
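
For anyone who wants to check how the kernel message lines up with what the drive reports, the READ(10) CDB can be decoded by hand. This is only a sketch: it assumes the standard 10-byte READ(10) layout (opcode 0x28, 4-byte big-endian LBA, 2-byte transfer length) and the 512-byte logical sector size that smartctl reports for this drive. The helper name `decode_read10` is made up for illustration.

```python
import struct

LOGICAL_SECTOR = 512  # bytes, per "Sector Sizes" in the smartctl output

def decode_read10(cdb_hex):
    """Return (start_lba, block_count) for a READ(10) CDB given as hex text."""
    # Whitespace is ignored, so "61 9e" and "619e" decode the same way.
    cdb = bytes.fromhex("".join(cdb_hex.split()))
    if len(cdb) != 10 or cdb[0] != 0x28:
        raise ValueError("not a READ(10) CDB")
    # opcode, flags, LBA (big-endian u32), group, transfer length (u16), control
    _op, _flags, lba, _group, count, _ctrl = struct.unpack(">BBIBHB", cdb)
    return lba, count

start_lba, count = decode_read10("28 00 02 61 9e 30 00 01 00 00")
failing_lba = 0x2619E60      # "Info:" field of the sense data
geli_offset = 18307833856    # offset reported by GEOM_ELI

print(start_lba, count)                          # 39951920 256
print(count * LOGICAL_SECTOR)                    # 131072, same as the GEOM_ELI length
print(failing_lba - start_lba)                   # 48 sectors into the request
print(start_lba * LOGICAL_SECTOR - geli_offset)  # 2147549184
```

The last number is the byte distance between the disk address and the offset GEOM_ELI reports. If both messages describe the same request, that would be where the encrypted partition starts, but the partition table is not shown in this thread, so `gpart show da1` would be the way to confirm it.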

smartctl -a /dev/da1 output:

Code:
smartctl 6.5 2016-05-07 r4318 [FreeBSD 10.3-STABLE amd64] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:	 Western Digital Red
Device Model:	 WDC WD30EFRX-68EUZN0
Serial Number:	WD-WCCxxxxxxxxx
LU WWN Device Id: 5 0014ee 20c6f3d5c
Firmware Version: 82.00A82
User Capacity:	3,000,592,982,016 bytes [3.00 TB]
Sector Sizes:	 512 bytes logical, 4096 bytes physical
Rotation Rate:	5400 rpm
Device is:		In smartctl database [for details use: -P show]
ATA Version is:   ACS-2 (minor revision not indicated)
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:	Mon Mar 27 10:11:54 2017 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
										was never started.
										Auto Offline Data Collection: Disabled.
Self-test execution status:	  (   0) The previous self-test routine completed
										without error or no self-test has ever 
										been run.
Total time to complete Offline 
data collection:				(39360) seconds.
Offline data collection
capabilities:					(0x7b) SMART execute Offline immediate.
										Auto Offline data collection on/off support.
										Suspend Offline collection upon new
										command.
										Offline surface scan supported.
										Self-test supported.
										Conveyance Self-test supported.
										Selective Self-test supported.
SMART capabilities:			(0x0003) Saves SMART data before entering
										power-saving mode.
										Supports SMART auto save timer.
Error logging capability:		(0x01) Error logging supported.
										General Purpose Logging supported.
Short self-test routine 
recommended polling time:		(   2) minutes.
Extended self-test routine
recommended polling time:		( 395) minutes.
Conveyance self-test routine
recommended polling time:		(   5) minutes.
SCT capabilities:			  (0x703d) SCT Status supported.
										SCT Error Recovery Control supported.
										SCT Feature Control supported.
										SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME		  FLAG	 VALUE WORST THRESH TYPE	  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate	 0x002f   200   200   051	Pre-fail  Always	   -	   4
  3 Spin_Up_Time			0x0027   179   177   021	Pre-fail  Always	   -	   6050
  4 Start_Stop_Count		0x0032   100   100   000	Old_age   Always	   -	   74
  5 Reallocated_Sector_Ct   0x0033   200   200   140	Pre-fail  Always	   -	   0
  7 Seek_Error_Rate		 0x002e   200   200   000	Old_age   Always	   -	   0
  9 Power_On_Hours		  0x0032   084   084   000	Old_age   Always	   -	   11734
 10 Spin_Retry_Count		0x0032   100   253   000	Old_age   Always	   -	   0
 11 Calibration_Retry_Count 0x0032   100   253   000	Old_age   Always	   -	   0
 12 Power_Cycle_Count	   0x0032   100   100   000	Old_age   Always	   -	   73
192 Power-Off_Retract_Count 0x0032   200   200   000	Old_age   Always	   -	   71
193 Load_Cycle_Count		0x0032   200   200   000	Old_age   Always	   -	   314
194 Temperature_Celsius	 0x0022   122   114   000	Old_age   Always	   -	   28
196 Reallocated_Event_Count 0x0032   200   200   000	Old_age   Always	   -	   0
197 Current_Pending_Sector  0x0032   200   200   000	Old_age   Always	   -	   0
198 Offline_Uncorrectable   0x0030   100   253   000	Old_age   Offline	  -	   0
199 UDMA_CRC_Error_Count	0x0032   200   200   000	Old_age   Always	   -	   0
200 Multi_Zone_Error_Rate   0x0008   200   200   000	Old_age   Offline	  -	   2

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description	Status				  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline	   Completed without error	   00%	 11606		 -
# 2  Short offline	   Completed without error	   00%	 11438		 -
# 3  Short offline	   Completed without error	   00%	 11270		 -
# 4  Extended offline	Completed without error	   00%	 11134		 -
# 5  Short offline	   Completed without error	   00%	 11102		 -
# 6  Short offline	   Completed without error	   00%	 10935		 -
# 7  Short offline	   Completed without error	   00%	 10767		 -
# 8  Short offline	   Completed without error	   00%	 10599		 -
# 9  Extended offline	Completed without error	   00%	 10463		 -
#10  Short offline	   Completed without error	   00%	 10431		 -
#11  Short offline	   Completed without error	   00%	 10359		 -
#12  Short offline	   Completed without error	   00%	 10191		 -
#13  Short offline	   Completed without error	   00%	 10024		 -
#14  Short offline	   Completed without error	   00%	  9856		 -
#15  Extended offline	Completed without error	   00%	  9719		 -
#16  Short offline	   Completed without error	   00%	  9688		 -
#17  Short offline	   Completed without error	   00%	  9616		 -
#18  Short offline	   Completed without error	   00%	  9448		 -
#19  Short offline	   Completed without error	   00%	  9280		 -
#20  Short offline	   Completed without error	   00%	  9112		 -
#21  Extended offline	Completed without error	   00%	  8975		 -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
	1		0		0  Not_testing
	2		0		0  Not_testing
	3		0		0  Not_testing
	4		0		0  Not_testing
	5		0		0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
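
The attribute table is easier to scan with a script than by eye. The following is a rough sketch, not part of FreeNAS or smartmontools: it parses attribute rows in the `smartctl -a` layout shown above and pulls the raw values of a few attributes that are commonly watched for media problems. The choice of IDs (5, 197, 198) and the helper name `raw_values` are assumptions for illustration.

```python
def raw_values(table_text):
    """Map attribute ID -> (name, raw value) from smartctl -a attribute rows."""
    result = {}
    for line in table_text.splitlines():
        fields = line.split()
        # Attribute rows have at least 10 columns and start with a numeric ID;
        # header rows ("ID# ...") and self-test rows ("# 1 ...") are skipped.
        if len(fields) >= 10 and fields[0].isdigit():
            result[int(fields[0])] = (fields[1], int(fields[9]))
    return result

# Three rows copied from the output above.
sample = """\
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
"""

attrs = raw_values(sample)
for attr_id in (5, 197, 198):
    name, raw = attrs[attr_id]
    print(attr_id, name, raw)
```

In this output all three raw values are still 0 even though the kernel had already logged an unrecovered read error, so a clean attribute table on its own did not rule out a problem here.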

 

encbox

smartctl -x /dev/da1 output (the latest extended self-test now reports a read failure):

Code:
=== START OF INFORMATION SECTION ===
Model Family:	 Western Digital Red
Device Model:	 WDC WD30EFRX-68EUZN0
Serial Number:	WD-WCCxxxxxx
LU WWN Device Id: 5 0014ee 20c6f3d5c
Firmware Version: 82.00A82
User Capacity:	3,000,592,982,016 bytes [3.00 TB]
Sector Sizes:	 512 bytes logical, 4096 bytes physical
Rotation Rate:	5400 rpm
Device is:		In smartctl database [for details use: -P show]
ATA Version is:   ACS-2 (minor revision not indicated)
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:	Mon Mar 27 10:41:17 2017 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
AAM feature is:   Unavailable
APM feature is:   Unavailable
Rd look-ahead is: Enabled
Write cache is:   Enabled
ATA Security is:  Disabled, NOT FROZEN [SEC1]
Wt Cache Reorder: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
										was never started.
										Auto Offline Data Collection: Disabled.
Self-test execution status:	  ( 121) The previous self-test completed having
										the read element of the test failed.
Total time to complete Offline
data collection:				(39360) seconds.
Offline data collection
capabilities:					(0x7b) SMART execute Offline immediate.
										Auto Offline data collection on/off support.
										Suspend Offline collection upon new
										command.
										Offline surface scan supported.
										Self-test supported.
										Conveyance Self-test supported.
										Selective Self-test supported.
SMART capabilities:			(0x0003) Saves SMART data before entering
										power-saving mode.
										Supports SMART auto save timer.
Error logging capability:		(0x01) Error logging supported.
										General Purpose Logging supported.
Short self-test routine
recommended polling time:		(   2) minutes.
Extended self-test routine
recommended polling time:		( 395) minutes.
Conveyance self-test routine
recommended polling time:		(   5) minutes.
SCT capabilities:			  (0x703d) SCT Status supported.
										SCT Error Recovery Control supported.
										SCT Feature Control supported.
										SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME		  FLAGS	VALUE WORST THRESH FAIL RAW_VALUE
  1 Raw_Read_Error_Rate	 POSR-K   200   200   051	-	4
  3 Spin_Up_Time			POS--K   179   177   021	-	6050
  4 Start_Stop_Count		-O--CK   100   100   000	-	74
  5 Reallocated_Sector_Ct   PO--CK   200   200   140	-	0
  7 Seek_Error_Rate		 -OSR-K   200   200   000	-	0
  9 Power_On_Hours		  -O--CK   084   084   000	-	11734
 10 Spin_Retry_Count		-O--CK   100   253   000	-	0
 11 Calibration_Retry_Count -O--CK   100   253   000	-	0
 12 Power_Cycle_Count	   -O--CK   100   100   000	-	73
192 Power-Off_Retract_Count -O--CK   200   200   000	-	71
193 Load_Cycle_Count		-O--CK   200   200   000	-	314
194 Temperature_Celsius	 -O---K   122   114   000	-	28
196 Reallocated_Event_Count -O--CK   200   200   000	-	0
197 Current_Pending_Sector  -O--CK   200   200   000	-	0
198 Offline_Uncorrectable   ----CK   100   253   000	-	0
199 UDMA_CRC_Error_Count	-O--CK   200   200   000	-	0
200 Multi_Zone_Error_Rate   ---R--   200   200   000	-	2
							||||||_ K auto-keep
							|||||__ C event count
							||||___ R error rate
							|||____ S speed/performance
							||_____ O updated online
							|______ P prefailure warning

General Purpose Log Directory Version 1
SMART		   Log Directory Version 1 [multi-sector log support]
Address	Access  R/W   Size  Description
0x00	   GPL,SL  R/O	  1  Log Directory
0x01		   SL  R/O	  1  Summary SMART error log
0x02		   SL  R/O	  5  Comprehensive SMART error log
0x03	   GPL	 R/O	  6  Ext. Comprehensive SMART error log
0x06		   SL  R/O	  1  SMART self-test log
0x07	   GPL	 R/O	  1  Extended self-test log
0x09		   SL  R/W	  1  Selective self-test log
0x10	   GPL	 R/O	  1  SATA NCQ Queued Error log
0x11	   GPL	 R/O	  1  SATA Phy Event Counters log
0x21	   GPL	 R/O	  1  Write stream error log
0x22	   GPL	 R/O	  1  Read stream error log
0x80-0x9f  GPL,SL  R/W	 16  Host vendor specific log
0xa0-0xa7  GPL,SL  VS	  16  Device vendor specific log
0xa8-0xb7  GPL,SL  VS	   1  Device vendor specific log
0xbd	   GPL,SL  VS	   1  Device vendor specific log
0xc0	   GPL,SL  VS	   1  Device vendor specific log
0xc1	   GPL	 VS	  93  Device vendor specific log
0xe0	   GPL,SL  R/W	  1  SCT Command/Status
0xe1	   GPL,SL  R/W	  1  SCT Data Transfer

SMART Extended Comprehensive Error Log Version: 1 (6 sectors)
Device Error Count: 1
		CR	 = Command Register
		FEATR  = Features Register
		COUNT  = Count (was: Sector Count) Register
		LBA_48 = Upper bytes of LBA High/Mid/Low Registers ]  ATA-8
		LH	 = LBA High (was: Cylinder High) Register	]   LBA
		LM	 = LBA Mid (was: Cylinder Low) Register	  ] Register
		LL	 = LBA Low (was: Sector Number) Register	 ]
		DV	 = Device (was: Device/Head) Register
		DC	 = Device Control Register
		ER	 = Error register
		ST	 = Status register
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 1 [0] occurred at disk power-on lifetime: 11705 hours (487 days + 17 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 00 02 61 9e 60 40 00  Error: UNC at LBA = 0x02619e60 = 39951968

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  60 01 00 00 00 00 00 02 61 9e 30 40 00 27d+01:15:43.824  READ FPDMA QUEUED
  60 01 00 00 00 00 00 02 61 9d 30 40 00 27d+01:15:43.335  READ FPDMA QUEUED
  60 01 00 00 00 00 00 02 61 9c 30 40 00 27d+01:15:42.845  READ FPDMA QUEUED
  60 01 00 00 00 00 00 02 61 9b 30 40 00 27d+01:15:42.844  READ FPDMA QUEUED
  60 01 00 00 00 00 00 02 61 9a 30 40 00 27d+01:15:42.843  READ FPDMA QUEUED

SMART Extended Self-test Log Version: 1 (1 sectors)
Num  Test_Description	Status				  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline	Completed: read failure	   90%	 11734		 39952032
# 2  Short offline	   Completed without error	   00%	 11606		 -
# 3  Short offline	   Completed without error	   00%	 11438		 -
# 4  Short offline	   Completed without error	   00%	 11270		 -
# 5  Extended offline	Completed without error	   00%	 11134		 -
# 6  Short offline	   Completed without error	   00%	 11102		 -
# 7  Short offline	   Completed without error	   00%	 10935		 -
# 8  Short offline	   Completed without error	   00%	 10767		 -
# 9  Short offline	   Completed without error	   00%	 10599		 -
#10  Extended offline	Completed without error	   00%	 10463		 -
#11  Short offline	   Completed without error	   00%	 10431		 -
#12  Short offline	   Completed without error	   00%	 10359		 -
#13  Short offline	   Completed without error	   00%	 10191		 -
#14  Short offline	   Completed without error	   00%	 10024		 -
#15  Short offline	   Completed without error	   00%	  9856		 -
#16  Extended offline	Completed without error	   00%	  9719		 -
#17  Short offline	   Completed without error	   00%	  9688		 -
#18  Short offline	   Completed without error	   00%	  9616		 -

SMART Selective self-test log data structure revision number 1
SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
	1		0		0  Not_testing
	2		0		0  Not_testing
	3		0		0  Not_testing
	4		0		0  Not_testing
	5		0		0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

SCT Status Version:				  3
SCT Version (vendor specific):	   258 (0x0102)
SCT Support Level:				   1
Device State:						Active (0)
Current Temperature:					28 Celsius
Power Cycle Min/Max Temperature:	 21/32 Celsius
Lifetime	Min/Max Temperature:	  2/36 Celsius
Under/Over Temperature Limit Count:   0/0
Vendor specific:
01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00

SCT Temperature History Version:	 2
Temperature Sampling Period:		 1 minute
Temperature Logging Interval:		1 minute
Min/Max recommended Temperature:	  0/60 Celsius
Min/Max Temperature Limit:		   -41/85 Celsius
Temperature History Size (Index):	478 (31)

Index	Estimated Time   Temperature Celsius
  32	2017-03-27 02:44	28  *********
...	..(476 skipped).	..  *********
  31	2017-03-27 10:41	28  *********

SCT Error Recovery Control:
		   Read:	 70 (7.0 seconds)
		  Write:	 70 (7.0 seconds)

Device Statistics (GP/SMART Log 0x04) not supported

SATA Phy Event Counters (GP Log 0x11)
ID	  Size	 Value  Description
0x0001  2			0  Command failed due to ICRC error
0x0002  2			0  R_ERR response for data FIS
0x0003  2			0  R_ERR response for device-to-host data FIS
0x0004  2			0  R_ERR response for host-to-device data FIS
0x0005  2			0  R_ERR response for non-data FIS
0x0006  2			0  R_ERR response for device-to-host non-data FIS
0x0007  2			0  R_ERR response for host-to-device non-data FIS
0x0008  2			0  Device-to-host non-data FIS retries
0x0009  2			2  Transition from drive PhyRdy to drive PhyNRdy
0x000a  2			3  Device-to-host register FISes sent due to a COMRESET
0x000b  2			0  CRC errors within host-to-device FIS
0x000f  2			0  R_ERR response for host-to-device data FIS, CRC
0x0012  2			0  R_ERR response for host-to-device non-data FIS, CRC
0x8000  4	 11032954  Vendor specific
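
The two failing addresses in this output (the error log's UNC LBA and the self-test's LBA_of_first_error) can be compared with a bit of arithmetic. This sketch assumes the 512-byte logical / 4096-byte physical sector sizes reported above, and that logical LBAs map onto physical sectors in aligned groups of eight. That alignment is an assumption about this drive, not something the output states.

```python
LOGICAL = 512
PHYSICAL = 4096
PER_PHYSICAL = PHYSICAL // LOGICAL  # 8 logical sectors per physical sector

def physical_sector(lba):
    """Return (physical sector index, offset within it) for a logical LBA."""
    return divmod(lba, PER_PHYSICAL)

error_log_lba = 0x02619E60   # "Error: UNC at LBA" in the extended error log
self_test_lba = 39952032     # LBA_of_first_error in the self-test log

print(physical_sector(error_log_lba))  # (4993996, 0)
print(physical_sector(self_test_lba))  # (4994004, 0)
print(self_test_lba - error_log_lba)   # 64
```

Under those assumptions the two reports point at different physical sectors, eight apart, which would suggest that more than one spot in the same area was unreadable.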
 

encbox

Yes, it seems that the disk is indeed faulty. I replaced it and the error is gone.
 