smartctl false positives

KenNashua

Explorer
Joined
Feb 24, 2012
Messages
62
I have an old Kingston 64GB SSD that I'm using as my boot device.

Everything runs fine, except that one of the SMART attribute checks triggers alerts continuously, which I believe is a false positive.

Code:
CRITICAL: Oct. 6, 2018, 6:02 a.m. - Device: /dev/ada0, Failed SMART usage Attribute: 194 Temperature_Celsius.


Here's the attribute block:
Code:
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000a   100   100   000    Old_age   Always   -           0
  2 Throughput_Performance  0x0005   100   100   050    Pre-fail  Offline  -           0
  3 Unknown_JMF_Attribute   0x0007   100   100   050    Pre-fail  Always   -           0
  5 Reallocated_Sector_Ct   0x0013   100   100   050    Pre-fail  Always   -           0
  7 Unknown_JMF_Attribute   0x000b   100   100   050    Pre-fail  Always   -           0
  8 Unknown_JMF_Attribute   0x0005   100   100   050    Pre-fail  Offline  -           0
  9 Power_On_Hours          0x0012   100   100   000    Old_age   Always   -           12607
 10 Unknown_JMF_Attribute   0x0013   100   100   050    Pre-fail  Always   -           0
 12 Power_Cycle_Count       0x0012   100   100   000    Old_age   Always   -           942
167 Unknown_JMF_Attribute   0x0022   100   100   000    Old_age   Always   -           0
168 SATA_Phy_Error_Count    0x0012   100   100   000    Old_age   Always   -           0
169 Unknown_JMF_Attribute   0x0013   100   100   010    Pre-fail  Always   -           0
173 Erase_Count             0x0012   200   200   000    Old_age   Always   -           0 0 0
175 Bad_Cluster_Table_Count 0x0013   100   100   010    Pre-fail  Always   -           0
192 Unexpect_Power_Loss_Ct  0x0012   100   100   000    Old_age   Always   -           0
194 Temperature_Celsius     0x0023   035   035   030    Pre-fail  Always   -           35 (Min/Max 27/40)
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always   -           0
240 Unknown_JMF_Attribute   0x0013   100   100   050    Pre-fail  Always   -           0
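
For context on why attribute 194 keeps tripping: a SMART usage attribute counts as failed when its normalized VALUE is at or below THRESH. On this drive the VALUE column appears to simply mirror the raw temperature (035 for 35 degrees), so with a threshold of 030 the attribute would "fail" whenever the drive cools to 30 or below, and the reported minimum of 27 suggests that does happen. Here is a minimal Python sketch of that comparison, assuming the column layout of the block above; the function names are made up for illustration and are not part of smartmontools.

```python
def parse_attribute_line(line):
    """Split one row of the attribute table into the fields used below."""
    fields = line.split()
    return {
        "id": int(fields[0]),
        "name": fields[1],
        "value": int(fields[3]),    # normalized VALUE column
        "worst": int(fields[4]),
        "thresh": int(fields[5]),   # THRESH column
        "raw": " ".join(fields[9:]),
    }


def is_failing(attr):
    """An attribute counts as failed when VALUE is at or below THRESH."""
    return attr["value"] <= attr["thresh"]


row = "194 Temperature_Celsius 0x0023 035 035 030 Pre-fail Always - 35 (Min/Max 27/40)"
attr = parse_attribute_line(row)
print(attr["name"], "failing now:", is_failing(attr))

# Hypothetical cooler reading: if VALUE tracks the raw temperature,
# 27 degrees would show up as VALUE 027, which is below THRESH 030.
cool = dict(attr, value=27)
print("failing at 27:", is_failing(cool))
```

With the row as posted, the attribute is not failing at this instant (35 is above 30), which fits alerts that fire only when the drive is cool.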


Normally, I believe I could edit /usr/local/etc/smartd.conf to ignore the attribute for that drive, but since this is auto-generated, is there anything that will allow me to override this attribute?
 

titan_rw

Guru
Joined
Sep 1, 2012
Messages
586
No, you cannot override it, however you can prevent smartd from monitoring the device.

http://doc.freenas.org/11/storage.html#view-disks

Theoretically you can. I haven't tested it, but the functionality exists.

Looking at the docs for smartd.conf, it looks like -i is used for this:

Code:
-i ID  Ignore device Attribute number ID when checking for failure of
       Usage Attributes.  ID must be a decimal integer in the range from
       1 to 255.  This Directive modifies the behavior of the '-f'
       Directive and has no effect without it.

       This is useful, for example, if you have a very old disk and don't
       want to keep getting messages about the hours-on-lifetime
       Attribute (usually Attribute 9) failing.  This Directive may
       appear multiple times for a single device, if you want to ignore
       multiple Attributes.


FreeNAS supports adding user-supplied directives for each drive in the 'S.M.A.R.T. extra options' field under Storage > View Disks > (edit the disk in question). If you add "-i 194", FreeNAS should include that directive in the conf file when it generates it.

I tested it, and it seems to do the right thing: I added -i 194 to da0, and the generated config came out like this:

Code:
/dev/da1 -a -n never -W 5,35,45 -m me@here.com -M exec /usr/local/www/freenasUI/tools/smart_alert.py\
-s L/(01|02|03|04|05|06|07|08|09|10|11|12)/(07|21)/(1|2|3|4|5|6|7)/(02)\

/dev/da0 -a -n never -W 5,35,45 -m me@here.com -M exec /usr/local/www/freenasUI/tools/smart_alert.py\
-s L/(01|02|03|04|05|06|07|08|09|10|11|12)/(07|21)/(1|2|3|4|5|6|7)/(02)\
 -i 194
/dev/da8 -a -n never -W 5,35,45 -m me@here.com -M exec /usr/local/www/freenasUI/tools/smart_alert.py\
-s L/(01|02|03|04|05|06|07|08|09|10|11|12)/(07|21)/(1|2|3|4|5|6|7)/(02)\



Maybe try that?
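
One way to confirm the directive actually reached the generated file is to read the config back and list the -i IDs per device. The following is a rough Python sketch, assuming the layout shown above (one device entry per logical line, with backslash continuations); `ignored_attributes` is a made-up helper name, and the sample text is a shortened stand-in for the real entries.

```python
def ignored_attributes(conf_text):
    """Map each device entry to the attribute IDs passed via -i."""
    # Join backslash-continued lines into one logical line per device.
    joined = conf_text.replace("\\\n", " ")
    result = {}
    for line in joined.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and padding
        if not line:
            continue
        tokens = line.split()
        device = tokens[0]
        # Collect the token following each "-i", if it is a number.
        ids = [int(tokens[i + 1]) for i, tok in enumerate(tokens[:-1])
               if tok == "-i" and tokens[i + 1].isdigit()]
        result[device] = ids
    return result


# Shortened stand-in for the generated entries above.
sample = (
    "/dev/da1 -a -n never -W 5,35,45 -m me@here.com\\\n"
    "-s L/../../7/02\\\n"
    "\n"
    "/dev/da0 -a -n never -W 5,35,45 -m me@here.com\\\n"
    "-s L/../../7/02\\\n"
    " -i 194\n"
)
print(ignored_attributes(sample))
```

To check a live system, the same function could be fed the contents of /usr/local/etc/smartd.conf after saving the disk's extra options.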
 

pro lamer

Guru
Joined
Feb 16, 2018
Messages
626
I believe is a false-positive.
Is that because the threshold is 30 °C? Is it a bug in the drive's firmware?

What if the temperature rises above some critical level (i.e. a level dangerous to the drive)?

 

KenNashua

Explorer
Joined
Feb 24, 2012
Messages
62
Theoretically you can. I haven't tested it, but the functionality exists.

FreeNAS supports adding user-supplied directives for each drive in the 'S.M.A.R.T. extra options' field under Storage > View Disks > (edit the disk in question). If you add "-i 194", FreeNAS should include that directive in the conf file when it generates it.

Thanks! That's exactly what I was looking for. I looked for options under the SMART cron tasks but didn't think to look there.
 

MikeyG

Patron
Joined
Dec 8, 2017
Messages
442
Were you ever able to get this to work? I'm trying to ignore attribute 197 with -i 197 and -I 197 and not getting anywhere.
 

KenNashua

Explorer
Joined
Feb 24, 2012
Messages
62
@mgittelman, it works exactly as @titan_rw indicated:

Go to Storage/Disks and edit the disk in question. There's a S.M.A.R.T. extra options field. In your case you'd want to set it to -i 197.

You might need to restart the S.M.A.R.T. service from the Services menu or reboot.

 

MikeyG

Patron
Joined
Dec 8, 2017
Messages
442
Thanks, @KenNashua. I have done exactly that, and it did not help.
 