ada2 is giving me the blues...

Status
Not open for further replies.

Wade

Contributor
Joined
Feb 16, 2014
Messages
110
I'm a noob. I set up FreeNAS 9.2.1 with a RAIDZ of three 2 TB drives: a WD Green, a WD Red, and a regular Seagate desktop drive. I have 8 GB of Kingston non-ECC memory as a temporary measure, since the ECC memory is still in shipping.

In the FreeNAS GUI everything seems OK: the alert system says I'm healthy, and all datasets are healthy. My question is, why do I repeatedly get errors on the FreeNAS server itself?

It states:

freenas smartd[2966] device /dev/ada2, 65517 currently unreadable (pending) sectors

freenas smartd[2966] device /dev/ada2, 6 offline uncorrectable sectors

I'm too much of a noob to know if I should be scared! Thank you for any help.
 

Wade

Contributor
Joined
Feb 16, 2014
Messages
110
Also, I know the system log is loaded in RAM, but I can't figure out how to view it so I can post some of the errors.

I don't have a remote syslog server.
 

cyberjock

Inactive Account
Joined
Mar 25, 2012
Messages
19,526
If you search for those error messages you'll know what that stuff means... in short ada2 is failing and should be replaced.
 

Wade

Contributor
Joined
Feb 16, 2014
Messages
110
Oh believe me, I tried to search. I know you'll flame a noob in a hot second! LOL, I wasn't looking forward to being singed, so trust me, I tried to save myself that embarrassment :) I have no Unix / Linux / BSD background and have spent the last two weeks reading page after page, bits and pieces from here and there, just to get as far as I have. The learning curve is steep at best, and I couldn't find my exact errors or how to fix them. Thanks for the advice. I will replace that drive, start over from scratch, and Google how to replace a drive in RAIDZ. That should keep me off the hit list :)

Actually, I might not have to start from scratch. I already have shares and jails set up, and I set the motherboard to AHCI before I started the build.
 

cyberjock

Inactive Account
Joined
Mar 25, 2012
Messages
19,526
If you simply search for "current unreadable sectors" you'll find that we told those guys to search too! The first thread that came up when I put in that search term was http://forums.freenas.org/index.php?threads/offline-uncorrectable-sectors-bad-news.17782/#post-95821, and someone there said it's all over the forums too.

The learning curve is extremely steep.

You won't have to start over unless you can't rebuild the pool. I'd take this opportunity to go to RAIDZ2 though. ;) RAIDZ1 just isn't reliable in today's hard drive world.
 

Wade

Contributor
Joined
Feb 16, 2014
Messages
110
Before I go buy that drive: the manual says "Before physically removing the failed device, go to Storage → Volumes → View Volumes → Volume Status and locate the failed disk." All of my disks say online; none of them say failed. Replace ada2 anyway?
 

Wade

Contributor
Joined
Feb 16, 2014
Messages
110
RAIDZ2 sounds great, but I have three 2 TB drives and can only afford to replace the one that is slowly failing with the errors. Otherwise I would have done RAIDZ2 for sure.
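
A rough sketch of the capacity arithmetic behind that trade-off, assuming a single RAIDZ vdev of equal-size disks and ignoring ZFS metadata and padding overhead. The helper name raidz_usable_tb is made up for illustration; it is not a FreeNAS or ZFS command.

```shell
#!/bin/sh
# Rough usable capacity of one RAIDZ vdev: (disks - parity) * disk size.
# raidz_usable_tb DISKS PARITY SIZE_TB  (hypothetical helper)
raidz_usable_tb() {
    disks=$1; parity=$2; size_tb=$3
    if [ "$disks" -le "$parity" ]; then
        echo "need more than $parity disks for this RAIDZ level" >&2
        return 1
    fi
    echo $(( (disks - parity) * size_tb ))
}

raidz_usable_tb 3 1 2   # RAIDZ1, three 2 TB disks: prints 4
raidz_usable_tb 4 2 2   # RAIDZ2, four 2 TB disks: prints 4
```

Under these assumptions, three 2 TB disks in RAIDZ1 and four 2 TB disks in RAIDZ2 both give roughly 4 TB usable, but the RAIDZ2 layout can lose two disks instead of one.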
 

cyberjock

Inactive Account
Joined
Mar 25, 2012
Messages
19,526
Yeah. It's not telling you to do that to verify the disk is set to a "FAILED" status. It just wants you to verify that you understand which pool the disk is in and such. It's not a good thing when you think you are pulling a drive from PoolA, but instead remove a drive from PoolB. ;)

Basically you should double and triple check every step of the way as you don't want to offline a good disk or do anything to jeopardize your pool.
 

joeschmuck

Old Man
Moderator
Joined
May 28, 2011
Messages
10,994
I always verify the serial number of the hard drive in question. If I need to pull ada2, I will first look up the serial number for ada2 in the GUI or via smartctl, and then, after the machine is turned off and I pull the drive, I will physically check that the serial numbers match. If they don't, then I need to locate the correct drive.
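
A minimal sketch of the smartctl route, assuming the disk is still enumerated as /dev/ada2 and that `smartctl -i` prints a "Serial Number:" line, as it does for typical ATA drives. The function name serial_from_smartctl is hypothetical.

```shell
#!/bin/sh
# Read `smartctl -i` output on stdin and print only the serial number.
# serial_from_smartctl is a made-up helper name, not part of smartmontools.
serial_from_smartctl() {
    awk -F': *' '/^Serial Number:/ { print $2; exit }'
}

# Usage on the NAS (as root), assuming the device name is ada2:
#   smartctl -i /dev/ada2 | serial_from_smartctl
```

Compare the printed value with the label on the drive you are holding before anything gets replaced.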
 

Wade

Contributor
Joined
Feb 16, 2014
Messages
110
Thanks! I decided to tool around with it before I left to go buy the drive. Just on a whim, I moved the faulty drive over one SATA port and voilà, no more errors! I think I am safe, right? If so, I can still go get that drive and do a RAIDZ2, because then I'll have four 2 TB drives, provided the would-be faulty drive is actually still good. I'm looking for some error-checking commands I can run from the GUI, because the CLI on the NAS isn't giving any errors at all. It's just sitting there nice and quiet on the 1-11 console setup screen. I hope it isn't a false sense of security :)
 

joeschmuck

Old Man
Moderator
Joined
May 28, 2011
Messages
10,994
Sometimes those errors can occur because of a poor SATA connection. Maybe you should post the results of "smartctl -a /dev/ada2", assuming the drive is still recognized as ada2. Let's see what the drive really looks like.
 

cyberjock

Inactive Account
Joined
Mar 25, 2012
Messages
19,526
Current pending sectors are never caused by a SATA connection. That's an internal head-can't-read-the-data-from-the-platter problem. The same is true for offline uncorrectables.
 

Wade

Contributor
Joined
Feb 16, 2014
Messages
110
Could somebody help me translate this, please? I hope this was the correct syntax; I used "smartctl -t /dev/ada2". I'm just trying to confirm that my drive is bad before I leave for the store. Thanks.

A side note: the top of the printout seems to get cut off, and I can't scroll up to see the beginning of it.
 

Wade

Contributor
Joined
Feb 16, 2014
Messages
110
Oops, I didn't see your last post. Thanks man, I'm off to the store!
 

Wade

Contributor
Joined
Feb 16, 2014
Messages
110
Oh, OK, give me a sec to see if I can print it to screen or copy and paste it somehow.
 

Wade

Contributor
Joined
Feb 16, 2014
Messages
110
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   186   169   051    Pre-fail  Always       -       5402
  3 Spin_Up_Time            0x0027   253   203   021    Pre-fail  Always       -       975
  4 Start_Stop_Count        0x0032   098   098   000    Old_age   Always       -       2291
  5 Reallocated_Sector_Ct   0x0033   187   187   140    Pre-fail  Always       -       256
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   095   095   000    Old_age   Always       -       3728
 10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   098   098   000    Old_age   Always       -       2179
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       302
193 Load_Cycle_Count        0x0032   162   162   000    Old_age   Always       -       116472
194 Temperature_Celsius     0x0022   109   091   000    Old_age   Always       -       41
196 Reallocated_Event_Count 0x0032   194   194   000    Old_age   Always       -       6
197 Current_Pending_Sector  0x0032   001   001   000    Old_age   Always       -       65517
198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       6
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       3
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       14

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description  Status                   Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline     Completed: read failure  90%        3728             40113992

SMART Selective self-test log data structure revision number 1
SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
   1        0        0  Not_testing
   2        0        0  Not_testing
   3        0        0  Not_testing
   4        0        0  Not_testing
   5        0        0  Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

[root@freenas ~]# smartctl -t /dev/ada2
smartctl 6.2 2013-07-26 r3841 [FreeBSD 9.2-RELEASE-p3 amd64] (local build)
Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org

=======> INVALID ARGUMENT TO -t: /dev/ada2
=======> VALID ARGUMENTS ARE: offline, short, long, conveyance, force, vendor,N, select,M-N, pending,N, afterselect,[on|off] <=======

Use smartctl -h to get a usage summary
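
For reference, -t expects a test type before the device, as the VALID ARGUMENTS line above says. A minimal sketch, assuming the disk is still /dev/ada2; start_selftest is a hypothetical wrapper name, and it only accepts the four plain test types from that list.

```shell
#!/bin/sh
# start_selftest TYPE DEVICE
# Hypothetical wrapper: reject a missing or unknown test type before
# handing it to `smartctl -t`, which is what the command above tripped over.
start_selftest() {
    case "$1" in
        offline|short|long|conveyance) ;;
        *)
            echo "usage: start_selftest offline|short|long|conveyance /dev/adaN" >&2
            return 2
            ;;
    esac
    smartctl -t "$1" "$2"
}

# Usage on the NAS (as root):
#   start_selftest short /dev/ada2   # start a short self-test
#   smartctl -a /dev/ada2            # later: print all SMART info, including the self-test log
```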
 

Wade

Contributor
Joined
Feb 16, 2014
Messages
110
Every time I try to get the usage summary, most of the command explanations are chopped off. Is there a way to scroll up?

So what do ya think about the test results?
 

cyberjock

Inactive Account
Joined
Mar 25, 2012
Messages
19,526
I knew it would fail. If CUPS (currently unreadable pending sectors) is even 1, or offline uncorrectable is even 1, it'll fail the test. I have a drive that fails because of 1 CUPS error. :(
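
A minimal sketch of checking those two counters from a script, assuming the ten-column attribute table layout shown earlier in the thread (RAW_VALUE in the tenth column). The helper names smart_raw and check_bad_sectors are made up for illustration.

```shell
#!/bin/sh
# smart_raw ID
# Read `smartctl -A` output on stdin and print the RAW_VALUE (column 10)
# of the attribute whose ID# matches.
smart_raw() {
    awk -v id="$1" '$1 == id { print $10; exit }'
}

# check_bad_sectors
# Read `smartctl -A` output on stdin. Report attribute 197
# (Current_Pending_Sector) or 198 (Offline_Uncorrectable) when its raw
# value is non-zero, and return 1 in that case; return 0 otherwise.
check_bad_sectors() {
    awk '($1 == 197 || $1 == 198) && $10 + 0 > 0 {
             print $2 " raw value is " $10
             bad = 1
         }
         END { exit bad + 0 }'
}

# Usage on the NAS (as root), assuming the device name is ada2:
#   smartctl -A /dev/ada2 | smart_raw 197
#   smartctl -A /dev/ada2 | check_bad_sectors || echo "ada2 needs attention"
```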
 

Wade

Contributor
Joined
Feb 16, 2014
Messages
110
I bought two 2 TB WD Reds, so now I have three 2 TB WD Reds and a 2 TB Seagate making up a four-drive RAIDZ2 :) Yet the saga continues... :( I put the new drives in, reset FreeNAS back to factory defaults, and built a new volume for my four drives. Every time I boot I get an error saying:

da0 umass-sim 0:0:0:0 CAM status SCSI error

I found this on another site; it's probably exactly the same error I'm getting. Any ideas?

From what I gather, da0 is the USB stick that I boot from and run FreeNAS off of, but the stick appears to work fine...

(da0:umass-sim0:0:0:0):READ(10). CDB: 28 0 0 13 80 1 0 0 1 0
(da0:umass-sim0:0:0:0):CAM Status: SCSI Status Error
(da0:umass-sim0:0:0:0):SCSI Status: Check Condition
(da0:umass-sim0:0:0:0):MEDIUM ERROR asc:11,0
(da0:umass-sim0:0:0:0):Unrecovered read error
(da0:umass-sim0:0:0:0):Retrying Command (per Sense Data)
(da0:umass-sim0:0:0:0):READ(10). CDB: 28 0 0 13 80 1 0 0 1 0
(da0:umass-sim0:0:0:0):CAM Status: SCSI Status Error
(da0:umass-sim0:0:0:0):SCSI Status: Check Condition
(da0:umass-sim0:0:0:0):ILLEGAL REQUEST asc:20,0
(da0:umass-sim0:0:0:0):Invalid command operation code
(da0:umass-sim0:0:0:0):Unretryable error
 