Lorsung23647
Dabbler
- Joined
- Mar 10, 2014
- Messages
- 17
So I started replacing some old disks in my array with new SAS disks. I'm doing them 1 at a time as I have money/disks fail. (Keep one on hand at all times, and replace when the next comes in.)
On the first disk I replaced I noticed that the activity light was on almost constantly. It would also flash for normal drive activity so I didn't really think much of it since my array performance wasn't actually any slower.
Last night I got a strange set of errors in a security run log that included that SAS disk:
I also had a nightly backup process take 50% longer to complete tonight that previous nights.
I found a thread with some info about those errors and did a smart check with "smartctl -a -q noserial /dev/da0"
If someone could give me some assistance on if something does seem wrong, I'd appreciate it.
System Specs:
FreeNAS 9.2.1.7-Release-x64
32GB ECC Memory
LSI 9201-16i HBA
Norco RPC-4224 Case
On the first disk I replaced I noticed that the activity light was on almost constantly. It would also flash for normal drive activity so I didn't really think much of it since my array performance wasn't actually any slower.
Last night I got a strange set of errors in a security run log that included that SAS disk:
Code:
> (da0:mps0:0:16:0): WRITE(10). CDB: 2a 00 73 5a a7 90 00 01 00 00 length 131072 SMID 393 terminated ioc 804b scsi 0 state c xfer 131072 > (da0:mps0:0:16:0): WRITE(10). CDB: 2a 00 73 84 35 98 00 00 c0 00 length 98304 SMID 867 terminated ioc 804b scsi 0 state c xfer 98304 > (da0:mps0:0:16:0): WRITE(10). CDB: 2a 00 73 8f 80 88 00 01 00 00 length 131072 SMID 392 terminated ioc 804b scsi 0 state c xfer 131072 > (da0:mps0:0:16:0): WRITE(10). CDB: 2a 00 73 a1 0f 10 00 01 00 00 length 131072 SMID 393 terminated ioc 804b scsi 0 state c xfer 131072 > (da0:mps0:0:16:0): WRITE(10). CDB: 2a 00 73 a9 84 90 00 01 00 00 length 131072 SMID 273 terminated ioc 804b scsi 0 state c xfer 131072
I also had a nightly backup process take 50% longer to complete tonight that previous nights.
I found a thread with some info about those errors and did a smart check with "smartctl -a -q noserial /dev/da0"
Code:
=== START OF INFORMATION SECTION === Vendor: HGST Product: HUS724040ALS640 Revision: A1C4 User Capacity: 4,000,787,030,016 bytes [4.00 TB] Logical block size: 512 bytes LU is resource provisioned, LBPRZ=0 Rotation Rate: 7200 rpm Form Factor: 3.5 inches Device type: disk Transport protocol: SAS Local Time is: Fri Oct 3 19:13:18 2014 CDT SMART support is: Available - device has SMART capability. SMART support is: Enabled Temperature Warning: Enabled === START OF READ SMART DATA SECTION === SMART Health Status: OK Current Drive Temperature: 33 C Drive Trip Temperature: 85 C Manufactured in week 35 of year 2014 Specified cycle count over device lifetime: 50000 Accumulated start-stop cycles: 1 Specified load-unload count over device lifetime: 600000 Accumulated load-unload cycles: 8 Elements in grown defect list: 0 Vendor (Seagate) cache information Blocks sent to initiator = 273934406647808 Error counter log: Errors Corrected by Total Correction Gigabytes Total ECC rereads/ errors algorithm processed uncorrected fast | delayed rewrites corrected invocations [10^9 bytes] errors read: 2168 0 0 2168 18337 351.764 0 write: 0 0 0 0 3596 808.894 0 verify: 0 0 0 0 39 0.000 0 Non-medium error count: 0 SMART Self-test log Num Test Status segment LifeTime LBA_first_err [SK ASC ASQ Description number (hours) # 1 Background long Completed - 163 - [- - - # 2 Background short Completed - 150 - [- - - # 3 Background short Completed - 126 - [- - - # 4 Background short Completed - 102 - [- - - # 5 Background short Completed - 78 - [- - - # 6 Background short Completed - 54 - [- - - # 7 Background short Completed - 30 - [- - - # 8 Background short Completed - 6 - [- - - Long (extended) Self Test duration: 37038 seconds [617.3 minutes]
The output seems a bit short, but I haven't had to troubleshoot SAS disks too intensively yet, however, the error counter log, and the time to complete a Self Test worry me. 10+ hours for a smart test seems excessive, and 2k+ error corrections for 351 GB of data seems high.If someone could give me some assistance on if something does seem wrong, I'd appreciate it.
System Specs:
FreeNAS 9.2.1.7-Release-x64
32GB ECC Memory
LSI 9201-16i HBA
Norco RPC-4224 Case
Last edited: