Charles Frank
Dabbler
- Joined
- Dec 24, 2014
- Messages
- 13
I've got a FreeNAS box built with a Supermicro X10SL7-F-O motherboard that's been flawless. I ran an update a few days ago as I hadn't patched it for some time and (this may be coincidence) a day or two later I started noticing the box being a little 'flaky'. Tonight I dumped some data to it and when I tried to check it - while I could see the shares fine on my Windows PC and could list the contents of the shares ok - the files themselves were unreadable. So I checked and I'm seeing a lot of CAM and SCSI status errors:
(da3:mps0:0:3:0): READ(10). CDB: 28 00 00 40 02 a0 00 00 e0 00 length 114688 SMID 193 command timeout cm 0xfffffe0000a8dd50 ccb 0xfffff8009f7be000
(noperiph:mps0:0:4294967295:0): SMID 2 Aborting command 0xfffffe0000a8dd50
mps0: Sending reset from mpssas_send_abort for target ID 3
mps0: Unfreezing devq for target ID 3
(da3:mps0:0:3:0): READ(10). CDB: 28 00 00 40 02 a0 00 00 e0 00
(da3:mps0:0:3:0): CAM status: command timeout
(da3:mps0:0:3:0): Retrying command
(da3:mps0:0:3:0): READ(10). CDB: 28 00 00 40 02 a0 00 00 e0 00
(da3:mps0:0:3:0): CAM status: SCSI Status Error
(da3:mps0:0:3:0): SCSI status: Check Condition
(da3:mps0:0:3:0): SCSI sense: UNIT ATTENTION asc:29,0 (Power on, reset, or bus device reset occurred)
(da3:mps0:0:3:0): Retrying command (per sense data)
and it just starts looping through that sequence. I've reseated all the power and data cables both at the drive end and at the on-board LSI controller, to no avail. From the FreeNAS UI itself I had no warnings, the drives reported they were OK, scrubs showed nothing. Assuming the update had borked something I did something very silly (don't hate me, my knowledge of FreeNAS/BSD is very limited) I decided to do a clean reinstall of FreeNAS 9.10 (my existing setup had been upgraded to 9.10 from 9.3) and import my volumes and settings. This is where things went from bad to worse... The clean 9.10 install couldn't import my volume - it could see it, but when it tried to import it, the above error just started cycling through. Now with my (VERY) limited knowledge I'm guessing one of the drives has died - but my understanding for ZFS was (and correct me if I'm wrong) it would warn you and the RAID array would carry on, assuming you set it up with redundant drives (I am using 6x3Tb Hitachi NAS drives set up with RAID-Z2).
Any help here would be MUCH appreciated - I've got about 10Tb on my server and while I have a backup of the data it would take me a considerable amount of time to copy it all back over if I had to do a complete rebuild from scratch.
Cheers,
Charles.
(da3:mps0:0:3:0): READ(10). CDB: 28 00 00 40 02 a0 00 00 e0 00 length 114688 SMID 193 command timeout cm 0xfffffe0000a8dd50 ccb 0xfffff8009f7be000
(noperiph:mps0:0:4294967295:0): SMID 2 Aborting command 0xfffffe0000a8dd50
mps0: Sending reset from mpssas_send_abort for target ID 3
mps0: Unfreezing devq for target ID 3
(da3:mps0:0:3:0): READ(10). CDB: 28 00 00 40 02 a0 00 00 e0 00
(da3:mps0:0:3:0): CAM status: command timeout
(da3:mps0:0:3:0): Retrying command
(da3:mps0:0:3:0): READ(10). CDB: 28 00 00 40 02 a0 00 00 e0 00
(da3:mps0:0:3:0): CAM status: SCSI Status Error
(da3:mps0:0:3:0): SCSI status: Check Condition
(da3:mps0:0:3:0): SCSI sense: UNIT ATTENTION asc:29,0 (Power on, reset, or bus device reset occurred)
(da3:mps0:0:3:0): Retrying command (per sense data)
and it just starts looping through that sequence. I've reseated all the power and data cables both at the drive end and at the on-board LSI controller, to no avail. From the FreeNAS UI itself I had no warnings, the drives reported they were OK, scrubs showed nothing. Assuming the update had borked something I did something very silly (don't hate me, my knowledge of FreeNAS/BSD is very limited) I decided to do a clean reinstall of FreeNAS 9.10 (my existing setup had been upgraded to 9.10 from 9.3) and import my volumes and settings. This is where things went from bad to worse... The clean 9.10 install couldn't import my volume - it could see it, but when it tried to import it, the above error just started cycling through. Now with my (VERY) limited knowledge I'm guessing one of the drives has died - but my understanding for ZFS was (and correct me if I'm wrong) it would warn you and the RAID array would carry on, assuming you set it up with redundant drives (I am using 6x3Tb Hitachi NAS drives set up with RAID-Z2).
Any help here would be MUCH appreciated - I've got about 10Tb on my server and while I have a backup of the data it would take me a considerable amount of time to copy it all back over if I had to do a complete rebuild from scratch.
Cheers,
Charles.