Uncorrectable Parity/CRC Error ??

Status
Not open for further replies.

antsrealm

Explorer
Joined
Jan 28, 2013
Messages
82
I'm trying to find out what is going on here as I'm a bit nervous there could be an imminent failure here. I have been getting the uncorrectable parity/crc error in the footer of the gui for ada4 everytime I look at it and it's also in my email report for the kernel log on both ada4 and ada3.

I don't know how to fix this and have run long and short smart test with no problems and the zpool status shows no errors although there were some checksum errors once but after a scrub and clearing the errors there have been no more checksum errors ??

I have the following:-

1 x Z1 ZFS volume consisting of 5 x 3TB WD Green drives all connected to onnboard sata ports.
16GB Ram (Not ECC Ram)

I plan to upgrade to a Z2 volume soon and better hardware but want to get to the bottom of this error first and be sure I have healthy disks before using them in the new Z2 volume.

Here is the data from the email I got today showing the kernel log :-

Code:
freenas.local kernel log messages:
+++ /tmp/security.8MFCdcUw    2013-08-16 03:01:01.000000000 +1000
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 50 3c b3 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 38 ed 7c 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 38 bd aa 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 e0 9a 75 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 b0 f1 7c 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 20 be aa 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 c0 40 b3 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 d0 40 b3 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 e8 2d 7d 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 00 3e 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 30 42 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 50 42 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 68 42 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 60 f0 6a b3 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 60 d8 32 7d 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 18 80 47 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 40 98 48 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 10 3d 7d 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 10 c9 aa 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 10 75 b3 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 70 c9 aa 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 a0 3e 7d 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 b0 76 b3 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 98 3e 7d 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 58 50 43 7d 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 68 20 55 46 40 00 01 00 00 00 00
+(ada3:ahcich3:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 d0 63 46 40 00 01 00 00 00 00
+(ada3:ahcich3:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada3:ahcich3:0:0:0): Retrying command
+(ada3:ahcich3:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 f8 63 46 40 00 01 00 00 00 00
+(ada3:ahcich3:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada3:ahcich3:0:0:0): Retrying command
+(ada3:ahcich3:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 18 78 98 b3 40 16 00 00 00 00 00
+(ada3:ahcich3:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada3:ahcich3:0:0:0): Retrying command
+(ada3:ahcich3:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 40 af 75 40 73 00 00 00 00 00
+(ada3:ahcich3:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada3:ahcich3:0:0:0): Retrying command
+(ada3:ahcich3:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 48 60 7d 40 73 00 00 00 00 00
+(ada3:ahcich3:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada3:ahcich3:0:0:0): Retrying command
+(ada3:ahcich3:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 18 58 60 7d 40 73 00 00 00 00 00
+(ada3:ahcich3:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada3:ahcich3:0:0:0): Retrying command
+(ada3:ahcich3:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 38 47 46 40 00 01 00 00 00 00
+(ada3:ahcich3:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada3:ahcich3:0:0:0): Retrying command
+(ada3:ahcich3:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 30 68 46 40 00 01 00 00 00 00
+(ada3:ahcich3:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada3:ahcich3:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 c0 48 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 d0 6a 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 e8 6a 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 60 b0 6b 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 10 49 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 80 6c 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 a0 6c 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 38 6d 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 f0 9f b3 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 d0 67 7d 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 f8 9f b3 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 f8 72 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 18 10 73 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 d0 b2 75 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 f8 71 7d 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 10 72 7d 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 a0 4a 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 40 74 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 f0 b5 75 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 f0 81 7d 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 08 82 7d 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 e0 4c 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 08 7f 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 c0 bb b3 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 b0 83 7d 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 88 80 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 c8 bb b3 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 58 e8 c3 b3 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 58 d0 8b 7d 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 20 20 95 7d 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 e8 8d 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 58 18 ce b3 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 58 00 96 7d 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 b0 d0 b3 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 58 b8 d0 b3 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 58 a0 98 7d 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 48 4f 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 88 93 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 18 a0 93 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 08 02 40 40 00 00 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 08 04 40 40 00 00 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 08 a0 50 40 5d 01 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 08 a2 50 40 5d 01 00 00 00 00
+(ada4:ahcich4:0:0:0): CAM status: Uncorrectable parity/CRC error
+(ada4:ahcich4:0:0:0): Retrying command
+(ada3:ahcich3:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 00 e0 b3 40 16 00 00 00 00 00
+(ada3:ahcich3:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 48 9c 46 40 00 01 00 00 00 00
+(ada3:ahcich3:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 e8 df b3 40 16 00 00 00 00 00
+(ada3:ahcich3:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 08 e0 b3 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 10 9d 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 60 00 a5 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 50 f3 b3 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 40 bb 7d 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 00 ab 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 60 f3 b3 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 e0 f3 b3 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 90 ab 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 f0 9f 50 40 5d 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 f0 a1 50 40 5d 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 28 12 b4 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 20 c2 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 d0 e6 aa 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 10 12 b4 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 70 14 b4 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 70 c6 75 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 10 e7 7d 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 30 e7 7d 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 b8 57 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 78 ca 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 98 ca 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 90 d0 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 18 a8 d0 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 f0 32 b4 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 e0 fa 7d 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 a0 d7 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 60 90 fe 7d 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 f0 16 7e 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 f0 ed 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 50 f1 aa 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 f0 4e b4 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 58 20 ee 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 68 c0 ee 46 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 88 36 7e 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 f8 37 7e 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 a0 06 47 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 a8 06 47 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 18 40 81 b4 40 16 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 20 49 7e 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 18 30 49 7e 40 73 00 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 40 13 47 40 00 01 00 00 00 00
+(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 68 48 19 47 40 00 01 00 00 00 00
 

cyberjock

Inactive Account
Joined
Mar 25, 2012
Messages
19,526
That sounds like a cabling or disk problem. But, its on 2 disks.

Post the output of:

smartctl -a -q noserial /dev/ada4

smartctl -a -q noserial /dev/ada3


and please post it in CODE or attach it as a text file.

Thanks.
 

antsrealm

Explorer
Joined
Jan 28, 2013
Messages
82
That sounds like a cabling or disk problem. But, its on 2 disks.

Post the output of:

smartctl -a -q noserial /dev/ada4

smartctl -a -q noserial /dev/ada3


and please post it in CODE or attach it as a text file.

Thanks.



Ok, thanks for responding here is the putty log for ada4 :-


Code:
=~=~=~=~=~=~=~=~=~=~=~= PuTTY log 2013.08.18 20:14:40 =~=~=~=~=~=~=~=~=~=~=~=
clearsmartctl -a -q noserial /dev/ada4
 
smartctl 6.1 2013-03-16 r3800 [FreeBSD 9.1-STABLE amd64] (local build)
Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org
 
=== START OF INFORMATION SECTION ===
Model Family:    Western Digital Caviar Green (AF, SATA 6Gb/s)
Device Model:    WDC WD30EZRX-00MMMB0
Firmware Version: 80.00A80
User Capacity:    3,000,592,982,016 bytes [3.00 TB]
Sector Sizes:    512 bytes logical, 4096 bytes physical
Device is:        In smartctl database [for details use: -P show]
ATA Version is:  ATA8-ACS (minor revision not indicated)
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Sun Aug 18 20:09:39 2013 EST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
 
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
 
General SMART Values:
Offline data collection status:  (0x82)    Offline data collection activity
                    was completed without error.
                    Auto Offline Data Collection: Enabled.
Self-test execution status:      (  37)    The self-test routine was interrupted
                    by the host with a hard or soft reset.
Total time to complete Offline
data collection:        (51660) seconds.
Offline data collection
capabilities:              (0x7b) SMART execute Offline immediate.
                    Auto Offline data collection on/off support.
                    Suspend Offline collection upon new
                    command.
                    Offline surface scan supported.
                    Self-test supported.
                    Conveyance Self-test supported.
                    Selective Self-test supported.
SMART capabilities:            (0x0003)    Saves SMART data before entering
                    power-saving mode.
                    Supports SMART auto save timer.
Error logging capability:        (0x01)    Error logging supported.
                    General Purpose Logging supported.
Short self-test routine
recommended polling time:      (  2) minutes.
Extended self-test routine
recommended polling time:      ( 496) minutes.
Conveyance self-test routine
recommended polling time:      (  5) minutes.
SCT capabilities:            (0x3035)    SCT Status supported.
                    SCT Feature Control supported.
                    SCT Data Table supported.
 
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG    VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate    0x002f  200  200  051    Pre-fail  Always      -      0
  3 Spin_Up_Time            0x0027  178  152  021    Pre-fail  Always      -      8066
  4 Start_Stop_Count        0x0032  100  100  000    Old_age  Always      -      48
  5 Reallocated_Sector_Ct  0x0033  200  200  140    Pre-fail  Always      -      0
  7 Seek_Error_Rate        0x002e  200  200  000    Old_age  Always      -      0
  9 Power_On_Hours          0x0032  087  087  000    Old_age  Always      -      9841
10 Spin_Retry_Count        0x0032  100  253  000    Old_age  Always      -      0
11 Calibration_Retry_Count 0x0032  100  253  000    Old_age  Always      -      0
12 Power_Cycle_Count      0x0032  100  100  000    Old_age  Always      -      46
192 Power-Off_Retract_Count 0x0032  200  200  000    Old_age  Always      -      22
193 Load_Cycle_Count        0x0032  155  155  000    Old_age  Always      -      136285
194 Temperature_Celsius    0x0022  125  106  000    Old_age  Always      -      27
196 Reallocated_Event_Count 0x0032  200  200  000    Old_age  Always      -      0
197 Current_Pending_Sector  0x0032  200  200  000    Old_age  Always      -      0
198 Offline_Uncorrectable  0x0030  200  200  000    Old_age  Offline      -      0
199 UDMA_CRC_Error_Count    0x0032  200  200  000    Old_age  Always      -      0
200 Multi_Zone_Error_Rate  0x0008  200  200  000    Old_age  Offline      -      0
 
SMART Error Log Version: 1
No Errors Logged
 
SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Interrupted (host reset)      50%      9829        -
# 2  Short offline      Completed without error      00%      9824        -
# 3  Short offline      Completed without error      00%      9800        -
# 4  Short offline      Completed without error      00%      9776        -
# 5  Short offline      Completed without error      00%      9752        -
# 6  Short offline      Completed without error      00%      9728        -
# 7  Short offline      Completed without error      00%      9704        -
# 8  Short offline      Completed without error      00%      9680        -
# 9  Extended offline    Interrupted (host reset)      70%      9659        -
#10  Short offline      Interrupted (host reset)      90%      9657        -
#11  Short offline      Completed without error      00%      9635        -
#12  Short offline      Completed without error      00%      9611        -
#13  Short offline      Completed without error      00%      9588        -
#14  Short offline      Completed without error      00%      9564        -
#15  Short offline      Completed without error      00%      9540        -
#16  Short offline      Completed without error      00%      9516        -
#17  Extended offline    Interrupted (host reset)      30%      9499        -
#18  Short offline      Completed without error      00%      9492        -
#19  Short offline      Completed without error      00%      9468        -
#20  Short offline      Completed without error      00%      9445        -
#21  Short offline      Completed without error      00%      9421        -
 
SMART Selective self-test log data structure revision number 1
SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.


And here is the putty log for ada3:-

Code:
]0;root@freenas:~[root@freenas] ~# smartctl -a -q noserial /dev/ada3
 
smartctl 6.1 2013-03-16 r3800 [FreeBSD 9.1-STABLE amd64] (local build)
Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org
 
=== START OF INFORMATION SECTION ===
Model Family:    Western Digital Caviar Green (AF, SATA 6Gb/s)
Device Model:    WDC WD30EZRX-00MMMB0
Firmware Version: 80.00A80
User Capacity:    3,000,592,982,016 bytes [3.00 TB]
Sector Sizes:    512 bytes logical, 4096 bytes physical
Device is:        In smartctl database [for details use: -P show]
ATA Version is:  ATA8-ACS (minor revision not indicated)
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Sun Aug 18 20:12:46 2013 EST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
 
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
 
General SMART Values:
Offline data collection status:  (0x82)    Offline data collection activity
                    was completed without error.
                    Auto Offline Data Collection: Enabled.
Self-test execution status:      (  36)    The self-test routine was interrupted
                    by the host with a hard or soft reset.
Total time to complete Offline
data collection:        (49800) seconds.
Offline data collection
capabilities:              (0x7b) SMART execute Offline immediate.
                    Auto Offline data collection on/off support.
                    Suspend Offline collection upon new
                    command.
                    Offline surface scan supported.
                    Self-test supported.
                    Conveyance Self-test supported.
                    Selective Self-test supported.
SMART capabilities:            (0x0003)    Saves SMART data before entering
                    power-saving mode.
                    Supports SMART auto save timer.
Error logging capability:        (0x01)    Error logging supported.
                    General Purpose Logging supported.
Short self-test routine
recommended polling time:      (  2) minutes.
Extended self-test routine
recommended polling time:      ( 478) minutes.
Conveyance self-test routine
recommended polling time:      (  5) minutes.
SCT capabilities:            (0x3035)    SCT Status supported.
                    SCT Feature Control supported.
                    SCT Data Table supported.
 
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG    VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate    0x002f  200  200  051    Pre-fail  Always      -      0
  3 Spin_Up_Time            0x0027  173  145  021    Pre-fail  Always      -      8333
  4 Start_Stop_Count        0x0032  100  100  000    Old_age  Always      -      48
  5 Reallocated_Sector_Ct  0x0033  200  200  140    Pre-fail  Always      -      0
  7 Seek_Error_Rate        0x002e  200  200  000    Old_age  Always      -      0
  9 Power_On_Hours          0x0032  087  087  000    Old_age  Always      -      9841
10 Spin_Retry_Count        0x0032  100  253  000    Old_age  Always      -      0
11 Calibration_Retry_Count 0x0032  100  253  000    Old_age  Always      -      0
12 Power_Cycle_Count      0x0032  100  100  000    Old_age  Always      -      46
192 Power-Off_Retract_Count 0x0032  200  200  000    Old_age  Always      -      22
193 Load_Cycle_Count        0x0032  156  156  000    Old_age  Always      -      133661
194 Temperature_Celsius    0x0022  124  102  000    Old_age  Always      -      28
196 Reallocated_Event_Count 0x0032  200  200  000    Old_age  Always      -      0
197 Current_Pending_Sector  0x0032  200  200  000    Old_age  Always      -      0
198 Offline_Uncorrectable  0x0030  200  200  000    Old_age  Offline      -      0
199 UDMA_CRC_Error_Count    0x0032  200  200  000    Old_age  Always      -      0
200 Multi_Zone_Error_Rate  0x0008  200  200  000    Old_age  Offline      -      0
 
SMART Error Log Version: 1
No Errors Logged
 
SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Interrupted (host reset)      40%      9830        -
# 2  Short offline      Completed without error      00%      9825        -
# 3  Short offline      Completed without error      00%      9801        -
# 4  Short offline      Completed without error      00%      9777        -
# 5  Short offline      Completed without error      00%      9753        -
# 6  Short offline      Completed without error      00%      9729        -
# 7  Short offline      Completed without error      00%      9705        -
# 8  Short offline      Completed without error      00%      9681        -
# 9  Short offline      Completed without error      00%      9657        -
#10  Short offline      Completed without error      00%      9635        -
#11  Short offline      Completed without error      00%      9611        -
#12  Short offline      Completed without error      00%      9589        -
#13  Short offline      Completed without error      00%      9565        -
#14  Short offline      Completed without error      00%      9541        -
#15  Short offline      Completed without error      00%      9517        -
#16  Extended offline    Completed without error      00%      9502        -
#17  Short offline      Completed without error      00%      9493        -
#18  Short offline      Completed without error      00%      9469        -
#19  Short offline      Completed without error      00%      9445        -
#20  Short offline      Completed without error      00%      9421        -
#21  Short offline      Completed without error      00%      9397        -
 
SMART Selective self-test log data structure revision number 1
SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
 

cyberjock

Inactive Account
Joined
Mar 25, 2012
Messages
19,526
Drives themselves are healthy.

But, you need to use the wdidle tool on them or you are going to wear them out. I can see from your high load cycle count that you haven't changed the intellipark setting(what the wdidle tool changes). You're at about 130k, and the drives are only rated for 250k. Use the WDIDLE tool and set your disks to 300 seconds.
 

cyberjock

Inactive Account
Joined
Mar 25, 2012
Messages
19,526
I'd try changing your SATA cables for ada3 and ada4.
 

antsrealm

Explorer
Joined
Jan 28, 2013
Messages
82
I'd try changing your SATA cables for ada3 and ada4.

I did the wdidle tool last week and set it to disable the head park all together and the load cycle count hasn't gone up since doing that.

I'll swap the sata cables tonight and see what happens. Do people really have problems with sata cables ? Seems unlikely especially two. Stranger things have happened I suppose.

Thanks for the help.
 

cyberjock

Inactive Account
Joined
Mar 25, 2012
Messages
19,526
It does happen with sata cables. Rarely 2 at a time.. so it could be something wrong with your sata controller, bad power supply, etc. But the sata cables is the easiest and cheapest to check/fix.
 

antsrealm

Explorer
Joined
Jan 28, 2013
Messages
82
It does happen with sata cables. Rarely 2 at a time.. so it could be something wrong with your sata controller, bad power supply, etc. But the sata cables is the easiest and cheapest to check/fix.


Sata cables replaced with decent sata 3 lockable ones, sadly still getting the same parity/crc error on ada4... the power supply is nothing special so I might replace that next with one of the 80 plus gold ones and see what happens.

What problems can I expect with the system while I am getting this error ??
 

antsrealm

Explorer
Joined
Jan 28, 2013
Messages
82
Also can I change ada4 into the spare / unused sata port on the motherboard to see if that helps or does freenas require the disk to stay on the same sata port ??
 

antsrealm

Explorer
Joined
Jan 28, 2013
Messages
82
Ok it has been two days and no parity / crc errors since I changed the sata port on the mother board! Usually I would see them within 30 minutes of booting the system so I think it's safe to say this problem is solved.

Thanks for the help guys.
 

rdyson

Cadet
Joined
Aug 22, 2013
Messages
1
Thanks for suggesting swapping SATA cables cyberjock. It appears a bad cable was causing CAM status: Uncorrectable parity/CRC error for me.
 

Yatti420

Wizard
Joined
Aug 12, 2012
Messages
1,437
For consumer grade builds why not 80 plus platinum.. +/- 90% efficiency? Get a good PSU!..
 

cyberjock

Inactive Account
Joined
Mar 25, 2012
Messages
19,526
I know that the platinums I've priced out are usually so expensive that the electricity saved doesn't offset the difference in cost between a gold and platinum. Sometimes even going Gold isn't a cost savings.
 

Yatti420

Wizard
Joined
Aug 12, 2012
Messages
1,437
I thought that would be the standard argument about them.. It depends on the load etc right.. For 80$ - 100$ I bet something decent can be picked up.. I'd hope it'd of higher quality with better components to obtain platinum certification.. Although I haven't looked in-depth at 80plus..

Edit: Can get a decent Antec EA-550 for $80 clearance here.. The 650s seem to be standard now around $115..
 

cyberjock

Inactive Account
Joined
Mar 25, 2012
Messages
19,526
There's several factors. FreeNAS servers are usually built to be low power, and you ideally want loading to be between 30-80%. So if you build a server with 200watts of load, you really want to stay around 500-600watts tops. To achieve the expected efficiency of platinum the wattage has to go up. Getting extremely efficient at low wattages is actually quite hard.

Also, consider this. If I go from gold to platinum you'll save at best, 3%. If your machine really draws 1000watts(which is very very very unlikely) you will save a whole 30 watts(3% of 1000watts). If you do the math you'll find that the saving's isn't that much.

Now, change your numbers to more closely represent real world usage. Assume 150w for your machine (which is overkill for about 80% of desktops and FreeNAS servers), and with a 3% savings you'll save a whole 4.5w. Now tell me its cost effective to buy platinum over gold. ;)

Then add in that if for any reason you didn't use it for the entire projected lifespan, either through failure or you upgraded, then you almost guaranteed yourself that you'd lose money.

The real savings is going from a non-80+ to an 80+ certified. The non-80+ are usually around 50-60% efficient because of the design. They can also be designed to regulate voltage better, but that's a whole different discussion. But that jump from 60% to 80%+ is pretty significant.
 

marcevan

Patron
Joined
Dec 15, 2013
Messages
432
I'm now getting the same errors on one drive: ADA4. Comes up every check cycle. Now I already swapped out the cables so I doubt old cable and new cable are causes.

Leaving either power or Mobo SATA connectors/handler.

Any reliable way to see if it's the mobo vs the power?

And if power, switching to new power is fairly painless (operationally speaking), new MOBO seems a death wish: would that mess up anything in FreeNAS Storage or is it fairly plug and play?
 

cyberjock

Inactive Account
Joined
Mar 25, 2012
Messages
19,526
Yes, try a different SATA controller like the OP did. If it fixes it then its the motherboard(which is what the OP determined). If not, then the PSU is a very likely suspect.

FreeNAS is pretty plug-n-play as long as your new hardware is still compatible with it. Most people screw this up and get hardware that isn't compatible. Then somehow surprisingly their pool doesn't mount and they are in shock. But their data is completely safe while the shop around to find the proper hardware to use. ;)
 

marcevan

Patron
Joined
Dec 15, 2013
Messages
432
Well, the mobo (circa 2007) has 6 SATA ports and I'm using all of them with 6 x 3TB drives.

I suspect I could add a PCI(-e) card with a port or two and switch over to that but if it still persists it's the mobo proper and not a physical port issue.

One thing I've learned about doing this: when adding a PCI(-e) card with SATA port(s), reboot. Otherwise if seated with power on, it will take up existing /dev/ADAn and will foul up your storage. You're best off powering down and seating the card, then booting up; less desirable is seating the card then rebooting.
 

cyberjock

Inactive Account
Joined
Mar 25, 2012
Messages
19,526
Wait.. you've seated a PCIe card with the power on? HOLY SMOKES!


I didn't realize anyone had such iron....

More to the point, isn't that like.. REALLY stupid to do?(no offense)
 
Status
Not open for further replies.
Top