I'm seeing IO errors and Bad checksums in arc_summary.pl on my l2arc device. The errors start showing up after hours of usage, usually 100+ GB of populated data at that point. Another weird thing is that the size is reported wrong, eg. over my SSD size. Might be due to compression, but I dont have anything that has been compressed over 1.3x.
Hardware:
Generic build with AMD 5800k, 32GB of NON-ecc RAM (tested multiple times with memtest), Kingston V300 60GB SSD.
L2 ARC Summary: (DEGRADED)
Passed Headroom: 13.06m
Tried Lock Failures: 146.61k
IO In Progress: 465
Low Memory Aborts: 20
Free on Write: 6.55k
Writes While Full: 19.11k
R/W Clashes: 26
Bad Checksums: 11.12k
IO Errors: 4.35k
SPA Mismatch: 337.01m
pool: tank
state: ONLINE
scan: scrub repaired 0 in 27h41m with 0 errors on Mon Aug 26 02:21:53 2013
config:
NAME STATE READ WRITE CKSUM
tank ONLINE 0 0 0
raidz2-0 ONLINE 0 0 0
gptid/38d32808-f0a3-11e2-bf20-0017087e7ad4 ONLINE 0 0 0
gptid/39766b13-f0a3-11e2-bf20-0017087e7ad4 ONLINE 0 0 0
gptid/3a2e1011-f0a3-11e2-bf20-0017087e7ad4 ONLINE 0 0 0
gptid/3af0e894-f0a3-11e2-bf20-0017087e7ad4 ONLINE 0 0 0
gptid/3bb4c993-f0a3-11e2-bf20-0017087e7ad4 ONLINE 0 0 0
gptid/3c8c502e-f0a3-11e2-bf20-0017087e7ad4 ONLINE 0 0 0
gptid/3d69c1ad-f0a3-11e2-bf20-0017087e7ad4 ONLINE 0 0 0
gptid/3e4bb0f4-f0a3-11e2-bf20-0017087e7ad4 ONLINE 0 0 0
gptid/3f26a89a-f0a3-11e2-bf20-0017087e7ad4 ONLINE 0 0 0
gptid/400252f1-f0a3-11e2-bf20-0017087e7ad4 ONLINE 0 0 0
cache
gptid/18550d61-0dae-11e3-a860-0017087e7ad4 ONLINE 0 0 0
Things I have tried out:
- Changing SSD (OCZ Vertex 2 -> Kingston V300)
- Changing SATA -cables
- Changing PSU-cable
- Changing SATA-controller
- Assigning the device to a another pool
I have no problems running memtest86 for over 24h and the server runs fine otherwise. I haven't seen any errors reported in zpool status and scrubs have been error free.
Hardware:
Generic build with AMD 5800k, 32GB of NON-ecc RAM (tested multiple times with memtest), Kingston V300 60GB SSD.
L2 ARC Summary: (DEGRADED)
Passed Headroom: 13.06m
Tried Lock Failures: 146.61k
IO In Progress: 465
Low Memory Aborts: 20
Free on Write: 6.55k
Writes While Full: 19.11k
R/W Clashes: 26
Bad Checksums: 11.12k
IO Errors: 4.35k
SPA Mismatch: 337.01m
pool: tank
state: ONLINE
scan: scrub repaired 0 in 27h41m with 0 errors on Mon Aug 26 02:21:53 2013
config:
NAME STATE READ WRITE CKSUM
tank ONLINE 0 0 0
raidz2-0 ONLINE 0 0 0
gptid/38d32808-f0a3-11e2-bf20-0017087e7ad4 ONLINE 0 0 0
gptid/39766b13-f0a3-11e2-bf20-0017087e7ad4 ONLINE 0 0 0
gptid/3a2e1011-f0a3-11e2-bf20-0017087e7ad4 ONLINE 0 0 0
gptid/3af0e894-f0a3-11e2-bf20-0017087e7ad4 ONLINE 0 0 0
gptid/3bb4c993-f0a3-11e2-bf20-0017087e7ad4 ONLINE 0 0 0
gptid/3c8c502e-f0a3-11e2-bf20-0017087e7ad4 ONLINE 0 0 0
gptid/3d69c1ad-f0a3-11e2-bf20-0017087e7ad4 ONLINE 0 0 0
gptid/3e4bb0f4-f0a3-11e2-bf20-0017087e7ad4 ONLINE 0 0 0
gptid/3f26a89a-f0a3-11e2-bf20-0017087e7ad4 ONLINE 0 0 0
gptid/400252f1-f0a3-11e2-bf20-0017087e7ad4 ONLINE 0 0 0
cache
gptid/18550d61-0dae-11e3-a860-0017087e7ad4 ONLINE 0 0 0
Things I have tried out:
- Changing SSD (OCZ Vertex 2 -> Kingston V300)
- Changing SATA -cables
- Changing PSU-cable
- Changing SATA-controller
- Assigning the device to a another pool
I have no problems running memtest86 for over 24h and the server runs fine otherwise. I haven't seen any errors reported in zpool status and scrubs have been error free.