Douche_Baguette
Cadet
- Joined
- Jul 8, 2015
- Messages
- 8
Hi everyone!
So, I've been running a freenas server for a while now, with my extra, spare drives. For a long time, the SSD was sitting idle, and the 1TB drive housed my jails and a volume of storage for music, movies, data, etc. The other 2 drives were striped as a single volume for Time Machine backups.
Well, the 1TB drive started throwing errors and reporting unreadable sectors, so I figured this was my chance to rebuild the system, better. So my current setup is:
250GB Samsung 840 Evo ("SSD") as my jails storage
3x2TB drives setup in ZRAID1 ("RAID") (or whatever it's called) for single disk redundancy and ~4TB of storage.
I have a Plex jail and a jail running an apache web server. I have the SSD set to never sleep/standby. The hard drives sleep after an hour.
Anyway, everything is pretty much working fine, but this morning I got an email at 3AM with a critical alert:
Weird. Then, around 2PM:
and 5 minutes later:
Then again at around 5, when streaming a movie with Plex, I got the same 2 emails back to back ("data corruption" and "unrecoverable error"). Even so, Plex didn't stutter and everything seems to be working fine.
If I SSH into freenas and run zpool status, I get:
As you can see, I tried a scrub, and it came back with no errors, and no repairs.
What could be causing this? What else can I do to troubleshoot? The SSD is pretty new and was used as my desktop's boot drive for maybe 6 months, then sat unused until now. I REALLY doubt the drive is actually failing, but I guess it's possible.
My motherboard's sata ports are SATA 2, so I am using a PCI express SATA 3 card for the SSD only. Should I try taking that out of the equation and running the SSD directly to the motherboard? It is, FWIW, a "crappy gamer-grade motherboard".
Full system specs:
ASUS P6T motherboard
12GB 1600MHz triple-channel DDR3 memory
Intel Core i7 920 2.66GHz
Antec 650W power supply
No graphics card installed during normal use.
So, I've been running a freenas server for a while now, with my extra, spare drives. For a long time, the SSD was sitting idle, and the 1TB drive housed my jails and a volume of storage for music, movies, data, etc. The other 2 drives were striped as a single volume for Time Machine backups.
Well, the 1TB drive started throwing errors and reporting unreadable sectors, so I figured this was my chance to rebuild the system, better. So my current setup is:
250GB Samsung 840 Evo ("SSD") as my jails storage
3x2TB drives setup in ZRAID1 ("RAID") (or whatever it's called) for single disk redundancy and ~4TB of storage.
I have a Plex jail and a jail running an apache web server. I have the SSD set to never sleep/standby. The hard drives sleep after an hour.
Anyway, everything is pretty much working fine, but this morning I got an email at 3AM with a critical alert:
The volume SSD (ZFS) state is ONLINE: One or more devices has experienced an unrecoverable error. An attempt was made to correct the error. Applications are unaffected.
Weird. Then, around 2PM:
The volume SSD (ZFS) state is ONLINE: One or more devices has experienced an error resulting in data corruption. Applications may be affected.
and 5 minutes later:
The volume SSD (ZFS) state is ONLINE: One or more devices has experienced an unrecoverable error. An attempt was made to correct the error. Applications are unaffected.
Then again at around 5, when streaming a movie with Plex, I got the same 2 emails back to back ("data corruption" and "unrecoverable error"). Even so, Plex didn't stutter and everything seems to be working fine.
If I SSH into freenas and run zpool status, I get:
pool: SSD
state: ONLINE
status: One or more devices has experienced an unrecoverable error. An
attempt was made to correct the error. Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
using 'zpool clear' or replace the device with 'zpool replace'.
see: http://illumos.org/msg/ZFS-8000-9P
scan: scrub repaired 0 in 0h0m with 0 errors on Wed Jul 8 17:27:46 2015
config:
NAME STATE READ WRITE CKSUM
SSD ONLINE 0 1 0
gptid/7ccc476e-238b-11e5-bf3b-6805ca145a40 ONLINE 0 1 0
errors: No known data errors
As you can see, I tried a scrub, and it came back with no errors, and no repairs.
What could be causing this? What else can I do to troubleshoot? The SSD is pretty new and was used as my desktop's boot drive for maybe 6 months, then sat unused until now. I REALLY doubt the drive is actually failing, but I guess it's possible.
My motherboard's sata ports are SATA 2, so I am using a PCI express SATA 3 card for the SSD only. Should I try taking that out of the equation and running the SSD directly to the motherboard? It is, FWIW, a "crappy gamer-grade motherboard".
Full system specs:
ASUS P6T motherboard
12GB 1600MHz triple-channel DDR3 memory
Intel Core i7 920 2.66GHz
Antec 650W power supply
No graphics card installed during normal use.