Hello everybody.
I've got really strange problems with an HP Gen8 G1610 (16 GB ECC) box running FreeNAS (FN) 9.10.2.
From the beginning:
I installed the system with 4x 2 TB in RAIDZ2 early last year.
No problems until last week.
The boot device is a mirror of two Kingston DataTraveler sticks.
One gave up without any notice.
I swapped out the faulty one and replaced it in the mirror.
During the resilver the second one threw read errors.
I decided it was best to replace both.
I put in a new Intenso and a new Toshiba 16 GB stick, installed FN from scratch, and restored the config from a backup.
FN was back up and everyone was happy.
Now, one week later (last night), I got a mail:
The boot volume state is ONLINE: One or more devices has experienced an unrecoverable error. An attempt was made to correct the error. Applications are unaffected.
The Toshiba stick has gone bad.
OK, a faulty new stick can happen.
Some /var/log/messages entries from the dying stick:
Code:
Jan 19 06:36:04 fnnas (da0:umass-sim0:0:0:0): READ(10). CDB: 28 00 01 cd ee 38 00 00 10 00
Jan 19 06:36:04 fnnas (da0:umass-sim0:0:0:0): CAM status: CCB request completed with an error
Jan 19 06:36:04 fnnas (da0:umass-sim0:0:0:0): Retrying command
Jan 19 06:36:04 fnnas (da0:umass-sim0:0:0:0): READ(10). CDB: 28 00 01 cd ee 38 00 00 10 00
Jan 19 06:36:04 fnnas (da0:umass-sim0:0:0:0): CAM status: CCB request completed with an error
Jan 19 06:36:04 fnnas (da0:umass-sim0:0:0:0): Retrying command
Jan 19 06:36:05 fnnas (da0:umass-sim0:0:0:0): READ(10). CDB: 28 00 01 cd ee 38 00 00 10 00
Jan 19 06:36:05 fnnas (da0:umass-sim0:0:0:0): CAM status: CCB request completed with an error
Jan 19 06:36:05 fnnas (da0:umass-sim0:0:0:0): Error 5, Retries exhausted
Jan 19 06:36:07 fnnas (da0:umass-sim0:0:0:0): got CAM status 0x44
Jan 19 06:36:07 fnnas (da0:umass-sim0:0:0:0): fatal error, failed to attach to device
Jan 19 06:36:07 fnnas da0 at umass-sim0 bus 0 scbus7 target 0 lun 0
Jan 19 06:36:07 fnnas da0: <TOSHIBA TransMemory 1.00> s/n 7427EA2C3BC7C0216ABA520B detached
Jan 19 06:36:08 fnnas (da0:umass-sim0:0:0:0): Periph destroyed
I added another stick and started a replace.
/var/log/messages:
Code:
Jan 19 12:23:20 fnnas da0 at umass-sim2 bus 2 scbus9 target 0 lun 0
Jan 19 12:23:20 fnnas da0: <Intenso Rainbow Line 8.07> Removable Direct Access SPC-2 SCSI device
Jan 19 12:23:20 fnnas da0: Serial Number C8EDFA52
Jan 19 12:23:20 fnnas da0: 40.000MB/s transfers
Jan 19 12:23:20 fnnas da0: 30000MB (61440000 512 byte sectors)
Jan 19 12:23:20 fnnas da0: quirks=0x2<NO_6_BYTE>
Jan 19 12:40:11 fnnas notifier: 32+0 records in
Jan 19 12:40:11 fnnas notifier: 32+0 records out
Jan 19 12:40:11 fnnas notifier: 33554432 bytes transferred in 3.492055 secs (9608793 bytes/sec)
Jan 19 12:40:15 fnnas notifier: dd: /dev/da0: end of device
Jan 19 12:40:15 fnnas notifier: 33+0 records in
Jan 19 12:40:15 fnnas notifier: 32+0 records out
Jan 19 12:40:15 fnnas notifier: 33554432 bytes transferred in 3.628615 secs (9247173 bytes/sec)
Jan 19 12:40:16 fnnas ZFS: vdev state changed, pool_guid=964699875610815041 vdev_guid=13863602393974146011
Jan 19 12:41:15 fnnas notifier: /usr/local/sbin/grub-install: Input/output error
Jan 19 12:41:21 fnnas manage.py: [py.warnings:206] /usr/local/www/freenasUI/../freenasUI/freeadmin/middleware.py:206: DeprecationWarning: BaseException.message has been deprecated as of Python 2.6
  else unicode(excp.message)
Jan 19 16:38:52 fnnas ZFS: vdev state changed, pool_guid=964699875610815041 vdev_guid=6409044869017274353
Some output from zpool status:
Code:
  pool: freenas-boot
 state: DEGRADED
status: One or more devices is currently being resilvered.  The pool will
        continue to function, possibly in a degraded state.
action: Wait for the resilver to complete.
  scan: resilver in progress since Thu Jan 19 12:40:25 2017
        574M scanned out of 650M at 75.7K/s, 0h17m to go
        568M resilvered, 88.29% done
config:
        NAME               STATE     READ WRITE CKSUM
        freenas-boot       DEGRADED     0     0   445
          mirror-0         DEGRADED     0     0   929
            replacing-0    FAULTED      0     0     0
              da0p2/old    FAULTED     18   301    44  too many errors
              da0p2        ONLINE       0     0     0  (resilvering)
            da1p2          DEGRADED     0     0   975  too many errors
The resilver is damn slow, and the "ok" USB stick seems to be going bad too.
A little later:
Code:
  pool: freenas-boot
 state: DEGRADED
status: One or more devices is currently being resilvered.  The pool will
        continue to function, possibly in a degraded state.
action: Wait for the resilver to complete.
  scan: resilver in progress since Thu Jan 19 12:40:25 2017
        676M scanned out of 650M at 73.0K/s, (scan is slow, no estimated time)
        668M resilvered, 103.93% done
config:
        NAME               STATE     READ WRITE CKSUM
        freenas-boot       DEGRADED     0     0   495
          mirror-0         DEGRADED     0     0 1.01K
            replacing-0    FAULTED      0     0     0
              da0p2/old    FAULTED     18   301    44  too many errors
              da0p2        ONLINE       0     0     0  (resilvering)
            da1p2          DEGRADED     0     0 1.05K  too many errors  (resilvering)
errors: 495 data errors, use '-v' for a list
After it finished I started a scrub; now both sticks are resilvering:
Code:
  pool: freenas-boot
 state: DEGRADED
status: One or more devices is currently being resilvered.  The pool will
        continue to function, possibly in a degraded state.
action: Wait for the resilver to complete.
  scan: resilver in progress since Thu Jan 19 16:39:14 2017
        270M scanned out of 652M at 100K/s, 1h4m to go
        269M resilvered, 41.34% done
config:
        NAME                        STATE     READ WRITE CKSUM
        freenas-boot                DEGRADED     0     0     3
          mirror-0                  DEGRADED     0     0     6
            replacing-0             DEGRADED     6     0     0
              4788658346873365335   UNAVAIL      0     0     0  was /dev/da0p2/old
              da0p2                 ONLINE       0     0     6  (resilvering)
            da1p2                   ONLINE       0     0     9  (resilvering)
errors: 273 data errors, use '-v' for a list
Even the brand-new stick already has checksum errors!?
Every stick is in a different USB port.
So the internal, rear and front ports have all been in use.
What the hell am I doing wrong?
What shall I do now?
The problem is that the FN box is a 1.5-hour drive away from me.
If the resilver works, should I add a third stick and make a 3-way mirror?
Since grub-install failed, will it still boot?
Thanks in advance.