-Adam-
Dabbler
- Joined
- Jan 10, 2017
- Messages
- 27
HI!
I have a strange problem with my backup FREENAS. I have set up RAID1 (since this is a replica backup of my main RAIDZ2 FREENAS).
3 days ago I got a notification that 1 of 4 HD is getting errors:
I have purchased new 3TB WD RED, but today second disk disappeared:
I have rebooted the machine and now data is available (second disk ada 2 is visible), but still ada1 is to be replaced due to the smart errors. My question is - why ada2 was removed from pool?
Could you please take a look at smartctl below and tell what you think? I will post results in next messages.
I'm definitely replacing ADA1, but ADA2 is ok in my opinion.
Thanks!
Adam
I have a strange problem with my backup FREENAS. I have set up RAID1 (since this is a replica backup of my main RAIDZ2 FREENAS).
3 days ago I got a notification that 1 of 4 HD is getting errors:
FREENAS-BACKUP.local kernel log messages:
(aprobe0:ata3:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00(aprobe0:ata3:0:0:0): CAM status: Command timeout(aprobe0:ata3:0:0:0): Retrying command(ada1:ata3:0:0:0): SETFEATURES ENABLE WCACHE. ACB: ef 02 00 00 00 40 00 00 00 00 00 00(ada1:ata3:0:0:0): CAM status: Command timeout(ada1:ata3:0:0:0): Retrying command(aprobe0:ata3:0:0:0): SET_MULTI. ACB: c6 00 00 00 00 40 00 00 00 00 10 00(aprobe0:ata3:0:0:0): CAM status: Command timeout(aprobe0:ata3:0:0:0): Retrying command(aprobe0:ata3:0:0:0): SETFEATURES SET TRANSFER MODE. ACB: ef 03 00 00 00 40 00 00 00 00 45 00(aprobe0:ata3:0:0:0): CAM status: Command timeout(aprobe0:ata3:0:0:0): Retrying command(ada1:ata3:0:0:0): SETFEATURES ENABLE RCACHE. ACB: ef aa 00 00 00 40 00 00 00 00 00 00(ada1:ata3:0:0:0): CAM status: Command timeout(ada1:ata3:0:0:0): Retrying command(ada1:ata3:0:0:0): SETFEATURES ENABLE WCACHE. ACB: ef 02 00 00 00 40 00 00 00 00 00 00(ada1:ata3:0:0:0): CAM status: Command timeout(ada1:ata3:0:0:0): Retrying command(ada1:ata3:0:0:0): SETFEATURES ENABLE RCACHE. ACB: ef aa 00 00 00 40 00 00 00 00 00 00(ada1:ata3:0:0:0): CAM status: Command timeout(ada1:ata3:0:0:0): Retrying command
Checking status of zfs pools:
NAME SIZE ALLOC FREE CKPOINT EXPANDSZ FRAG CAP DEDUP HEALTH ALTROOT
DOE_ARRAY 10.9T 6.27T 4.60T - - 0% 57% 1.00x DEGRADED /mnt
freenas-boot 14G 774M 13.2G - - - 5% 1.00x ONLINE -
pool: DOE_ARRAY
state: DEGRADED
status: One or more devices are faulted in response to persistent errors.
Sufficient replicas exist for the pool to continue functioning in a
degraded state.
action: Replace the faulted device, or use 'zpool clear' to mark the device
repaired.
scan: scrub repaired 0 in 0 days 04:47:09 with 0 errors on Sun Apr 28 04:47:10 2019
config:
NAME STATE READ WRITE CKSUM
DOE_ARRAY DEGRADED 0 0 0
raidz1-0 DEGRADED 0 0 0
gptid/d68b5873-2938-11e9-a464-001b21b6e94c ONLINE 0 0 0
gptid/d77038ae-2938-11e9-a464-001b21b6e94c FAULTED 1 416 0 too many errors
gptid/d85a31dd-2938-11e9-a464-001b21b6e94c ONLINE 0 0 0
gptid/d93b2fae-2938-11e9-a464-001b21b6e94c ONLINE 0 0 0
errors: No known data errors
New alerts:
* Device: /dev/ada1, not capable of SMART self-check
I have purchased new 3TB WD RED, but today second disk disappeared:
FREENAS-BACKUP.local kernel log messages:
(aprobe0:ata3:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00(aprobe0:ata3:0:0:0): CAM status: Command timeout(aprobe0:ata3:0:0:0): Retrying command(aprobe0:ata3:0:0:0): SET_MULTI. ACB: c6 00 00 00 00 40 00 00 00 00 10 00(aprobe0:ata3:0:0:0): CAM status: Command timeout(aprobe0:ata3:0:0:0): Retrying command(aprobe0:ata3:0:0:0): SETFEATURES SET TRANSFER MODE. ACB: ef 03 00 00 00 40 00 00 00 00 45 00(aprobe0:ata3:0:0:0): CAM status: Command timeout(aprobe0:ata3:0:0:0): Retrying command(aprobe0:ata3:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00(aprobe0:ata3:0:0:0): CAM status: Command timeout(aprobe0:ata3:0:0:0): Retrying command(aprobe0:ata3:0:0:0): SET_MULTI. ACB: c6 00 00 00 00 40 00 00 00 00 10 00(aprobe0:ata3:0:0:0): CAM status: Command timeout(aprobe0:ata3:0:0:0): Retrying command(ada1:ata3:0:0:0): SETFEATURES ENABLE RCACHE. ACB: ef aa 00 00 00 40 00 00 00 00 00 00(ada1:ata3:0:0:0): CAM status: Command timeout(ada1:ata3:0:0:0): Retrying command(aprobe0:ata3:0:0:0): SETFEATURES SET TRANSFER MODE. ACB: ef 03 00 00 00 40 00 00 00 00 45 00(aprobe0:ata3:0:0:0): CAM status: Command timeout(aprobe0:ata3:0:0:0): Retrying command(ada1:ata3:0:0:0): SETFEATURES ENABLE WCACHE. ACB: ef 02 00 00 00 40 00 00 00 00 00 00(ada1:ata3:0:0:0): CAM status: Command timeout(ada1:ata3:0:0:0): Retrying command(aprobe0:ata3:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00(aprobe0:ata3:0:0:0): CAM status: Command timeout(aprobe0:ata3:0:0:0): Retrying command(aprobe0:ata3:0:0:0): SET_MULTI. ACB: c6 00 00 00 00 40 00 00 00 00 10 00(aprobe0:ata3:0:0:0): CAM status: Command timeout(aprobe0:ata3:0:0:0): Retrying commandada2 at ata3 bus 0 scbus1 target 1 lun 0ada2: <WDC WD30EFRX-68EUZN0 82.00A82> s/n WD-WCC4N6EATE69 detachedada1 at ata3 bus 0 scbus1 target 0 lun 0ada1: <WDC WD30EFRX-68EUZN0 82.00A82> s/n WD-WCC4N4CH8V2A detachedGEOM_MIRROR: Device swap1: provider ada1p1 disconnected.GEOM_MIRROR: Device swap0: provider ada2p1 disconnected.(ada2:ata3:0:1:0): Periph destroyed(ada1:ata3:0:0:0): Periph destroyedGEOM_ELI: Device mirror/swap1.eli destroyed.GEOM_MIRROR: Device swap1: provider destroyed.GEOM_MIRROR: Device swap1 destroyed.GEOM_ELI: Device mirror/swap0.eli destroyed.GEOM_MIRROR: Device swap0: provider destroyed.GEOM_MIRROR: Device swap0 destroyed.GEOM_MIRROR: Device mirror/swap0 launched (2/2).GEOM_ELI: Device mirror/swap0.eli created.GEOM_ELI: Encryption: AES-XTS 128GEOM_ELI: Crypto: hardware
Checking status of zfs pools:
NAME SIZE ALLOC FREE CKPOINT EXPANDSZ FRAG CAP DEDUP HEALTH ALTROOT
DOE_ARRAY 10.9T 6.27T 4.60T - - 0% 57% 1.00x UNAVAIL /mnt
freenas-boot 14G 774M 13.2G - - - 5% 1.00x ONLINE -
pool: DOE_ARRAY
state: UNAVAIL
status: One or more devices are faulted in response to IO failures.
action: Make sure the affected devices are connected, then run 'zpool clear'.
see: http://illumos.org/msg/ZFS-8000-JQ
scan: scrub repaired 0 in 0 days 04:47:09 with 0 errors on Sun Apr 28 04:47:10 2019
config:
NAME STATE READ WRITE CKSUM
DOE_ARRAY UNAVAIL 0 0 0
raidz1-0 UNAVAIL 0 0 0
gptid/d68b5873-2938-11e9-a464-001b21b6e94c ONLINE 0 0 0
gptid/d77038ae-2938-11e9-a464-001b21b6e94c FAULTED 1 416 0 too many errors
13022967432022470486 REMOVED 0 0 0 was /dev/gptid/d85a31dd-2938-11e9-a464-001b21b6e94c
gptid/d93b2fae-2938-11e9-a464-001b21b6e94c ONLINE 0 0 0
errors: 2 data errors, use '-v' for a list
I have rebooted the machine and now data is available (second disk ada 2 is visible), but still ada1 is to be replaced due to the smart errors. My question is - why ada2 was removed from pool?
Could you please take a look at smartctl below and tell what you think? I will post results in next messages.
I'm definitely replacing ADA1, but ADA2 is ok in my opinion.
Thanks!
Adam