m0t0rh3ad
Dabbler
- Joined
- Jul 13, 2020
- Messages
- 32
Hello, I'm using TrueNAS-SCALE-20.12-ALPHA.
During a scrub, one of the disks became FAULTED and another DEGRADED:
zpool status -x
pool: storage
state: DEGRADED
status: One or more devices are faulted in response to persistent errors.
Sufficient replicas exist for the pool to continue functioning in a
degraded state.
action: Replace the faulted device, or use 'zpool clear' to mark the device
repaired.
scan: scrub in progress since Sat Jan 16 10:26:48 2021
18.1T scanned at 671M/s, 15.8T issued at 587M/s, 52.7T total
396M repaired, 30.07% done, 18:16:00 to go
config:
NAME STATE READ WRITE CKSUM
storage DEGRADED 0 0 0
raidz2-0 DEGRADED 0 0 0
25f5c8e3-dad5-4317-9cd0-541eb34b02c1 ONLINE 0 0 0
f7166c07-fa60-4b8f-be4c-6bafca895a6f ONLINE 0 0 0
b98c2d82-2be3-4713-9cfd-df3366bf6056 ONLINE 0 0 0
a0a8f7b7-73a8-4107-984f-6107ac9bf869 ONLINE 0 0 0
9edf71ba-b7b6-4d17-a8fa-afd2c6f25598 FAULTED 36 1 0 too many errors (repairing)
c5cd41e3-d069-4553-8702-514256a365bc ONLINE 0 0 0
raidz2-1 DEGRADED 0 0 0
b5efefcc-8e65-4c45-a813-9f4d60f891ee ONLINE 0 0 0
fec84c1d-84a1-49c8-8f15-566e6185e4fe ONLINE 0 0 0
f3a37a11-f0bc-4e58-8f2b-b586f8f720dd ONLINE 0 0 0
043a1431-0bbd-4b6a-913f-ca0c45b6c1f3 ONLINE 0 0 0
c1c71649-68f0-4da9-9d30-bd81b32e5ef9 DEGRADED 0 0 12.6K too many errors (repairing)
ead04bff-68f9-4a59-90f6-bccb61a78f99 ONLINE 0 0 3 (repairing)
errors: No known data errors
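To make the affected devices easier to spot, here is a minimal Python sketch that pulls out only the rows of the `zpool status` config table with non-zero READ/WRITE/CKSUM counters. The function names and the regular expression are illustrative only and not part of any ZFS tooling. Treating the `K` suffix as a multiple of 1024 is an assumption, so abbreviated counts such as 12.6K should be read as approximate.

```python
import re

# Assumed multipliers for abbreviated counters such as "12.6K".
# Whether zpool means 1000 or 1024 here is an assumption; the value is approximate.
SUFFIXES = {"K": 1024, "M": 1024 ** 2, "G": 1024 ** 3}

# Matches rows of the config table: NAME STATE READ WRITE CKSUM [notes...]
ROW = re.compile(
    r"^\s*(\S+)\s+(ONLINE|DEGRADED|FAULTED|OFFLINE|UNAVAIL|REMOVED)"
    r"\s+(\S+)\s+(\S+)\s+(\S+)"
)


def parse_count(text):
    """Convert a counter such as '36' or '12.6K' to an (approximate) int."""
    if text[-1] in SUFFIXES:
        return round(float(text[:-1]) * SUFFIXES[text[-1]])
    return int(text)


def devices_with_errors(status_text):
    """Return (name, state, read, write, cksum) for rows with any non-zero counter."""
    hits = []
    for line in status_text.splitlines():
        match = ROW.match(line)
        if not match:
            continue  # header lines, 'state: DEGRADED', scan info, etc.
        name, state = match.group(1), match.group(2)
        read, write, cksum = (parse_count(c) for c in match.groups()[2:])
        if read or write or cksum:
            hits.append((name, state, read, write, cksum))
    return hits


if __name__ == "__main__":
    # Three rows copied from the output above.
    sample = """
    raidz2-0 DEGRADED 0 0 0
    9edf71ba-b7b6-4d17-a8fa-afd2c6f25598 FAULTED 36 1 0 too many errors (repairing)
    c1c71649-68f0-4da9-9d30-bd81b32e5ef9 DEGRADED 0 0 12.6K too many errors (repairing)
    """
    for row in devices_with_errors(sample):
        print(row)
```

Run against the full output above, this would list three devices: the FAULTED one with read/write errors, the DEGRADED one with checksum errors, and the ONLINE one with 3 checksum errors.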
When I checked the disks' SMART data, I couldn't find anything "bad":
smartctl -a /dev/sdg
smartctl 7.1 2019-12-30 r5022 [x86_64-linux-5.9.0-1-amd64] (local build)
Copyright (C) 2002-19, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Vendor: HGST
Product: HUH721010AL5204
Revision: C384
Compliance: SPC-4
User Capacity: 10,000,831,348,736 bytes [10.0 TB]
Logical block size: 4096 bytes
LU is fully provisioned
Rotation Rate: 7200 rpm
Form Factor: 3.5 inches
Logical Unit id: 0x5000cca26b2b1ff8
Serial number: 1SGSR76Z
Device type: disk
Transport protocol: SAS (SPL-3)
Local Time is: Sat Jan 16 18:22:12 2021 +03
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
Temperature Warning: Enabled
=== START OF READ SMART DATA SECTION ===
SMART Health Status: OK
Grown defects during certification = 0
Total blocks reassigned during format = 0
Total new blocks reassigned = 0
Power on minutes since format = 307792
Current Drive Temperature: 42 C
Drive Trip Temperature: 85 C
Manufactured in week 41 of year 2017
Specified cycle count over device lifetime: 50000
Accumulated start-stop cycles: 29
Specified load-unload count over device lifetime: 600000
Accumulated load-unload cycles: 201
Elements in grown defect list: 0
Vendor (Seagate Cache) information
Blocks sent to initiator = 11330039840768000
Error counter log:
Errors Corrected by Total Correction Gigabytes Total
ECC rereads/ errors algorithm processed uncorrected
fast | delayed rewrites corrected invocations [10^9 bytes] errors
read: 0 1233620 0 1233620 18437875 68744.629 0
write: 0 0 0 0 2345199 29193.131 0
verify: 0 0 0 0 1788847 0.000 0
Non-medium error count: 0
SMART Self-test log
Num Test Status segment LifeTime LBA_first_err [SK ASC ASQ]
Description number (hours)
# 1 Background short Self test in progress ... - NOW - [- - -]
# 2 Foreground short Completed - 5088 - [- - -]
# 3 Foreground long Aborted (device reset ?) - 5087 - [- - -]
# 4 Foreground short Completed - 5082 - [- - -]
# 5 Background short Completed - 5082 - [- - -]
# 6 Background short Completed - 5082 - [- - -]
# 7 Background short Completed - 5082 - [- - -]
# 8 Background short Completed - 752 - [- - -]
# 9 Background short Completed - 1 - [- - -]
Long (extended) Self-test duration: 64492 seconds [1074.9 minutes]
smartctl -a /dev/sdn
smartctl 7.1 2019-12-30 r5022 [x86_64-linux-5.9.0-1-amd64] (local build)
Copyright (C) 2002-19, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Vendor: HGST
Product: HUH721010AL5204
Revision: C384
Compliance: SPC-4
User Capacity: 10,000,831,348,736 bytes [10.0 TB]
Logical block size: 4096 bytes
LU is fully provisioned
Rotation Rate: 7200 rpm
Form Factor: 3.5 inches
Logical Unit id: 0x5000cca266d3b32c
Serial number: 7JKSE7UC
Device type: disk
Transport protocol: SAS (SPL-3)
Local Time is: Sat Jan 16 18:22:51 2021 +03
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
Temperature Warning: Enabled
=== START OF READ SMART DATA SECTION ===
SMART Health Status: OK
Grown defects during certification = 0
Total blocks reassigned during format = 0
Total new blocks reassigned = 0
Power on minutes since format = 318948
Current Drive Temperature: 38 C
Drive Trip Temperature: 85 C
Manufactured in week 41 of year 2017
Specified cycle count over device lifetime: 50000
Accumulated start-stop cycles: 148
Specified load-unload count over device lifetime: 600000
Accumulated load-unload cycles: 371
Elements in grown defect list: 0
Vendor (Seagate Cache) information
Blocks sent to initiator = 10750650816135168
Error counter log:
Errors Corrected by Total Correction Gigabytes Total
ECC rereads/ errors algorithm processed uncorrected
fast | delayed rewrites corrected invocations [10^9 bytes] errors
read: 0 187 0 187 4496513 69141.144 0
write: 0 0 0 0 2410094 20699.158 0
verify: 0 0 0 0 28839 0.000 0
Non-medium error count: 0
SMART Self-test log
Num Test Status segment LifeTime LBA_first_err [SK ASC ASQ]
Description number (hours)
# 1 Background short Completed - 6276 - [- - -]
# 2 Background short Completed - 6253 - [- - -]
# 3 Background short Completed - 6231 - [- - -]
# 4 Background short Completed - 6207 - [- - -]
# 5 Background short Completed - 6183 - [- - -]
# 6 Background short Completed - 979 - [- - -]
Long (extended) Self-test duration: 64549 seconds [1075.8 minutes]
Can you suggest anything? Should I be worried about these disks and my data?