pixel8383
Over the last few months my TrueNAS had two unscheduled shutdowns due to a hardware issue.
After each event I ran a scrub of my pool. The first time the pool came back with a HEALTHY status; this time seems to be different.
I couldn't really understand what happened, but I got the following alert:
* Pool NAS state is ONLINE: One or more devices has experienced an error resulting in data corruption. Applications may be affected.
The scrub process ended and I have an ONLINE (Unhealthy) status for my pool.
If I check the Pool Status page, I get:
SCRUB
Status: FINISHED
Errors: 3
Date: 2023-06-12 19:15:23
The pool status for my MIRROR pool shows:
(Name) (Read) (Write) (CheckSum) (Status)
Mirror 0 0 0 ONLINE
> ada0 0 0 2380 ONLINE
> ada1 0 0 2380 ONLINE
How should I read this table? Is the checksum a counter or the actual number of unrecovered data sectors?
My email report just returned these result summary:
########## ZPool status report for NAS ##########
pool: NAS
state: ONLINE
status: One or more devices has experienced an error resulting in data
corruption. Applications may be affected.
action: Restore the file in question if possible. Otherwise restore the
entire pool from backup.
see: https://openzfs.github.io/openzfs-docs/msg/ZFS-8000-8A
scan: scrub repaired 0B in 23:22:53 with 3 errors on Tue Jun 13 18:38:16 2023
config:
NAME STATE READ WRITE CKSUM
NAS ONLINE 0 0 0
mirror-0 ONLINE 0 0 0
gptid/e81e097b-53ba-11e8-a3a4-70106fca6c4a ONLINE 0 0 2.32K
gptid/e91c8486-53ba-11e8-a3a4-70106fca6c4a ONLINE 0 0 2.32K
errors: Permanent errors have been detected in the following files:
<0x99f>:<0x866c>
<0x99f>:<0x88a8>
<0x99f>:<0x88ef>
/mnt/NAS/iocage/jails/nextcloud/root/var/db/elasticsearch/nodes/0/indices/PiMxmfHoR0quyS28KggRtQ/0/index/_30l.fdt
########## ZPool status report for freenas-boot ##########
pool: freenas-boot
state: ONLINE
status: Some supported and requested features are not enabled on the pool.
The pool can still be used, but some features are unavailable.
action: Enable all features using 'zpool upgrade'. Once this is done,
the pool may no longer be accessible by software that does not support
the features. See zpool-features(7) for details.
scan: scrub repaired 0B in 00:00:18 with 0 errors on Wed May 31 03:45:18 2023
config:
NAME STATE READ WRITE CKSUM
freenas-boot ONLINE 0 0 0
ada2p2 ONLINE 0 0 0
errors: No known data errors
########## SMART status report for ada0 drive (Western Digital Red: WD-WCC7K0HVHCJT) ##########
smartctl 7.2 2021-09-14 r5236 [FreeBSD 13.1-RELEASE-p7 amd64] (local build)
SMART overall-health self-assessment test result: PASSED
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0
3 Spin_Up_Time 0x0027 176 152 021 Pre-fail Always - 6158
4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 132
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0
9 Power_On_Hours 0x0032 040 040 000 Old_age Always - 44378
10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 100 000 Old_age Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 114
192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 56
193 Load_Cycle_Count 0x0032 199 199 000 Old_age Always - 3453
194 Temperature_Celsius 0x0022 117 103 000 Old_age Always - 33
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0030 100 253 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 0
No Errors Logged
Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
Extended offline Completed without error 00% 43680 -
Short offline Completed without error 00% 44299 -
########## SMART status report for ada1 drive (Western Digital Red: WD-WCC7K5NYYJFV) ##########
smartctl 7.2 2021-09-14 r5236 [FreeBSD 13.1-RELEASE-p7 amd64] (local build)
SMART overall-health self-assessment test result: PASSED
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0
3 Spin_Up_Time 0x0027 174 158 021 Pre-fail Always - 6300
4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 126
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0
9 Power_On_Hours 0x0032 040 040 000 Old_age Always - 44377
10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 100 000 Old_age Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 114
192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 49
193 Load_Cycle_Count 0x0032 199 199 000 Old_age Always - 3730
194 Temperature_Celsius 0x0022 118 103 000 Old_age Always - 32
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0030 100 253 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 0
No Errors Logged
Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
Extended offline Completed without error 00% 43680 -
Short offline Completed without error 00% 44297 -
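As a sanity check on the numbers: the Pool Status page shows 2380 checksum errors per disk, while the email report shows 2.32K. Assuming zpool status abbreviates counters with 1024-based suffixes (my assumption, the report does not say so), these look like the same counter shown two ways. A minimal Python sketch of that arithmetic, where `expand_count` is just a hypothetical helper name:

```python
# Assumption: zpool status abbreviates counters with 1024-based suffixes.
SUFFIXES = {"K": 1024, "M": 1024 ** 2, "G": 1024 ** 3}


def expand_count(text: str) -> float:
    """Turn an abbreviated counter such as '2.32K' into an approximate number."""
    text = text.strip()
    if text and text[-1] in SUFFIXES:
        return float(text[:-1]) * SUFFIXES[text[-1]]
    return float(text)


gui_count = 2380        # CheckSum column from the Pool Status page
report_value = "2.32K"  # CKSUM column from the email report

# Approximate count implied by the abbreviated report value.
print(expand_count(report_value))

# The GUI count expressed in units of 1024, rounded to two decimals.
print(round(gui_count / 1024, 2))
```

If the assumption holds, the second value matches the 2.32 in the report, so the two views agree and the difference is only rounding.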
What do you suggest I do? I am running TrueNAS on an HP Microserver Gen10 with two 4TB WD Red hard drives.