Hi,
I am using TrueNAS Core v13.0 U4 and I really don't understand the information provided in the disk status notifications. I have a drive bench with 15 disks separated into two pools: Main and Archive. Each one has a hot spare. I also have a USB disk pack with 4 drives, used for backup and easy to grab quickly in case of fire.
I receive the following content by email daily and decided to act:
* Device: /dev/da18 [SAT], Self-Test Log error count increased from 0 to 1.
* Device: /dev/da20 [SAT], 16 Currently unreadable (pending) sectors.
* Device: /dev/da20 [SAT], 16 Offline uncorrectable sectors.
* Device: /dev/da18 [SAT], 16 Currently unreadable (pending) sectors.
* Device: /dev/da18 [SAT], 16 Offline uncorrectable sectors.
* Pool Main state is ONLINE: One or more devices has experienced an
unrecoverable error. An attempt was made to correct the error. Applications
are unaffected.
I deduce that disks /dev/da18 and /dev/da20 have problems and that they are in the pool named "Main", as it is the only pool named in this content with an error.
So, I go to the storage>disks section to find DA18.
It shows DA18 is in my "Archive" pool, not my "Main" pool! So I go to the dashboard and look at the Archive pool status: "disk w/error" says 0, so every disk in Archive is good, including DA18. So why do I receive a daily notification as if it were defective?
Now, the next one, DA20. It is not in my list in storage>disks…
So, I go to the pools screen and ask for the status of each one. DA20 is in the Main pool, but as a hot spare, and it shows 0 read, 0 write and 0 checksum errors. I was expecting 16 read errors… Also, DA3 indicates a checksum error, but it is not reported in the email…
Is the reason it did not show on the disk screen that it is a hot spare? I go back to the disk screen and see that the only hot spare I have for the Main pool is DA21!! And its pool is shown as "N/A". Only the label I put on this drive says it is in the Main pool.
So, I am really confused… Managing disks is a crucial part of a NAS, and I can't tell whether I am receiving false alarms or whether the information in the email or on the screens is simply wrong and can't be trusted.
So, my questions are:
- Did I do something wrong in my configuration to produce this nonsense?
- Why does the email report DA18 and DA20 as defective while the screens show no errors?
- Why does DA20 become DA21?
- Why is the error shown for DA3 on the screen not reported in the email?
Thank you for your help...