Marschmellow1337
Hi all,
I've got a weird situation and I can't work out what I'm doing wrong.
First some information:
I'm running TrueNAS Core 12.0-U7, installed as FreeNAS a couple of years ago and kept up to date since.
I have one pool, called Storage. It's a RAIDZ2 of 8 drives, 2TB each, installed in a Supermicro case with 24 front bays.
All of this is connected to 3 LSI cards in the PCI slots.
It's also connected to a UPS, and as far as I know there was no power failure.
A couple of weeks ago I started getting emails that one disk was accumulating more and more errors. Two weeks ago it died, and TrueNAS reported a degraded state for my pool.
This drive was labeled da7 in the TrueNAS disk management, which reported it as Faulted.
I ordered a replacement drive, a WD 2TB NAS drive, same as before.
I tried to offline the faulted disk for replacement, but it didn't go offline. A quick Google search suggested that a faulted drive is already effectively offline, so I could just continue with the replacement steps. I took the old drive out and put the new drive in. I clicked Replace and the new drive showed up. I ticked the Force option (so it's formatted first, as I understand it), and it started formatting the drive and adding it to the pool. It also started resilvering, and all disks were online. Great!
No.
When the resilvering was at about 15%, I got an email saying I had a faulted drive: the new drive that I had just installed. Weird.
I let the resilvering finish, thinking/hoping this was some sort of fluke because the drives are more stressed during this process.
But no, when the resilvering was finished the new disk was still faulted.
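For reference, the faulted member can also be picked out of the pool status from the shell. This is only a minimal sketch: the `list_unhealthy` helper name is made up for illustration, and it assumes the usual `NAME STATE READ WRITE CKSUM` table that `zpool status` prints in its config section.

```shell
#!/bin/sh
# Hypothetical helper (not part of TrueNAS or ZFS): reads `zpool status`
# output on stdin and prints every pool, vdev, or disk line whose state is
# not ONLINE, together with its read/write/checksum error counters.
list_unhealthy() {
    awk '
        # The config table starts at its header row.
        $1 == "NAME" && $2 == "STATE" { in_cfg = 1; next }
        # A blank line ends the table.
        in_cfg && NF == 0 { in_cfg = 0 }
        # Rows have at least: name, state, read, write, cksum.
        in_cfg && NF >= 5 && $2 != "ONLINE" {
            printf "%s %s read=%s write=%s cksum=%s\n", $1, $2, $3, $4, $5
        }
    '
}

# Usage on the live system (pool name "Storage" as in this post):
#   zpool status Storage | list_unhealthy
```

The counters are worth noting alongside the state: whether the errors are read, write, or checksum errors may help narrow down whether the disk, the cabling, or the controller path is the likelier suspect.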
I started a scrub; nothing had changed when that was done.
I tried
Code:
zpool clear Storage
which seemed to do the trick: all drives showed online and nothing was degraded. But it started resilvering again, and when it hit about 15% I got the same email: storage pool degraded with 1 drive faulted (still the same new drive).
OK, at this point I was thinking the new drive was DOA (dead on arrival). I sent it back to the store and got a replacement (different serial number, so brand new). Since they accepted the return, I figured I was right about the first drive being DOA.
Got the new drive, did the same steps as above, and got the same result: the new drive faulted again.
I also tried a different slot in my case, on a different backplane and a different LSI card. No luck.
At this point I have no idea what to do. Am I doing anything wrong here? Is there a bug in the software? Hoping you can answer this.
Thanks in advance!