Hi all,
This is my first FreeNAS setup, brand new 9.10 with latest upgrades, so far I've been mostly impressed, except this morning when things turned badly.
The machine, a standard (yes I know) dell studio xps 435mt motherboard with a core i7 920 + 8G non-ECC RAM was in panic() state. I powered off the system as no local or remote shell was available, added 4 more GB (because the panic() seemed related to swap space) and booted it up again.
I've ran a complete memtest to get sure the memory was not faulty, everything's good on that side.
Once up, "zpool status" showed that one disk was in DEGRADED state, with a couple of checksum errors. I assumed this had been caused by the power-off, so I ran a "zpool scrub". After about 1 hour, all disks were ONLINE, I then assumed (and I might have been very wrong here) that I could "zpool clear" at that point. After running once again "zpool status", I realized another "scrub" had started, and shortly after that, the status of both disks that were ONLINE at first was REMOVED. The console showed that GEOM had them destroyed. Here I thought the NAS was simply broken.
I then rebooted once again the server, and all my disks were imported without any errors, a "scrub" automatically ran for about 20 minutes and all my disks were back ONLINE.
I assume the process I followed was wrong but I can't really figure out what happened there, could any grownup elaborate on this failure, and what could have been the correct procedure? I tried hard to find such a methodology but I could only find information about disk replacement. Note I'm not complaining about the initial panic(), I didn't use the recommended hardware, I knew the risks.
Thanks a lot
This is my first FreeNAS setup, brand new 9.10 with latest upgrades, so far I've been mostly impressed, except this morning when things turned badly.
The machine, a standard (yes I know) dell studio xps 435mt motherboard with a core i7 920 + 8G non-ECC RAM was in panic() state. I powered off the system as no local or remote shell was available, added 4 more GB (because the panic() seemed related to swap space) and booted it up again.
I've ran a complete memtest to get sure the memory was not faulty, everything's good on that side.
Once up, "zpool status" showed that one disk was in DEGRADED state, with a couple of checksum errors. I assumed this had been caused by the power-off, so I ran a "zpool scrub". After about 1 hour, all disks were ONLINE, I then assumed (and I might have been very wrong here) that I could "zpool clear" at that point. After running once again "zpool status", I realized another "scrub" had started, and shortly after that, the status of both disks that were ONLINE at first was REMOVED. The console showed that GEOM had them destroyed. Here I thought the NAS was simply broken.
I then rebooted once again the server, and all my disks were imported without any errors, a "scrub" automatically ran for about 20 minutes and all my disks were back ONLINE.
I assume the process I followed was wrong but I can't really figure out what happened there, could any grownup elaborate on this failure, and what could have been the correct procedure? I tried hard to find such a methodology but I could only find information about disk replacement. Note I'm not complaining about the initial panic(), I didn't use the recommended hardware, I knew the risks.
Thanks a lot