Richard666 (Cadet; joined Jan 20, 2014; 5 messages)
VMware ESXi v5.1.0, 1065491
FreeNAS-9.2.0-RELEASE-x64.iso
Supermicro X9DR3-LN4+
96 GB ECC RAM; passed one full memtest pass plus 19% of a second (that 19% alone took 11 hours)
PCIe device passthrough for my LSI SAS 9206-16E HBA (2 x LSI 2308; one is passed through to this VM)
All hardware and disks are on a UPS
Pool is an 11-drive RAIDZ3
Poolname: rdkmovie01
Datasetname: rdkmovie01
There is no backup of the data. As Murphy's law dictates, I had just started one, having finally structured all the folders and data the way I wanted and cleaned everything up.
In summary: the pool is there, all devices are available and online, and the labels appear to check out, but importing the pool crashes the OS. I can view the structure to some degree using zdb, but it hits points where it crashes or fails. The attached images and the command output below show the details. The long story follows, along with the troubleshooting steps I've tried.
The pool was originally created under FreeNAS 9.1, I believe from the GUI. I don't remember for certain, as I went back and forth with NAS4Free before giving up on it because its GUI failed to create the array. The VM had been working great, and I recently upgraded my other VM to XBMC v13 (Gotham) to view my movie collection. While performing my library scan (the first one, I think), my FreeNAS VM crashed. It was a 'Fatal trap' of some kind, but I didn't make any notes. I rebooted the VM and thought nothing of it. I believe the scan continued and crashed the FreeNAS VM again. This time the FreeNAS VM wouldn't boot; I got some kind of error that led me to believe the contents of the virtual drive were corrupt.
I created a new VM and did a fresh install of FreeNAS 9.2 release x64. I loaded up my database from the other VM and upon boot got a 'Fatal trap 12: page fault while in kernel mode' (attached freenas_92rel_crash.jpg).
I went through and determined that the rdkmovie01 pool was the issue. I then tried to import the pool manually (zpool import rdkmovie01), which resulted in another 'fatal trap 12'. Upon reboot it crashed again with a 'fatal trap 12'. It turns out another pool (rdkscratch01) was now corrupt as well. I am able to mount rdkscratch01 in read-only mode and copy the data off.
I've now split my LSI 9206-16E controller between two VMs, one with the stable/working pools and the other with rdkmovie01 + spare drives to tinker with.
With all the detailed history above, here's what I've done so far:
Ubuntu 12.04.3 server x64
zpool import -f rdkmovie01 results in a crash (Ubuntu_crash.jpg)
OpenIndiana dev151a8 x86
zpool import sees the pool but says all its drives are unavailable (openindiana_zpool_import.jpg). zpool import -f rdkmovie01 results in 'invalid vdev configuration'
Solaris 11.1 liveCD x86
zpool import -f rdkmovie01 results in 'invalid vdev configuration'
FreeNAS 9.2.0 x64 following http://forums.freenas.org/threads/zfs-has-failed-you.11951/
zpool import -fR /mnt rdkmovie01 results in the 'fatal trap 12' crash
zpool import -fo readonly=on -R /mnt rdkmovie01 results in the 'fatal trap 12' crash
zpool import -R /mnt -FX poolname: I have not run this, as it says it can destroy the pool
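The attempts above can be written down as an escalation ladder, ordered from least to most invasive. Below is a minimal Python sketch that only builds and prints the command lines; it does not run anything. The `-F -n` dry-run step is an addition that is not among the attempts listed above (per the zpool man page, `-n` used with `-F` reports whether recovery could make the pool importable without performing it), and the function name `import_ladder` is hypothetical.

```python
def import_ladder(pool, altroot="/mnt"):
    """Return zpool import command lines, least invasive first.

    Nothing is executed here; the caller decides what (if anything) to run.
    """
    return [
        # Read-only import under an alternate root.
        ["zpool", "import", "-f", "-o", "readonly=on", "-R", altroot, pool],
        # Plain forced import under an alternate root.
        ["zpool", "import", "-f", "-R", altroot, pool],
        # Assumption (not one of the attempts above): dry run of recovery
        # mode, which reports whether -F could work without rewinding.
        ["zpool", "import", "-f", "-F", "-n", "-R", altroot, pool],
        # Extreme rewind; may discard recent transactions. Last resort.
        ["zpool", "import", "-f", "-FX", "-R", altroot, pool],
    ]


if __name__ == "__main__":
    for step, cmd in enumerate(import_ladder("rdkmovie01"), start=1):
        print(step, " ".join(cmd))
```

Keeping the list as data rather than executing it means each step can be reviewed, and the last one skipped, before anything touches the pool.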
Currently I've been reading through http://cuddletech.com/blog/?p=407, which covers using zdb to examine the pool and its structure. Based on that and on the posts I've read on these forums, here is the output of various commands that I suspect will be helpful. I've run all of them under Ubuntu 12.04.3, since it has proven not to crash while I troubleshoot and it reboots much faster than some of the other OSes. If needed, I can switch back to FreeNAS for the troubleshooting.
zpool import
note: shows all disks online and no errors of any kind
   pool: rdkmovie01
     id: 530582438526630596
  state: ONLINE
 status: The pool was last accessed by another system.
 action: The pool can be imported using its name or numeric identifier and
         the '-f' flag.
    see: http://zfsonlinux.org/msg/ZFS-8000-EY
 config:

         rdkmovie01                                      ONLINE
           raidz3-0                                      ONLINE
             ata-Hitachi_HDS723030ALA640_MK0303YHGD1E3C  ONLINE
             ata-Hitachi_HDS5C4040ALE630_PL2331LAGDL0VJ  ONLINE
             ata-Hitachi_HDS5C4040ALE630_PL2331LAGDP9JJ  ONLINE
             ata-HGST_HDS5C4040ALE630_PL2331LAGHY3PJ     ONLINE
             ata-HGST_HDS5C4040ALE630_PL2331LAGHVHEJ     ONLINE
             ata-Hitachi_HDS5C3030ALA630_MJ1321YNG12WBA  ONLINE
             ata-HGST_HDS5C4040ALE630_PL2331LAGHXRUJ     ONLINE
             ata-Hitachi_HDS5C3030ALA630_MJ1311YNG25LBA  ONLINE
             ata-Hitachi_HDS5C3030ALA630_MJ0351YNG809JA  ONLINE
             ata-Hitachi_HDS5C3030ALA630_MJ1321YNG0VN5A  ONLINE
             ata-HGST_HDS5C4040ALE630_PL2331LAGJ3B3J     ONLINE
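To double-check the "all disks online" reading of the output above, here is a small Python sketch that pulls names and states out of saved `zpool import` text. It assumes each config row is a name followed by a state word, as in the listing above. The function names and the `SAMPLE` text (a shortened excerpt of that listing) are hypothetical helpers for illustration, not part of any ZFS tooling.

```python
STATES = {"ONLINE", "DEGRADED", "FAULTED", "OFFLINE", "UNAVAIL", "REMOVED"}

# Shortened excerpt of the listing above, used only as demo input.
SAMPLE = """\
pool: rdkmovie01
state: ONLINE
config:
rdkmovie01 ONLINE
raidz3-0 ONLINE
ata-Hitachi_HDS723030ALA640_MK0303YHGD1E3C ONLINE
ata-HGST_HDS5C4040ALE630_PL2331LAGJ3B3J ONLINE
"""


def parse_config(text):
    """Map each name in the config section(s) to its reported state."""
    states = {}
    in_config = False
    for line in text.splitlines():
        fields = line.split()
        if not fields:
            continue
        if fields[0] == "pool:":
            # A new pool header ends any previous config section.
            in_config = False
        elif fields[0] == "config:":
            in_config = True
        elif in_config and len(fields) >= 2 and fields[1] in STATES:
            # Config rows look like "<name> <STATE>"; indentation is ignored.
            states[fields[0]] = fields[1]
    return states


def not_online(states):
    """Return the names whose state is anything other than ONLINE."""
    return sorted(name for name, state in states.items() if state != "ONLINE")


if __name__ == "__main__":
    print(not_online(parse_config(SAMPLE)))
```

An empty list from `not_online` only confirms what the import listing reports; it says nothing about the on-disk metadata that zdb trips over.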
dmesg
http://pastebin.com/AquCYSBJ
zdb -e rdkmovie01
note: this seems to stop at object 2368 and shows corruption/unknown values?
http://pastebin.com/Wj2DnsMm
zdb -eh rdkmovie01
note: just shows the history for this pool, didn't see anything helpful
http://pastebin.com/L4XWs5Te