danjng
Explorer
- Joined
- Mar 20, 2017
- Messages
- 51
After seeing these errors, the GUI dies and all my VMs seem to die as well (which pull the Docker containers down too). During these errors, it seems that one of my Backup pool drives (not the one that contains the VMs) goes into a degraded state and says it was removed by an administrator (I did not do any sort of removal). Just the other day, the server became unresponsive (maybe due to the same issue) and after a hard reset, it went through the resilvering process which completed without any issues.
Anyone have any thoughts or ideas? I wouldn't think that a bad drive would cause a catastrophic failure like this, so there's got to be something else. zpool status reports all other drives and pools to be OK (with the exception of the Backup pool, of course). It doesn't seem to be shutting down either. I can still SSH to it and issue the shutdown command, but it's not doing anything. It might be in the middle of a scrub - would that make it unable to shut down?
Thanks for any help!!
Edit: If the Backup pool has some sort of swap or something critical to operations on it, but it is a mirrored pool, should this happen if only one drive goes bad?
Logs:
Anyone have any thoughts or ideas? I wouldn't think that a bad drive would cause a catastrophic failure like this, so there's got to be something else. zpool status reports all other drives and pools to be OK (with the exception of the Backup pool, of course). It doesn't seem to be shutting down either. I can still SSH to it and issue the shutdown command, but it's not doing anything. It might be in the middle of a scrub - would that make it unable to shut down?
Thanks for any help!!
Edit: If the Backup pool has some sort of swap or something critical to operations on it, but it is a mirrored pool, should this happen if only one drive goes bad?
Logs:
Code:
Jul 31 10:24:02 sandcrawler swap_pager: I/O error - pagein failed; blkno 525333,size 4096, error 6 Jul 31 10:24:02 sandcrawler vm_fault: pager read error, pid 1934 (daemon) Jul 31 10:24:02 sandcrawler swap_pager: I/O error - pagein failed; blkno 525332,size 4096, error 6 Jul 31 10:24:02 sandcrawler vm_fault: pager read error, pid 1934 (daemon) Jul 31 10:24:02 sandcrawler kernel: Failed to fully fault in a core file segment at VA 0x800622000 with size 0x23000 to be written at offset 0x4000 for process daemon Jul 31 10:24:02 sandcrawler kernel: Failed to fully fault in a core file segment at VA 0x800622000 with size 0x23000 to be written at offset 0x4000 for process daemon Jul 31 10:24:02 sandcrawler swap_pager: I/O error - pagein failed; blkno 525333,size 4096, error 6 Jul 31 10:24:02 sandcrawler vm_fault: pager read error, pid 1934 (daemon) Jul 31 10:24:02 sandcrawler kernel: Failed to fully fault in a core file segment at VA 0x800dcb000 with size 0x19000 to be written at offset 0x36000 for process daemon Jul 31 10:24:02 sandcrawler kernel: Failed to fully fault in a core file segment at VA 0x800dcb000 with size 0x19000 to be written at offset 0x36000 for process daemon Jul 31 10:24:02 sandcrawler kernel: pid 1934 (daemon), uid 0: exited on signal 11 (core dumped)