Hi there,
I've had this server up and running a few years now, going through regular updates and replacing failing drives, running mostly as an SMB share and hosting my Plex server. I have a 5 disk RAIDZ2 setup.
Recently I had a drive failure. So I turned it off from the web console, swapped in a replacement drive and booted it up. WebUI never comes back up so I go to my server room (closet) and check. There's nothing but a spew of errors (see below). No commands do anything and an occasional line swings through about inability to boot middleware or about the ethernet going up and down. It doesn't look like it's loaded my storage pool. I can't get the thing to shutdown. So I nervously hardbooted it, hoping the storage wasn't up so I don't frag volume. It boots back up and same errors but eventually does get to the usual 1-12 display. It never gets back online (no web ui) so i shut it down safe this time since it lets me (typing 11 then y). I pull the new disk thinking that's screwing things up, plug in the old failed volume. Same behavior.
Failed to fully fault in a core file segment at VA 080064e00 with pid 302 (Python 3.6), uid -: exited on signal 11 (core dumped)
VM_fault : Pager read error. pid 384 (python 3.6)
This repeats, with the pid's generally increasing the longer it runs.
I've SSH'd into it but I'm reaching the edge of my experience (I'm a windows sysadmin, my linux familiarity is pretty low).
I've got a backup of about 80% of my media luckily (though stupidly not the other 20%) so hopefully my ZFS volume is still healthy somewhere in there. I'd appreciate any tips, including how to get some useful logs from SSH (I've copy pasted a chunk of what i could output into a txt file attached)
Thanks.
I've had this server up and running a few years now, going through regular updates and replacing failing drives, running mostly as an SMB share and hosting my Plex server. I have a 5 disk RAIDZ2 setup.
Recently I had a drive failure. So I turned it off from the web console, swapped in a replacement drive and booted it up. WebUI never comes back up so I go to my server room (closet) and check. There's nothing but a spew of errors (see below). No commands do anything and an occasional line swings through about inability to boot middleware or about the ethernet going up and down. It doesn't look like it's loaded my storage pool. I can't get the thing to shutdown. So I nervously hardbooted it, hoping the storage wasn't up so I don't frag volume. It boots back up and same errors but eventually does get to the usual 1-12 display. It never gets back online (no web ui) so i shut it down safe this time since it lets me (typing 11 then y). I pull the new disk thinking that's screwing things up, plug in the old failed volume. Same behavior.
Failed to fully fault in a core file segment at VA 080064e00 with pid 302 (Python 3.6), uid -: exited on signal 11 (core dumped)
VM_fault : Pager read error. pid 384 (python 3.6)
This repeats, with the pid's generally increasing the longer it runs.
I've SSH'd into it but I'm reaching the edge of my experience (I'm a windows sysadmin, my linux familiarity is pretty low).
I've got a backup of about 80% of my media luckily (though stupidly not the other 20%) so hopefully my ZFS volume is still healthy somewhere in there. I'd appreciate any tips, including how to get some useful logs from SSH (I've copy pasted a chunk of what i could output into a txt file attached)
Thanks.
Attachments
Last edited: