HertogArjan
Dabbler
- Joined
- Oct 16, 2016
- Messages
- 30
Hi everyone,
Sorry to skip the introduction, but I have a frustrating problem that needs fixing. It took me a while to get in here: signing up for the forums through Chrome somehow did not work, and I needed a suggestion from the IRC channel to use another browser before I finally got in. Anyway, I have serious problems with my new server configuration. I already opened an issue on the bug tracking system, see https://bugs.freenas.org/issues/17240. I actually opened two, because at first I thought I had two different problems, so please look at this one as well: https://bugs.freenas.org/issues/17140. I will give a quick summary of what has happened so far, but for the errors themselves have a look over there and at the files I uploaded.
To summarize, I recently moved my server from my old Dell PC to new hardware, always exciting. The new hardware is a Supermicro X10SL7-F motherboard, an Intel Core i3-4170 and 4 x Crucial 8GB ECC CT102472BD160B. While I was at it I also expanded my 3-disk RAID-Z1 pool to a 6-disk RAID-Z2 pool. I installed everything without problems, booted into my original FreeNAS OS, and all seemed fine at first.
Then the trouble started. The system randomly starts showing hundreds of errors, which always ends with it crashing and rebooting. The error is not always the same: sometimes it is a kernel panic, sometimes CAM errors, and sometimes it reports problems with the IOC, which is the I/O controller I think. While I was troubleshooting, my motherboard at one point would not boot anymore and I had to ask for a replacement. I am not sure whether that was related to the original problem; in any case, I have the replacement motherboard now and the issues are exactly the same.
Here is what I have tried so far. I flashed several different firmware versions on the SAS controller, always the IT version (16.0.1, 19, 20, 20.4). I booted the system with some disks excluded to check for faulty disks or cables, but that did not help. I booted the system without hyperthreading, which did not help either. I changed some of the settings in the SAS controller, without effect. I ran a memtest, which found no problems. The errors do seem to appear when the system tries to open a volume. Not one particular volume (I have two), but either of them. I have done multiple fresh installs of FreeNAS, and the problems seem to start as soon as I upload my configuration or try to import the volume via the storage page.
Have a look at both issues in the bug tracker, as I might not have mentioned everything that has happened here. Does anyone have any ideas?
Thanks in advance,
Jelle Jan
EDIT: So I have two pools: one RAID-Z2 with all WD30EFRX disks, and one mirror with a 2 TB WD Green disk and a 2 TB Seagate Barracuda disk. If I connect only the mirror to the system, via either the SATA or the SAS ports, the system seems to have no problems, but as soon as I connect the other pool I get the errors. The system says IOC fault 0x40002622 and resetting, then a LOT of messages saying 'Warning: io_cmds_active is out of sync - resynching to 0'. I attached a RAR archive containing a recording where you can see the error happening. Skip to about 35 seconds. Are my WD Red 3 TB disks somehow incompatible with the LSI 2308 controller?
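In case it helps anyone who wants to compare runs, here is a minimal sketch of how the two messages above could be tallied from a saved console log. The patterns are taken only from the wording quoted in this post, so the exact text in a real log may differ, and the function name `summarize` and the file name `console.log` are just placeholders, not anything FreeNAS provides.

```python
import re
from collections import Counter

# Patterns based only on the messages quoted in this post (assumption:
# a real log uses the same wording).
IOC_FAULT = re.compile(r"IOC fault (0x[0-9a-fA-F]+)", re.IGNORECASE)
OUT_OF_SYNC = re.compile(r"io_cmds_active is out of sync")


def summarize(lines):
    """Count IOC fault and out-of-sync messages in an iterable of log lines.

    Returns (counts, fault_codes): how often each message kind occurred,
    and how often each distinct IOC fault code was seen.
    """
    counts = Counter()
    fault_codes = Counter()
    for line in lines:
        match = IOC_FAULT.search(line)
        if match:
            counts["ioc_fault"] += 1
            fault_codes[match.group(1).lower()] += 1
        if OUT_OF_SYNC.search(line):
            counts["out_of_sync"] += 1
    return counts, fault_codes


if __name__ == "__main__":
    import sys

    # Hypothetical usage: python summarize.py console.log
    path = sys.argv[1] if len(sys.argv) > 1 else "console.log"
    with open(path, errors="replace") as handle:
        counts, fault_codes = summarize(handle)
    print(dict(counts))
    print(dict(fault_codes))
```

Running it on logs captured with different disks attached would show whether the same fault code comes back every time or whether it changes with the configuration.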