I have a brand-new build of FreeNAS 8.3.1-RC1-x64 on brand-new hardware. I was able to get the box up and working without much difficulty. I am booting from a 32GB USB key. I was able to get my drive pool built and my CIFS share working. The NAS volume is 15.2TB, built with ZFS. The box is very stable when it is idle. I run into problems when I begin to copy a large amount of data from my PC to the NAS. About 2 to 4 hours into the copy, at an average transfer rate of about 75MB per sec, the box crashes with this error:
Fatal trap 1: privileged instruction fault while in kernel mode
cpuid = 3; apic id = 13
instruction pointer = 0x20:0xffffffff805a188a
stack pointer = 0x28:0xffffff847fdb8510
frame pointer = 0x28:0xffffff847fdb8520
code segment = base 0x0, limit 0xfffff, type 0x1b
= DPL 0, pres 1, long 1, def32 0, gran 1
processor eflags = interrupt enabled, resume, IOPL = 0
current process = 3160 (smbd)
[thread pid 3160 tid 100129 ]
Stopped at _sleep+0x32a:
db>
This error happens every time I try to make the copy. I have read forums for the past 3 days and I have not been able to find a resolution.
My hardware build is as follows:
ASUS M5A97 LE R2.0 AM3+ ATX motherboard
AMD FX-4100 Zambezi 3.6GHz Socket AM3+ Quad-Core Processor –FD4100WMGUSBX
CORSAIR Vengeance LP 16GB (2 x 8GB) 240-Pin DDR3 SDRAM
HighPoint Rocket 620 PCI-Express 2.0 SATA III Controller Card
ASUS EAH6450 Radeon HD 6450 1GB 64-Bit DDR3 PCI Express 2.1 x16 HDCP
4 – Toshiba DT01ACA300 3TB 7200 RPM SATA Hard Drive
4 – Seagate Barracuda 7200 ST3000DM001 3TB 7200 RPM SATA Hard Drive
8 – Silverstone SST-CP07 SATA III Cables
ASUS DRW-24B1ST SATA 24X DVD Burner
CORSAIR TX650 V2 650W PS
NZXT Tempest 410 Elite ATX Mid Tower Case
Based on what I have read, I ran a memory test with memtest86+, which showed no errors, and I ran an extensive SMART test with these results:
Removing stale files from /var/preserve:
Cleaning out old system announcements:
Backup passwd and group files:
no /var/backups/master.passwd.bak
no /var/backups/group.bak
Verifying group file syntax:
/etc/group is fine
Backing up package db directory:
Disk status:
Filesystem Size Used Avail Capacity Mounted on
/dev/ufs/FreeNASs1a 926M 382M 469M 45% /
devfs 1.0k 1.0k 0B 100% /dev
/dev/md0 4.6M 3.2M 979k 77% /etc
/dev/md1 823k 2.0k 756k 0% /mnt
/dev/md2 149M 16M 121M 12% /var
/dev/ufs/FreeNASs4 19M 580k 17M 3% /data
NAS 15T 121G 15T 1% /mnt/NAS
Last dump(s) done (Dump '>' file systems):
Checking status of zfs pools:
NAME SIZE ALLOC FREE CAP DEDUP HEALTH ALTROOT
NAS 21.8T 171G 21.6T 0% 1.00x ONLINE /mnt
pool: NAS
state: ONLINE
status: One or more devices has experienced an unrecoverable error. An
attempt was made to correct the error. Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
using 'zpool clear' or replace the device with 'zpool replace'.
see: http://www.sun.com/msg/ZFS-8000-9P
scan: resilvered 15.8G in 0h5m with 0 errors on Fri Mar 15 14:22:09 2013
config:
NAME STATE READ WRITE CKSUM
NAS ONLINE 0 0 0
raidz2-0 ONLINE 0 0 0
gptid/7d550643-87c2-11e2-b5a4-50465daa16d0 ONLINE 0 0 2
gptid/7dfea425-87c2-11e2-b5a4-50465daa16d0 ONLINE 0 0 2
gptid/8f7edac8-8db5-11e2-b592-50465daa16d0 ONLINE 0 0 0
gptid/7ee95569-87c2-11e2-b5a4-50465daa16d0 ONLINE 0 0 0
gptid/7f70af60-87c2-11e2-b5a4-50465daa16d0 ONLINE 0 0 0
gptid/7ff6d47b-87c2-11e2-b5a4-50465daa16d0 ONLINE 0 0 0
gptid/8090b10e-87c2-11e2-b5a4-50465daa16d0 ONLINE 0 0 0
gptid/813472f1-87c2-11e2-b5a4-50465daa16d0 ONLINE 0 0 0
errors: No known data errors
Checking status of ATA raid partitions:
Checking status of gmirror(8) devices:
Checking status of graid3(8) devices:
Checking status of gstripe(8) devices:
Network interface status:
Name Mtu Network Address Ipkts Ierrs Idrop Opkts Oerrs Coll
re0 1500 <Link#1> 50:46:5d:aa:16:d0 55025 0 0 59003 0 0
re0 1500 192.168.0.0 192.168.0.20 53285 - - 58160 - -
usbus 0 <Link#2> 0 0 0 0 0 0
usbus 0 <Link#3> 0 0 0 0 0 0
usbus 0 <Link#4> 0 0 0 0 0 0
usbus 0 <Link#5> 0 0 0 0 0 0
usbus 0 <Link#6> 0 0 0 0 0 0
usbus 0 <Link#7> 0 0 0 0 0 0
usbus 0 <Link#8> 0 0 0 0 0 0
lo0 16384 <Link#9> 403027 0 0 403029 0 0
lo0 16384 fe80::1%lo0 fe80::1 0 - - 0 - -
lo0 16384 localhost ::1 4 - - 4 - -
lo0 16384 your-net localhost 403026 - - 403025 - -
Security check:
(output mailed separately)
Checking status of 3ware RAID controllers:
Alarms (most recent first):
+++ /var/log/3ware_raid_alarms.today 2013-03-16 03:01:00.000000000 -0700
@@ -0,0 +1 @@
+
-- End of daily output --
I was having very slow transfers and made the following changes to the CIFS Auxiliary parameters during the build of the NAS. I made these changes before I tried the large data copies, so I have no data to tell whether the fatal traps are related to them or not. These changes, along with moving the network from 100Mb to 1000Mb, took me from a transfer rate of 8 to 15MB/sec to a transfer rate of 70 to 100MB/sec:
socket options = TCP_NODELAY IPTOS_LOWDELAY SO_KEEPALIVE SO_RCVBUF=2097152 SO_SNDBUF=2097152
read raw = yes
write raw = yes
max xmit = 65536
getwd cache = yes
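As a rough sanity check on the transfer rates quoted above, here is a small sketch of the arithmetic. It assumes the link speeds are in megabits per second and the transfer rates are in megabytes per second, and it ignores all protocol overhead. The function name is made up for illustration and is not part of FreeNAS or Samba.

```python
def link_ceiling_mb_per_s(link_mbit_per_s: float) -> float:
    """Theoretical ceiling in megabytes/s for a link rated in megabits/s.

    Assumption: 8 bits per byte and no Ethernet/TCP/SMB overhead,
    so real-world throughput will always be lower than this.
    """
    return link_mbit_per_s / 8.0


if __name__ == "__main__":
    # The two network speeds mentioned in the post.
    for link in (100, 1000):
        ceiling = link_ceiling_mb_per_s(link)
        print(f"{link} Mbit/s link: at most {ceiling:.1f} MB/s")
```

Under those assumptions the old link tops out at 12.5 MB/s and the new one at 125 MB/s, so the speed-up is at least consistent with the network upgrade. It does not show how much of the gain came from the CIFS parameters.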
I appreciate any assistance you can offer. Please be patient, as I am very new to Linux and may need to be spoon-fed a bit.