Hello,
It's my first post on the FreeNAS so I would like to say welcome to everyone !
I am running into issue that is very strange and I can not find similar issue on other topics (there are few topics that tells about CAM status and similar errors, but this time it's different as I was using two different boxes and 3 drives).
I had one pool which had three same hard drives in a mirror-0 configuration:
Drive A: WD2003FZEX
Drive B: WD2002FAEX
Drive C: WD2002FAEX
Those drives have exactly the same geometry and always one drive was unplugged in case pool goes dead. Every few weeks I was replacing one of the drives with the unplugged ones.
All of them were running on one system (miniITX with non-ECC memory), where I noticed that Drive B went unhealthy due to errors. In a dmesg I've spotted the following information for Drive B (commands are shortened as I don't have the original ones and using just what I copied/pasted):
CAM status: Uncorrectable parity/CRC error
Retrying command
WRITE_FPDMA_QUEUED. ACB: [...]
At this state I've decided to unplug Drive B and put Drive C so it will resilver from Drive A.
It did not as I was having the following errors similar to the one below from Drive A:
CAM status: CCB request terminated by the host
Retrying command
Read(10). CDB: [...]
At this stage Drive C went crazy and was marked as unhealthy.
Having only one copy of my data on Drive A, I've decided to use rsync to copy all my important data on the external USB drive. Everything went smooth without any errors in a dmesg.
Once I had copy I've decided to move the healthy Drive A to another system (very stable workstation ultra 40m2 with 16GB ECC memory).
The HDD's are running in a SATA 1.5 Gbit/s mode as the system is not supporting 3 nor 6Gbit/s.
I've wiped out Drive C and connected to the system. Created another pool (bkpool) and started moving data using send/recv of the snapshot from Drive A.
System started reporting straight away lot's of errors:
CAM status: CCB request terminated by the host
Retrying command
Read(10). CDB: [...]
I've canceled send/recv command and created manually all the zfs filesystems on Drive C. Using rsync I was able to copy all of the data to Drive C from Drive A without single error in dmesg.
Question: What is wrong and why when using rsync everything works as expected without errors and using zfs send/recv or adding disk to mirror creates such errors?
I thought it's cable/controller issue but separate box made me thing there could be a bug in freenas??
Freenas 9.1-STABLE
It's my first post on the FreeNAS so I would like to say welcome to everyone !
I am running into issue that is very strange and I can not find similar issue on other topics (there are few topics that tells about CAM status and similar errors, but this time it's different as I was using two different boxes and 3 drives).
I had one pool which had three same hard drives in a mirror-0 configuration:
Drive A: WD2003FZEX
Drive B: WD2002FAEX
Drive C: WD2002FAEX
Those drives have exactly the same geometry and always one drive was unplugged in case pool goes dead. Every few weeks I was replacing one of the drives with the unplugged ones.
All of them were running on one system (miniITX with non-ECC memory), where I noticed that Drive B went unhealthy due to errors. In a dmesg I've spotted the following information for Drive B (commands are shortened as I don't have the original ones and using just what I copied/pasted):
CAM status: Uncorrectable parity/CRC error
Retrying command
WRITE_FPDMA_QUEUED. ACB: [...]
At this state I've decided to unplug Drive B and put Drive C so it will resilver from Drive A.
It did not as I was having the following errors similar to the one below from Drive A:
CAM status: CCB request terminated by the host
Retrying command
Read(10). CDB: [...]
At this stage Drive C went crazy and was marked as unhealthy.
Having only one copy of my data on Drive A, I've decided to use rsync to copy all my important data on the external USB drive. Everything went smooth without any errors in a dmesg.
Once I had copy I've decided to move the healthy Drive A to another system (very stable workstation ultra 40m2 with 16GB ECC memory).
The HDD's are running in a SATA 1.5 Gbit/s mode as the system is not supporting 3 nor 6Gbit/s.
I've wiped out Drive C and connected to the system. Created another pool (bkpool) and started moving data using send/recv of the snapshot from Drive A.
System started reporting straight away lot's of errors:
CAM status: CCB request terminated by the host
Retrying command
Read(10). CDB: [...]
I've canceled send/recv command and created manually all the zfs filesystems on Drive C. Using rsync I was able to copy all of the data to Drive C from Drive A without single error in dmesg.
Question: What is wrong and why when using rsync everything works as expected without errors and using zfs send/recv or adding disk to mirror creates such errors?
I thought it's cable/controller issue but separate box made me thing there could be a bug in freenas??
Freenas 9.1-STABLE