I messed some snapshots on source and destination, deleted some... replication stuck
It takes about a day and half to replicate 1.3TB of data over the connection it got
I've terminated replication on source with kill to make changes and restart ( I killed the "/usr/local/bin/ssh -i /data/ssh/replication...." process, or pipe compression process, sometime both - each time I killed one the other closed too )
After it nearly finishes (closest I've seen is 97% but I had to leave before I indeed seen 100), replication fails without explanation and starts anew
Edited replication task so as to replicate to different desto (h4-p0/copy-of-H3) (to preserve what old copy I had in h4-p0/copy-of-h3), it kept restarting
Deleted the old desto h4-p0/copy-of-h3 - it kept repeating
Deleted the old and new destos, reverted replication setting to h4-p0/copy-of-h3, I have not much hope it would complete replication though
I read multiple similar threads but cleaning up snapshots and restarting from scratch worked for them, or it was free space issue, or there were restart involved
I tried everything including cleaning up space, disabling ref/reservation, checked readonly is disabled etc.
Restarting FreeNAS is last resort as they host multiple VMs and not the least, I would lose trained ARC for a year and 3 month uptime.
Could anyone tell why this is happening and/or suggest a solution to make replication work again?
I'm considering zrep as one more last-resort option but would I be able to go on replicating by native/GUI means even if zrep replicates "basis bulk of data" snapshot for me once? Also as these are production systems I'm very hesitant to run anything non-GUI, non-native there at all.
It takes about a day and half to replicate 1.3TB of data over the connection it got
I've terminated replication on source with kill to make changes and restart ( I killed the "/usr/local/bin/ssh -i /data/ssh/replication...." process, or pipe compression process, sometime both - each time I killed one the other closed too )
After it nearly finishes (closest I've seen is 97% but I had to leave before I indeed seen 100), replication fails without explanation and starts anew
Edited replication task so as to replicate to different desto (h4-p0/copy-of-H3) (to preserve what old copy I had in h4-p0/copy-of-h3), it kept restarting
Deleted the old desto h4-p0/copy-of-h3 - it kept repeating
Deleted the old and new destos, reverted replication setting to h4-p0/copy-of-h3, I have not much hope it would complete replication though
I read multiple similar threads but cleaning up snapshots and restarting from scratch worked for them, or it was free space issue, or there were restart involved
I tried everything including cleaning up space, disabling ref/reservation, checked readonly is disabled etc.
Restarting FreeNAS is last resort as they host multiple VMs and not the least, I would lose trained ARC for a year and 3 month uptime.
Could anyone tell why this is happening and/or suggest a solution to make replication work again?
I'm considering zrep as one more last-resort option but would I be able to go on replicating by native/GUI means even if zrep replicates "basis bulk of data" snapshot for me once? Also as these are production systems I'm very hesitant to run anything non-GUI, non-native there at all.
Attachments
Last edited: