Obi-Wan
Dabbler
- Joined
- Dec 28, 2018
- Messages
- 17
My data consists of some that is critical, the stuff that is important to me, and some that I deem non-critical, the stuff that I don't necessarily have any backup of. It would be inconvenient to lose this data, but it wouldn't really affect my life much. (In fact, a part of me thinks it would be kind of healthy to lose it, but that's a different discussion!)
The non-critical data makes up the bulk of everything, and I think this will always be the case. When talking number of bytes, the unimportant data will always be much larger than the important stuff.
Which leads to the question! What are your thoughts on having two pools in this scenario?
The primary reason for doing so is the all-important fact that if any vdev in a pool goes down, the pool goes down with it. If I end up with several vdevs in a pool, but most of the data is non-critical, splitting the data in two pools seems like a reasonable way to mitigate that risk.
This would also allow me to have different levels of redundancy on the two types of data. Say two disks for the important data and a single disk for the unimportant data. Now an argument against this is something I’ve seen a few times on the forum: it is generally not recommended to have large disks in a RAIDZ1 vdev because of the long resilvering time. Part of the reason for doing this would also be to get more data stored on fewer disks for the unimportant stuff, if it is really not recommended to have single-disk redundancy for large disks, meaning I would at least have two disk redundancy for the unimportant data as well, then I would need to have three-disk redundancy on the important data for this point to have any merit. Which is doable of course, but I find that a bit extreme, since I do also have backup.
Another point that I don’t know the value of is if it is somehow easier to backup a whole pool instead of a dataset in a pool. Or of it somehow makes something else easier from a managing point of view.
I think those are most of my thoughts, what are yours?
The non-critical data makes up the bulk of everything, and I think this will always be the case. When talking number of bytes, the unimportant data will always be much larger than the important stuff.
Which leads to the question! What are your thoughts on having two pools in this scenario?
The primary reason for doing so is the all-important fact that if any vdev in a pool goes down, the pool goes down with it. If I end up with several vdevs in a pool, but most of the data is non-critical, splitting the data in two pools seems like a reasonable way to mitigate that risk.
This would also allow me to have different levels of redundancy on the two types of data. Say two disks for the important data and a single disk for the unimportant data. Now an argument against this is something I’ve seen a few times on the forum: it is generally not recommended to have large disks in a RAIDZ1 vdev because of the long resilvering time. Part of the reason for doing this would also be to get more data stored on fewer disks for the unimportant stuff, if it is really not recommended to have single-disk redundancy for large disks, meaning I would at least have two disk redundancy for the unimportant data as well, then I would need to have three-disk redundancy on the important data for this point to have any merit. Which is doable of course, but I find that a bit extreme, since I do also have backup.
Another point that I don’t know the value of is if it is somehow easier to backup a whole pool instead of a dataset in a pool. Or of it somehow makes something else easier from a managing point of view.
I think those are most of my thoughts, what are yours?