mediahound
Dabbler
- Joined
- Mar 11, 2013
- Messages
- 15
I'm the guy who posted in introductions who needs to eventually build a 200tb-1petabyte array. (unique data, before mirroring/backups) I'm not a sysadmin and am a total newbie who might even be using terms wrong but I keep reading and taking notes trying to understand.
I'm currently exploring hardware options trying to get a better understanding of different possible ways to do it because all that storage does not need to be on a single NAS unit (that I can tell/unless I misunderstand anyways/see my intro post for more info) and i'm trying to understand exactly how far a person can reasonably stretch a single NAS box in terms of ultimate storage. For comparison i'm speccing out boxes that max at 32gigs, 64gigs and 128gigs just to compare with a guesstimated per-drive-head cost of about $40-60 added on the server end to whatever each drive costs. (USB or SATA) Although i'm not any cheaper than Backblaze storage pods, theyre running RAID6 (which I dont trust due to silent data corruption) whereas i'm primarily interested in ZFS to prevent exactly that, which means building around it's extreme RAM requirements and such by design. The primary bottleneck therefore seems to be - how much RAM you can practically stick on the motherboard so far.
I am assuming (correct me if i'm wrong) that you can network together multiple NAS boxes (and essentially sum their RAM together) to get the total storage and performance needed? Ie - if nothing else, three boxes, with 32gigs ram each, and 32terabytes apiece should perform well and be treated as a single volume if properly configured? Or do I have this completely wrong? :P If i'm right it eases some of the issue, since costs-per-connected-drive go up noticibly when you have to use 16gig modules and server class motherboards. 32gig boards are consumer level and the i3's low enough wattage that running several isn't that big of a problem to get the desired number of supported drives.
I've often seen a rule of thumb of 1gig ram recommended per terabyte of drive space, I can fully understand that applying at lower levels but i'm wondering if it still holds just as true when you start maxing out motherboard ram slots? Are there any real world examples to go buy of people with 32gigs or more of RAM, and total drive storage in excess of 32 terabytes to act as rules of thumb?
How directly does the RAM-to-drivespace requirement affect performance, and what does affecting performance mean? If I have a somewhat slower sustained bandwidth (say 15MB/sec typical constant access) is that the issue, or is it more of an IOPS limitation, and does performance degrade linearily lacking RAM or does it collapse in some kind of catastrophic uselessness? (like drive thrash dropping performance to 1%) Or is the part that suffers the necessary background housekeeping (like scrubbing) which ZFS normally needs to do at some minimum expected performance level?
Is there any way to offload some of the RAM requirement onto SSD (in the same way deduplication tables can be offloaded to SSD, note I dont understand what it's doing with all that RAM either) to more readily deal with pretty darn large arrays on a single NAS?
I'm currently exploring hardware options trying to get a better understanding of different possible ways to do it because all that storage does not need to be on a single NAS unit (that I can tell/unless I misunderstand anyways/see my intro post for more info) and i'm trying to understand exactly how far a person can reasonably stretch a single NAS box in terms of ultimate storage. For comparison i'm speccing out boxes that max at 32gigs, 64gigs and 128gigs just to compare with a guesstimated per-drive-head cost of about $40-60 added on the server end to whatever each drive costs. (USB or SATA) Although i'm not any cheaper than Backblaze storage pods, theyre running RAID6 (which I dont trust due to silent data corruption) whereas i'm primarily interested in ZFS to prevent exactly that, which means building around it's extreme RAM requirements and such by design. The primary bottleneck therefore seems to be - how much RAM you can practically stick on the motherboard so far.
I am assuming (correct me if i'm wrong) that you can network together multiple NAS boxes (and essentially sum their RAM together) to get the total storage and performance needed? Ie - if nothing else, three boxes, with 32gigs ram each, and 32terabytes apiece should perform well and be treated as a single volume if properly configured? Or do I have this completely wrong? :P If i'm right it eases some of the issue, since costs-per-connected-drive go up noticibly when you have to use 16gig modules and server class motherboards. 32gig boards are consumer level and the i3's low enough wattage that running several isn't that big of a problem to get the desired number of supported drives.
I've often seen a rule of thumb of 1gig ram recommended per terabyte of drive space, I can fully understand that applying at lower levels but i'm wondering if it still holds just as true when you start maxing out motherboard ram slots? Are there any real world examples to go buy of people with 32gigs or more of RAM, and total drive storage in excess of 32 terabytes to act as rules of thumb?
How directly does the RAM-to-drivespace requirement affect performance, and what does affecting performance mean? If I have a somewhat slower sustained bandwidth (say 15MB/sec typical constant access) is that the issue, or is it more of an IOPS limitation, and does performance degrade linearily lacking RAM or does it collapse in some kind of catastrophic uselessness? (like drive thrash dropping performance to 1%) Or is the part that suffers the necessary background housekeeping (like scrubbing) which ZFS normally needs to do at some minimum expected performance level?
Is there any way to offload some of the RAM requirement onto SSD (in the same way deduplication tables can be offloaded to SSD, note I dont understand what it's doing with all that RAM either) to more readily deal with pretty darn large arrays on a single NAS?