Help & Documentation

Learn how to use Quick Sizer and understand the sizing calculations.

Getting Started

Quick Sizer calculates the net writable storage available in a Cloudian HyperStore cluster. Enter your hardware configuration and choose a data protection policy — the results update instantly as you type.

The result shown is the capacity HyperStore will let you actually write to, after accounting for filesystem formatting, your chosen protection policy, and the built-in stop-write safety threshold.

No login is required. When you are happy with your configuration, use the Save & Get Shareable Link button to generate a permanent URL you can share with your team.

Step-by-Step Guide

1Hardware

Enter the physical specifications of your cluster:

Total nodes — the total number of storage nodes across all data centers.
Drives / node — how many data drives are in each node.
Drive size — the capacity of a single drive, in TB or TiB (see TB vs TiB).

The raw capacity hint below the fields updates live so you can verify the total before applying any policy overhead.

2Data Protection

Choose how many data centers participate, then select a protection policy.

Data centers — click 1, 2, or 3. The policy list below filters automatically to only show policies valid for that topology.
Protection policy — also filters to only show policies your node count can actually support. Policies that require more nodes than you have entered are hidden.

If a policy you expect to see is missing, increase the node count in Step 1. Each policy has a minimum node requirement.

3Policy Configuration

Fine-tune the protection parameters:

Replication policies — set the number of full copies per data center.
Erasure Coding policies — pick a preset k+m scheme from the dropdown, or choose Custom k+m… to enter your own values. The dropdown only shows schemes that fit your node count.

Badges below the selector show the selected scheme's fragment count and storage efficiency at a glance.

4Notes & Saving

Optionally add notes — customer name, planned expansion date, assumptions — then click Save & Get Shareable Link. A dialog will appear with a unique URL, a 6-digit share code (for read-only access), and a 6-digit edit code (to update later). See Saving & Sharing for details.

How Capacity Is Calculated

Every result is produced by the same four-step formula applied to your raw hardware numbers:

Raw Capacity

Nodes × Drives/node × Drive size

−3.13%filesystem overhead (formatting & reserved blocks)

After Filesystem

Raw × 96.87%

× RF/EC efficiency (the fraction of capacity devoted to your data vs. protection overhead)

After RF/EC

After filesystem × policy efficiency

× 90% stop-write threshold (HyperStore stops accepting writes at 90% disk fill to protect performance)

Net Usable (Writable)

After RF/EC × 90% — this is the number shown in the results panel

Worked Example

9 nodes · 12 drives/node · 16 TB drives · EC Distributed 3DC (5+4):

Step	Operation	Result
Raw	9 × 12 × 16 TB → TiB	1,571.6 TiB
After filesystem	× (1 − 3.13%)	1,522.4 TiB
After EC (5+4)	× 5/9 = 55.6% efficiency	845.8 TiB
Net usable	× 90% stop-write	761.2 TiB

Protection Policies

Cloudian HyperStore supports five protection policies across one to three data centers. Policies are filtered automatically to only show options compatible with your selected DC count and node count.

Replication — 1 Data Center1 DCMin 3 nodes (RF3)

HyperStore stores N full copies of every object, spread across N different nodes within a single data center.

Efficiency

1/N (e.g. 33% for RF3)

Fault tolerance

Up to N−1 node failures

Best for

Simplicity, low-latency reads

Replication — 2 Data Centers2 DCsMin 3 nodes (RF3)

Copies are split across two sites. You specify how many copies go in DC1 and how many in DC2. The cluster can survive the complete loss of either site as long as copies remain in the other.

Efficiency

1/(DC1 + DC2 copies)

Fault tolerance

Full DC loss (if ≥1 copy in other DC)

Best for

Simple DR, 2-site setups

Erasure Coding — 1 Data Center1 DCMin k+m nodes

An object is split into k data fragments plus m parity fragments, stored across k+m different nodes. Any k fragments can reconstruct the full object. More efficient than replication for large datasets.

Efficiency

k/(k+m) — e.g. 67% for 4+2

Fault tolerance

Any m node failures

Best for

Large capacity, single site

EC Replicated — 2 Data Centers2 DCsMin 2×(k+m) nodes

A complete k+m stripe is independently maintained in each data center — the full EC scheme runs in DC1, and an identical copy runs in DC2. This gives full DC-level redundancy at the cost of doubling the storage overhead compared to single-site EC.

Efficiency

k / (2×(k+m))

Fault tolerance

Full DC loss + m node failures

Best for

Mission-critical, 2-site EC

EC Distributed — 3 Data Centers3 DCsMin 6 nodes

The k+m fragments are spread evenly across three sites — no duplication. Losing an entire DC means losing roughly ⅓ of fragments, which stays below the parity count m, so data remains readable. This gives the same efficiency as single-site EC while tolerating a full DC outage.

Efficiency

k/(k+m) — same as 1DC EC

Fault tolerance

Full DC loss

Best for

Geo-distributed, efficient DR

Only the 5+4 and 7+5 schemes are supported for EC Distributed 3DC. These are the only standard Cloudian presets where (k+m) is divisible by 3 and losing one DC's share of fragments does not exceed the parity count.

Quick Comparison

Policy	DCs	Efficiency	Tolerates
Replication	1	1/N	N−1 node failures
Replication	2	1/(c₁+c₂)	Full DC loss
EC	1	k/(k+m)	Any m node failures
EC Replicated	2	k/(2(k+m))	Full DC + m node failures
EC Distributed	3	k/(k+m)	Full DC loss

Erasure Coding Schemes

An EC scheme is written as k+m, where:

k = number of data fragments (the object is split into k pieces)
m = number of parity fragments (m extra pieces computed from the data for redundancy)
Together k+m fragments are stored, one per node. Any k of them are sufficient to reconstruct the full object.
Storage efficiency = k / (k+m). Higher k relative to m means better efficiency but less fault tolerance.

Cloudian HyperStore requires a minimum of k ≥ 3 and m ≥ 2. The smallest supported scheme is 3+2 (60% efficiency, tolerates 2 node failures).

Preset Schemes

Scheme	Efficiency	Min nodes	Fault tolerance	3DC distributed
3+2	60%	5	2 nodes	—
4+2	67%	6	2 nodes	—
5+2	71%	7	2 nodes	—
5+4	56%	9	4 nodes	✓ Yes
6+2	75%	8	2 nodes	—
6+3	67%	9	3 nodes	—
7+5	58%	12	5 nodes	✓ Yes
8+2	80%	10	2 nodes	—
8+4	67%	12	4 nodes	—
9+3	75%	12	3 nodes	—
10+2	83%	12	2 nodes	—
10+4	71%	14	4 nodes	—
12+4	75%	16	4 nodes	—

Rows highlighted in green are valid for EC Distributed 3DC. Custom k+m values can be entered in the calculator for advanced use cases.

TB vs TiB

Drive manufacturers rate capacity in TB (terabytes), while operating systems often report capacity in TiB (tebibytes). They are not the same:

Unit	Definition	Bytes
1 TB	10¹² — decimal, as labeled by drive vendors	1,000,000,000,000
1 TiB	2⁴⁰ — binary, as reported by most OSes	1,099,511,627,776

1 TB ≈ 0.909 TiB. A "16 TB" drive actually holds about 14.55 TiB as reported by Linux or Windows.

In Quick Sizer, the Drive size unit selector sets how your input is interpreted. The TB / TiB toggle in the results panel independently controls how the output numbers are displayed — you can enter drives in TB and view results in TiB, or vice versa.

Open Quick Sizer

Help & Documentation

Getting Started

Step-by-Step Guide

How Capacity Is Calculated

Worked Example

Protection Policies

Quick Comparison

Erasure Coding Schemes

Preset Schemes

Saving & Sharing

Accessing a Saved Configuration

TB vs TiB