The Storage Versus Repair-Bandwidth Trade-off for Clustered Storage Systems

Prakash, N; Abdrashitov, Vitaly; Medard, Muriel

Notice

This is not the latest version of this item. The latest version can be found at:https://dspace.mit.edu/handle/1721.1/134886.2

Show simple item record

dc.contributor.author	Prakash, N
dc.contributor.author	Abdrashitov, Vitaly
dc.contributor.author	Medard, Muriel
dc.date.accessioned	2021-10-27T20:09:40Z
dc.date.available	2021-10-27T20:09:40Z
dc.date.issued	2018
dc.identifier.uri	https://hdl.handle.net/1721.1/134886
dc.description.abstract	© 1963-2012 IEEE. We study a generalization of the setting of regenerating codes, motivated by applications to storage systems consisting of clusters of storage nodes. There are $n$ clusters in total, with $m$ nodes per cluster. A data file is coded and stored across the $mn$ nodes, with each node storing α symbols. For availability of data, we require that the file be retrievable by downloading the entire content from any subset of $k$ clusters. Nodes represent entities that can fail. We distinguish between intra-cluster and inter-cluster bandwidth (BW) costs during node repair. Node-repair in a cluster is accomplished by downloading β symbols each from any set of $d$ other clusters, dubbed remote helper clusters, and also up to α symbols each from any set of $\ell $ surviving nodes, dubbed local helper nodes, in the host cluster. We first identify the optimal trade-off between storage-overhead and inter-cluster repair-bandwidth under functional repair, and also present optimal exact-repair code constructions for a class of parameters. The new trade-off is strictly better than what is achievable via space-sharing existing coding solutions, whenever ℓ > 0$. We then obtain sharp lower bounds on the necessary intra-cluster repair BW to achieve optimal trade-off. Under functional repair, random linear network codes (RLNCs) simultaneously optimize usage of both inter- and intra-cluster repair BW; simulation results based on RLNCs suggest optimality of the bounds on intra-cluster repair-bandwidth. Our bounds reveal the interesting fact that, while it is beneficial to increase the number of local helper nodes ℓ $ in order to improve the storage-vs-inter-cluster-repair-BW trade-off, increasing ℓ $ not only increases intra-cluster BW in the host-cluster, but also increases the intra-cluster BW in the remote helper clusters. We also analyze resilience of the clustered storage system against passive eavesdropping by providing file-size bounds and optimal code constructions.
dc.language.iso	en
dc.publisher	Institute of Electrical and Electronics Engineers (IEEE)
dc.relation.isversionof	10.1109/TIT.2018.2806342
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/4.0/
dc.source	arXiv
dc.title	The Storage Versus Repair-Bandwidth Trade-off for Clustered Storage Systems
dc.type	Article
dc.relation.journal	IEEE Transactions on Information Theory
dc.eprint.version	Author's final manuscript
dc.type.uri	http://purl.org/eprint/type/JournalArticle
eprint.status	http://purl.org/eprint/status/PeerReviewed
dc.date.updated	2019-06-21T12:57:50Z
dspace.orderedauthors	Prakash, N; Abdrashitov, V; Medard, M
dspace.date.submission	2019-06-21T12:57:52Z
mit.journal.volume	64
mit.journal.issue	8
mit.metadata.status	Authority Work and Publication Information Needed

Files in this item

Name:: 1701.04909.pdf
Size:: 2.811Mb
Format:: PDF
Description:: Accepted version

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record

Version	Item	Date	Summary
2	1721.1/134886.2	2022-07-06T19:48:37Z	Metadata changed: Verified or entered author name and department authority metadata.
1	1721.1/134886*	2021-10-27T20:09:40Z

*Selected version

DSpace@MIT

Notice

The Storage Versus Repair-Bandwidth Trade-off for Clustered Storage Systems

Files in this item

This item appears in the following Collection(s)

Version History