| dc.contributor.author | Li, Haoyuan | |
| dc.contributor.author | Ghodsi, Ali | |
| dc.contributor.author | Shenker, Scott | |
| dc.contributor.author | Stoica, Ion | |
| dc.contributor.author | Zaharia, Matei A. | |
| dc.date.accessioned | 2016-02-04T00:52:59Z | |
| dc.date.available | 2016-02-04T00:52:59Z | |
| dc.date.issued | 2014-11 | |
| dc.identifier.isbn | 9781450332521 | |
| dc.identifier.uri | http://hdl.handle.net/1721.1/101090 | |
| dc.description.abstract | Tachyon is a distributed file system enabling reliable data sharing at memory speed across cluster computing frameworks. While caching today improves read workloads, writes are either network or disk bound, as replication is used for fault tolerance. Tachyon eliminates this bottleneck by pushing lineage, a well-known technique, into the storage layer. The key challenge in making a long-running lineage-based storage system is timely data recovery in case of failures. Tachyon addresses this issue by introducing a checkpointing algorithm that guarantees bounded recovery cost, together with resource allocation strategies for recomputation under commonly used resource schedulers. Our evaluation shows that Tachyon outperforms in-memory HDFS by 110x for writes. It also improves the end-to-end latency of a realistic workflow by 4x. Tachyon is open source and is deployed at multiple companies. | en_US |
| dc.description.sponsorship | National Science Foundation (U.S.) (CISE Expeditions Award CCF-1139158) | en_US |
| dc.description.sponsorship | Lawrence Berkeley National Laboratory (Award 7076018) | en_US |
| dc.description.sponsorship | United States. Defense Advanced Research Projects Agency (XData Award FA8750-12-2-0331) | en_US |
| dc.language.iso | en_US | |
| dc.publisher | Association for Computing Machinery (ACM) | en_US |
| dc.relation.isversionof | http://dx.doi.org/10.1145/2670979.2670985 | en_US |
| dc.rights | Creative Commons Attribution-Noncommercial-Share Alike | en_US |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
| dc.source | MIT web domain | en_US |
| dc.title | Tachyon: Reliable, Memory Speed Storage for Cluster Computing Frameworks | en_US |
| dc.type | Article | en_US |
| dc.identifier.citation | Haoyuan Li, Ali Ghodsi, Matei Zaharia, Scott Shenker, and Ion Stoica. 2014. Tachyon: Reliable, Memory Speed Storage for Cluster Computing Frameworks. In Proceedings of the ACM Symposium on Cloud Computing (SOCC '14). ACM, New York, NY, USA, Article 6, 15 pages. | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
| dc.contributor.mitauthor | Zaharia, Matei A. | en_US |
| dc.relation.journal | Proceedings of the ACM Symposium on Cloud Computing (SOCC '14) | en_US |
| dc.eprint.version | Author's final manuscript | en_US |
| dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
| eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
| dspace.orderedauthors | Li, Haoyuan; Ghodsi, Ali; Zaharia, Matei; Shenker, Scott; Stoica, Ion | en_US |
| dc.identifier.orcid | https://orcid.org/0000-0002-7547-7204 | |
| mit.license | OPEN_ACCESS_POLICY | en_US |