dc.contributor.authorCornejo Collado, Alex
dc.contributor.authorLynch, Nancy Ann
dc.contributor.authorSastry, Srikanth
dc.date.accessioned2014-09-25T19:28:28Z
dc.date.available2014-09-25T19:28:28Z
dc.date.issued2012-07
dc.identifier.isbn9781450314503
dc.identifier.urihttp://hdl.handle.net/1721.1/90357
dc.description.abstractFailure detectors - oracles that provide information about process crashes - are an important abstraction for crash tolerance in distributed systems. Although current failure-detector theory provides great generality and expressiveness, it also poses significant challenges in developing a robust hierarchy of failure detectors. We address some of these challenges by proposing a variant of failure detectors called asynchronous failure detectors and an associated modeling framework. Unlike the traditional failure-detector framework, our framework eschews real time completely. We show that asynchronous failure detectors are sufficiently expressive to include several popular failure detectors. Additionally, we show that asynchronous failure detectors satisfy many desirable properties: they are self-implementable, guarantee that stronger asynchronous failure detectors solve more problems, and ensure that their outputs encode no information other than process crashes. We introduce the notion of a failure detector being representative of a problem to capture the idea that some problems encode the same information about process crashes as their weakest failure detectors do. We show that a large class of problems, called finite problems, do not have representative failure detectors.en_US
dc.description.sponsorshipNational Science Foundation (U.S.) (Science and Technology Center, grant agreement CCF-0939370)en_US
dc.description.sponsorshipNational Science Foundation (U.S.) (NSF Award Number CCF-0726514)en_US
dc.description.sponsorshipNational Science Foundation (U.S.) (NSF Award Number CCF-0937274)en_US
dc.description.sponsorshipUnited States. Air Force Office of Scientific Research (AFOSR Award Number FA9550-08-1-0159)en_US
dc.description.sponsorshipNational Science Foundation (U.S.) (NSF Award Number CNS-1035199)en_US
dc.language.isoen_US
dc.publisherAssociation for Computing Machineryen_US
dc.relation.isversionofhttp://dx.doi.org/10.1145/2332432.2332482en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourceMIT web domainen_US
dc.titleAsynchronous failure detectorsen_US
dc.typeArticleen_US
dc.identifier.citationCornejo, Alejandro, Nancy Lynch, and Srikanth Sastry. “Asynchronous Failure Detectors.” Proceedings of the 2012 ACM Symposium on Principles of Distributed Computing - PODC ’12 (2012), July 16–18, 2012, Madeira, Portugal. ACM New York, NY, USA. p.243-252.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.contributor.mitauthorCornejo Collado, Alexen_US
dc.contributor.mitauthorLynch, Nancy Annen_US
dc.contributor.mitauthorSastry, Srikanthen_US
dc.relation.journalProceedings of the 2012 ACM symposium on Principles of distributed computing - PODC '12en_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dspace.orderedauthorsCornejo, Alejandro; Lynch, Nancy; Sastry, Srikanthen_US
dc.identifier.orcidhttps://orcid.org/0000-0003-3045-265X
dspace.mitauthor.errortrue
mit.licenseOPEN_ACCESS_POLICYen_US
mit.metadata.statusComplete

