Show simple item record

dc.contributor.authorShah, Deval
dc.contributor.authorXue, Zi Yu
dc.contributor.authorPattabiraman, Karthik
dc.contributor.authorAamodt, Tor
dc.date.accessioned2023-11-01T17:24:24Z
dc.date.available2023-11-01T17:24:24Z
dc.identifier.issn2378-962X
dc.identifier.urihttps://hdl.handle.net/1721.1/152618
dc.description.abstractMotion planning is a computationally intensive and well-studied problem in autonomous robots. However, motion planning hardware accelerators (MPA) must be soft-error resilient for deployment in safety-critical applications, and blanket application of traditional mitigation techniques is ill-suited due to cost, power, and performance overheads. We propose Collision Exposure Factor (CEF), a novel metric to assess the failure vulnerability of circuits processing spatial relationships, including motion planning. CEF is based on the insight that the safety violation probability increases with the surface area of the physical space exposed by a bit-flip. We evaluate CEF on four MPAs. We demonstrate empirically that CEF is correlated with safety violation probability, and that CEF-aware selective error mitigation provides 12.3×, 9.6×, and 4.2× lower dangerous Failures-In-Time rate on average for the same amount of protected memory compared to uniform, bit-position, and access-frequency-aware selection of critical data. Furthermore, we show how to employ CEF to enable fault characterization using 23,000× fewer fault injection (FI) experiments than exhaustive FI, and evaluate our FI approach on different robots and MPAs. We demonstrate that CEF-aware FI can provide insights on vulnerable bits in an MPA while taking the same amount of time as uniform statistical FI. Finally, we use the CEF to formulate guidelines for designing soft-error resilient MPAs.en_US
dc.publisherACMen_US
dc.relation.isversionofhttp://dx.doi.org/10.1145/3627828en_US
dc.rightsArticle is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.en_US
dc.sourceAssociation for Computing Machineryen_US
dc.titleCharacterizing and Improving Resilience of Accelerators to Memory Errors in Autonomous Robotsen_US
dc.typeArticleen_US
dc.identifier.citationShah, Deval, Xue, Zi Yu, Pattabiraman, Karthik and Aamodt, Tor. "Characterizing and Improving Resilience of Accelerators to Memory Errors in Autonomous Robots." ACM Transactions on Cyber-Physical Systems.
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
dc.relation.journalACM Transactions on Cyber-Physical Systemsen_US
dc.identifier.mitlicensePUBLISHER_POLICY
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2023-11-01T07:45:37Z
dc.language.rfc3066en
dc.rights.holderThe author(s)
dspace.date.submission2023-11-01T07:45:38Z
mit.licensePUBLISHER_POLICY
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record