dc.contributor.author | Sze, Vivienne | |
dc.contributor.author | Suleiman, Amr AbdulZahir | |
dc.contributor.author | Zhang, Zhengdong | |
dc.date.accessioned | 2017-04-11T13:12:28Z | |
dc.date.available | 2017-04-11T13:12:28Z | |
dc.date.issued | 2017-01 | |
dc.identifier.issn | 0018-9200 | |
dc.identifier.issn | 1558-173X | |
dc.identifier.uri | http://hdl.handle.net/1721.1/108035 | |
dc.description.abstract | This paper presents a programmable, energy-efficient, and real-time object detection hardware accelerator for low power and high throughput applications using deformable parts models, with 2x higher detection accuracy than traditional rigid body models. Three methods are used to address the high computational complexity of eight deformable parts detection: classification pruning for 33x fewer part classification, vector quantization for 15x memory size reduction, and feature basis projection for 2x reduction in the cost of each classification. The chip was fabricated in a 65 nm CMOS technology, and can process full high definition 1920 × 1080 videos at 60 frames/s without any OFF-chip storage. The chip has two programmable classification engines (CEs) for multiobject detection. At 30 frames/s, the chip consumes only 58.6 mW (0.94 nJ/pixel, 1168 GOPS/W). At a higher throughput of 60 frames/s, the CEs can be time multiplexed to detect even more than two object classes. This proposed accelerator enables object detection to be as energy-efficient as video compression, which is found in most cameras today. | en_US |
dc.description.sponsorship | United States. Defense Advanced Research Projects Agency | en_US |
dc.description.sponsorship | Texas Instruments Incorporated | en_US |
dc.language.iso | en_US | |
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) | en_US |
dc.relation.isversionof | https://doi.org/10.1109/JSSC.2017.2648820 | en_US |
dc.rights | Creative Commons Attribution-Noncommercial-Share Alike | en_US |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
dc.source | Sze | en_US |
dc.title | A 58.6mW 30fps Real-Time Programmable Multi-Object Detection Accelerator with Deformable Parts Models on Full HD 1920×1080 Videos | en_US |
dc.title.alternative | A 58.6 mW 30 Frames/s Real-Time Programmable Multiobject Detection Accelerator With Deformable Parts Models on Full HD 1920×1080 Videos | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Suleiman, Amr, Zhang, Zhengdong, and Sze, Vivienne. "A 58.6 mW 30 Frames/s Real-Time Programmable Multiobject Detection Accelerator With Deformable Parts Models on Full HD 1920×1080 Videos." IEEE Journal of Solid State Circuits, 52 (March 2017): 844-855.© 2017 Institute of Electrical and Electronics Engineers (IEEE) | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
dc.contributor.approver | Sze, Vivienne | en_US |
dc.contributor.mitauthor | Sze, Vivienne | |
dc.contributor.mitauthor | Suleiman, Amr AbdulZahir | |
dc.contributor.mitauthor | Zhang, Zhengdong | |
dc.relation.journal | IEEE Journal of Solid State Circuits | en_US |
dc.eprint.version | Author's final manuscript | en_US |
dc.type.uri | http://purl.org/eprint/type/JournalArticle | en_US |
eprint.status | http://purl.org/eprint/status/PeerReviewed | en_US |
dspace.orderedauthors | Suleiman, Amr; Zhang, Zhengdong; Sze, Vivienne | en_US |
dspace.embargo.terms | N | en_US |
dc.identifier.orcid | https://orcid.org/0000-0003-4841-3990 | |
dc.identifier.orcid | https://orcid.org/0000-0002-0376-4220 | |
dc.identifier.orcid | https://orcid.org/0000-0002-0619-8199 | |
mit.license | OPEN_ACCESS_POLICY | en_US |