| dc.contributor.author | Kuehne, H. | |
| dc.contributor.author | Serre, T. | |
| dc.contributor.author | Jhuang, H. | |
| dc.contributor.author | Garrote, Estibaliz | |
| dc.contributor.author | Poggio, Tomaso A. | |
| dc.date.accessioned | 2012-04-11T17:45:39Z | |
| dc.date.available | 2012-04-11T17:45:39Z | |
| dc.date.issued | 2012-01 | |
| dc.date.submitted | 2011-11 | |
| dc.identifier.isbn | 978-1-4577-1101-5 | |
| dc.identifier.issn | 1550-5499 | |
| dc.identifier.other | INSPEC Accession Number: 12491176 | |
| dc.identifier.uri | http://hdl.handle.net/1721.1/69981 | |
| dc.description.abstract | With nearly one billion online videos viewed everyday, an emerging new frontier in computer vision research is recognition and search in video. While much effort has been devoted to the collection and annotation of large scalable static image datasets containing thousands of image categories, human action datasets lag far behind. Current action recognition databases contain on the order of ten different action categories collected under fairly controlled conditions. State-of-the-art performance on these datasets is now near ceiling and thus there is a need for the design and creation of new benchmarks. To address this issue we collected the largest action video database to-date with 51 action categories, which in total contain around 7,000 manually annotated clips extracted from a variety of sources ranging from digitized movies to YouTube. We use this database to evaluate the performance of two representative computer vision systems for action recognition and explore the robustness of these methods under various conditions such as camera motion, viewpoint, video quality and occlusion. | en_US |
| dc.description.sponsorship | United States. Defense Advanced Research Projects Agency. Information Processing Techniques Office | en_US |
| dc.description.sponsorship | United States. Defense Advanced Research Projects Agency. System Science Division. Defense Sciences Office | en_US |
| dc.description.sponsorship | National Science Foundation (U.S.) (NSF-0640097) | en_US |
| dc.description.sponsorship | National Science Foundation (U.S.) (NSF-0827427) | en_US |
| dc.description.sponsorship | United States. Air Force Office of Scientific Research (FA8650-05- C-7262) | en_US |
| dc.description.sponsorship | Adobe Systems | en_US |
| dc.description.sponsorship | King Abdullah University of Science and Technology | en_US |
| dc.description.sponsorship | NEC Electronics | en_US |
| dc.description.sponsorship | Sony Corporation | en_US |
| dc.description.sponsorship | Eugene McDermott Foundation | en_US |
| dc.description.sponsorship | Brown University. Center for Computing and Visualization | en_US |
| dc.description.sponsorship | Robert J. and Nancy D. Carney Fund for Scientific Innovation | en_US |
| dc.description.sponsorship | United States. Defense Advanced Research Projects Agency (DARPA-BAA-09-31) | en_US |
| dc.description.sponsorship | United States. Office of Naval Research (ONR-BAA-11-001) | en_US |
| dc.description.sponsorship | Ministry of Science, Research and the Arts of Baden Württemberg, Germany | en_US |
| dc.language.iso | en_US | |
| dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) | en_US |
| dc.relation.isversionof | http://dx.doi.org/10.1109/ICCV.2011.6126543 | en_US |
| dc.rights | Creative Commons Attribution-Noncommercial-Share Alike 3.0 | en_US |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/3.0/ | en_US |
| dc.source | Prof. Poggio | en_US |
| dc.title | HMDB: A Large Video Database for Human Motion Recognition | en_US |
| dc.type | Article | en_US |
| dc.identifier.citation | Kuehne, H. et al. “HMDB: A Large Video Database for Human Motion Recognition.” IEEE, 2011. 2556–2563. Web. 11 Apr. 2012. © 2012 Institute of Electrical and Electronics Engineers | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences | en_US |
| dc.contributor.approver | Poggio, Tomaso A. | |
| dc.contributor.mitauthor | Jhuang, H. | |
| dc.contributor.mitauthor | Garrote, Estibaliz | |
| dc.contributor.mitauthor | Poggio, Tomaso A. | |
| dc.relation.journal | 2011 IEEE International Conference on Computer Vision | en_US |
| dc.eprint.version | Author's final manuscript | en_US |
| dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
| dspace.orderedauthors | Kuehne, H.; Jhuang, H.; Garrote, E.; Poggio, T.; Serre, T. | en |
| dc.identifier.orcid | https://orcid.org/0000-0002-3944-0455 | |
| mit.license | OPEN_ACCESS_POLICY | en_US |
| mit.metadata.status | Complete | |