MIT Libraries homeMIT Libraries logoDSpace@MIT

MIT
Search 
  • DSpace@MIT Home
  • Center for Brains, Minds & Machines
  • Publications
  • CBMM Memo Series
  • Search
  • DSpace@MIT Home
  • Center for Brains, Minds & Machines
  • Publications
  • CBMM Memo Series
  • Search
JavaScript is disabled for your browser. Some features of this site may not work without it.

Search

Show Advanced FiltersHide Advanced Filters

Filters

Use filters to refine the search results.

Now showing items 1-10 of 13

  • Sort Options:
  • Relevance
  • Title Asc
  • Title Desc
  • Issue Date Asc
  • Issue Date Desc
  • Results Per Page:
  • 5
  • 10
  • 20
  • 40
  • 60
  • 80
  • 100
Thumbnail

Robust Estimation of 3D Human Poses from a Single Image 

Wang, Chunyu; Wang, Yizhou; Lin, Zhouchen; Yuille, Alan L.; Gao, Wen (Center for Brains, Minds and Machines (CBMM), arXiv, 2014-06-10)
Human pose estimation is a key step to action recognition. We propose a method of estimating 3D human poses from a single image, which works in conjunction with an existing 2D pose/joint detector. 3D pose estimation is ...
Thumbnail

Seeing What You’re Told: Sentence-Guided Activity Recognition In Video 

Siddharth, Narayanaswamy; Barbu, Andrei; Siskind, Jeffrey Mark (Center for Brains, Minds and Machines (CBMM), arXiv, 2014-05-29)
We present a system that demonstrates how the compositional structure of events, in concert with the compositional structure of language, can interplay with the underlying focusing mechanisms in video action recognition, ...
Thumbnail

Unsupervised learning of clutter-resistant visual representations from natural videos 

Liao, Qianli; Leibo, Joel Z; Poggio, Tomaso (Center for Brains, Minds and Machines (CBMM), arXiv, 2015-04-27)
Populations of neurons in inferotemporal cortex (IT) maintain an explicit code for object identity that also tolerates transformations of object appearance e.g., position, scale, viewing angle [1, 2, 3]. Though the learning ...
Thumbnail

When Computer Vision Gazes at Cognition 

Gao, Tao; Harari, Daniel; Tenenbaum, Joshua; Ullman, Shimon (Center for Brains, Minds and Machines (CBMM), arXiv, 2014-12-12)
Joint attention is a core, early-developing form of social interaction. It is based on our ability to discriminate the third party objects that other people are looking at. While it has been shown that people can accurately ...
Thumbnail

The Invariance Hypothesis Implies Domain-Specific Regions in Visual Cortex 

Leibo, Joel Z; Liao, Qianli; Anselmi, Fabio; Poggio, Tomaso (Center for Brains, Minds and Machines (CBMM), bioRxiv, 2015-04-26)
Is visual cortex made up of general-purpose information processing machinery, or does it consist of a collection of specialized modules? If prior knowledge, acquired from learning a set of objects is only transferable to ...
Thumbnail

Neural tuning size is a key factor underlying holistic face processing 

Tan, Cheston; Poggio, Tomaso (Center for Brains, Minds and Machines (CBMM), arXiv, 2014-06-14)
Faces are a class of visual stimuli with unique significance, for a variety of reasons. They are ubiquitous throughout the course of a person’s life, and face recognition is crucial for daily social interaction. Faces are ...
Thumbnail

Can a biologically-plausible hierarchy e ectively replace face detection, alignment, and recognition pipelines? 

Liao, Qianli; Leibo, Joel Z; Mroueh, Youssef; Poggio, Tomaso (Center for Brains, Minds and Machines (CBMM), arXiv, 2014-03-27)
The standard approach to unconstrained face recognition in natural photographs is via a detection, alignment, recognition pipeline. While that approach has achieved impressive results, there are several reasons to be ...
Thumbnail

Learning Real and Boolean Functions: When Is Deep Better Than Shallow 

Mhaskar, Hrushikesh; Liao, Qianli; Poggio, Tomaso (Center for Brains, Minds and Machines (CBMM), arXiv, 2016-03-08)
We describe computational tasks - especially in vision - that correspond to compositional/hierarchical functions. While the universal approximation property holds both for hierarchical and shallow networks, we prove that ...
Thumbnail

Representation Learning in Sensory Cortex: a theory 

Anselmi, Fabio; Poggio, Tomaso (Center for Brains, Minds and Machines (CBMM), 2014-11-14)
We review and apply a computational theory of the feedforward path of the ventral stream in visual cortex based on the hypothesis that its main function is the encoding of invariant representations of images. A key ...
Thumbnail

Fast, invariant representation for human action in the visual system 

Isik, Leyla; Tacchetti, Andrea; Poggio, Tomaso (Center for Brains, Minds and Machines (CBMM), arXiv, 2016-01-06)
The ability to recognize the actions of others from visual input is essential to humans' daily lives. The neural computations underlying action recognition, however, are still poorly understood. We use magnetoencephalography ...
  • 1
  • 2

Browse

All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

My Account

Login

Discover

AuthorPoggio, Tomaso (8)Liao, Qianli (4)Anselmi, Fabio (3)Leibo, Joel Z (3)Barbu, Andrei (2)Ullman, Shimon (2)Yuille, Alan L. (2)Berzak, Yevgeni (1)Freiwald, Winrich (1)Gao, Tao (1)... View MoreSubject
Computer vision (13)
Artificial Intelligence (4)Invariance (4)Machine Learning (4)Hierarchy (3)Computer Language (2)Face recognition (2)Action Recognition (1)Compositional Models (1)computational tasks (1)... View MoreDate Issued2014 (6)2016 (4)2015 (3)Has File(s)Yes (13)

Statistics

OA StatisticsStatistics by CountryStatistics by Department
MIT Libraries homeMIT Libraries logo

MIT Libraries navigation

HomeSearchHours & locationsBorrow & requestResearch supportAbout the Libraries
MIT
Massachusetts Institute of Technology77 Massachusetts AvenueCambridge MA 02139-4307
All items in DSpace@MIT are protected by original copyright, with all rights reserved, unless otherwise indicated. Notify us about copyright concerns.