Robust Estimation of 3D Human Poses from a Single Image
(Center for Brains, Minds and Machines (CBMM), arXiv, 2014-06-10)
Human pose estimation is a key step to action recognition. We propose a method of estimating 3D human poses from a single image, which works in conjunction with an existing 2D pose/joint detector. 3D pose estimation is ...
Seeing What You’re Told: Sentence-Guided Activity Recognition In Video
(Center for Brains, Minds and Machines (CBMM), arXiv, 2014-05-29)
We present a system that demonstrates how the compositional structure of events, in concert with the compositional structure of language, can interplay with the underlying focusing mechanisms in video action recognition, ...
When Computer Vision Gazes at Cognition
(Center for Brains, Minds and Machines (CBMM), arXiv, 2014-12-12)
Joint attention is a core, early-developing form of social interaction. It is based on our ability to discriminate the third party objects that other people are looking at. While it has been shown that people can accurately ...
Neural tuning size is a key factor underlying holistic face processing
(Center for Brains, Minds and Machines (CBMM), arXiv, 2014-06-14)
Faces are a class of visual stimuli with unique significance, for a variety of reasons. They are ubiquitous throughout the course of a person’s life, and face recognition is crucial for daily social interaction. Faces are ...
Can a biologically-plausible hierarchy effectively replace face detection, alignment, and recognition pipelines?
(Center for Brains, Minds and Machines (CBMM), arXiv, 2014-03-27)
The standard approach to unconstrained face recognition in natural photographs is via a detection, alignment, recognition pipeline. While that approach has achieved impressive results, there are several reasons to be ...
Representation Learning in Sensory Cortex: a theory
(Center for Brains, Minds and Machines (CBMM), 2014-11-14)
We review and apply a computational theory of the feedforward path of the ventral stream in visual cortex based on the hypothesis that its main function is the encoding of invariant representations of images. A key ...