When Computer Vision Gazes at Cognition
(Center for Brains, Minds and Machines (CBMM), arXiv, 2014-12-12)
Joint attention is a core, early-developing form of social interaction. It is based on our ability to discriminate the third party objects that other people are looking at. While it has been shown that people can accurately ...
Measuring and modeling the perception of natural and unconstrained gaze in humans and machines
(Center for Brains, Minds and Machines (CBMM), arXiv, 2016-11-28)
Humans are remarkably adept at interpreting the gaze direction of other individuals in their surroundings. This skill is at the core of the ability to engage in joint visual attention, which is essential for establishing ...
Image interpretation above and below the object level
(Center for Brains, Minds and Machines (CBMM), 2018-05-10)
Computational models of vision have advanced in recent years at a rapid rate, rivaling in some areas human-level performance. Much of the progress to date has focused on analyzing the visual scene at the object level – ...
Do You See What I Mean? Visual Resolution of Linguistic Ambiguities
(Center for Brains, Minds and Machines (CBMM), arXiv, 2016-06-10)
Understanding language goes hand in hand with the ability to integrate complex contextual information obtained via perception. In this work, we present a novel task for grounded language understanding: disambiguating a ...
Spatiotemporal interpretation features in the recognition of dynamic images
(Center for Brains, Minds and Machines (CBMM), 2018-11-21)
Objects and their parts can be visually recognized and localized from purely spatial information in static images and also from purely temporal information as in the perception of biological motion. Cortical regions have ...
Full interpretation of minimal images
(Center for Brains, Minds and Machines (CBMM), 2017-02-08)
The goal in this work is to model the process of ‘full interpretation’ of object images, which is the ability to identify and localize all semantic features and parts that are recognized by human observers. The task is ...