Search
Now showing items 1-10 of 122
Reconstructing Native Language Typology from Foreign Language Usage
(Center for Brains, Minds and Machines (CBMM), arXiv, 2014-04-25)
Linguists and psychologists have long been studying cross-linguistic transfer, the influence of native language properties on linguistic performance in a foreign language. In this work we provide empirical evidence for ...
Robust Estimation of 3D Human Poses from a Single Image
(Center for Brains, Minds and Machines (CBMM), arXiv, 2014-06-10)
Human pose estimation is a key step to action recognition. We propose a method of estimating 3D human poses from a single image, which works in conjunction with an existing 2D pose/joint detector. 3D pose estimation is ...
The Secrets of Salient Object Segmentation
(Center for Brains, Minds and Machines (CBMM), arXiv, 2014-06-13)
In this paper we provide an extensive evaluation of fixation prediction and salient object segmentation algorithms as well as statistics of major datasets. Our analysis identifies serious design flaws of existing salient ...
Abstracts of the 2014 Brains, Minds, and Machines Summer School
(Center for Brains, Minds and Machines (CBMM), 2014-09-26)
A compilation of abstracts from the student projects of the 2014 Brains, Minds, and Machines Summer School, held at Woods Hole Marine Biological Lab, May 29 - June 12, 2014.
On Invariance and Selectivity in Representation Learning
(Center for Brains, Minds and Machines (CBMM), arXiv, 2015-03-23)
We discuss data representation which can be learned automatically from data, are invariant to transformations, and at the same time selective, in the sense that two points have the same representation only if they are one ...
Towards a Programmer’s Apprentice (Again)
(Center for Brains, Minds and Machines (CBMM), 2015-04-03)
Programmers are loathe to interrupt their workflow to document their design rationale, leading to frequent errors when software is modified—often much later and by different programmers. A Pro- grammer’s Assistant could ...
Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN)
(Center for Brains, Minds and Machines (CBMM), arXiv, 2015-05-07)
In this paper, we present a multimodal Recurrent Neural Network (m-RNN) model for generating novel image captions. It directly models the probability distribution of generating a word given previous words and an image. ...
Seeing What You’re Told: Sentence-Guided Activity Recognition In Video
(Center for Brains, Minds and Machines (CBMM), arXiv, 2014-05-29)
We present a system that demonstrates how the compositional structure of events, in concert with the compositional structure of language, can interplay with the underlying focusing mechanisms in video action recognition, ...
Sensitivity to Timing and Order in Human Visual Cortex.
(Center for Brains, Minds and Machines (CBMM), arXiv, 2014-04-25)
Visual recognition takes a small fraction of a second and relies on the cascade of signals along the ventral visual stream. Given the rapid path through multiple processing steps between photoreceptors and higher visual ...
The Compositional Nature of Event Representations in the Human Brain
(Center for Brains, Minds and Machines (CBMM), arXiv, 2014-07-14)
How does the human brain represent simple compositions of constituents: actors, verbs, objects, directions, and locations? Subjects viewed videos during neuroimaging (fMRI) sessions from which sentential descriptions of ...