Search
Now showing items 1-5 of 5
Detect What You Can: Detecting and Representing Objects using Holistic Models and Body Parts
(Center for Brains, Minds and Machines (CBMM), arXiv, 2014-06-10)
Detecting objects becomes difficult when we need to deal with large shape deformation, occlusion and low resolution. We propose a novel approach to i) handle large deformations and partial occlusions in animals (as examples ...
Semantic Part Segmentation using Compositional Model combining Shape and Appearance
(Center for Brains, Minds and Machines (CBMM), arXiv, 2015-06-08)
In this paper, we study the problem of semantic part segmentation for animals. This is more challenging than standard object detection, object segmentation and pose estimation tasks because semantic parts of animals often ...
Robust Estimation of 3D Human Poses from a Single Image
(Center for Brains, Minds and Machines (CBMM), arXiv, 2014-06-10)
Human pose estimation is a key step to action recognition. We propose a method of estimating 3D human poses from a single image, which works in conjunction with an existing 2D pose/joint detector. 3D pose estimation is ...
Parsing Occluded People by Flexible Compositions
(Center for Brains, Minds and Machines (CBMM), arXiv, 2015-06-01)
This paper presents an approach to parsing humans when there is significant occlusion. We model humans using a graphical model which has a tree structure building on recent work [32, 6] and exploit the connectivity prior ...
Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN)
(Center for Brains, Minds and Machines (CBMM), arXiv, 2015-05-07)
In this paper, we present a multimodal Recurrent Neural Network (m-RNN) model for generating novel image captions. It directly models the probability distribution of generating a word given previous words and an image. ...