DSpace@MIT

Center for Brains, Minds & Machines

Now showing items 1-10 of 18


Robust Estimation of 3D Human Poses from a Single Image 

Wang, Chunyu; Wang, Yizhou; Lin, Zhouchen; Yuille, Alan L.; Gao, Wen (Center for Brains, Minds and Machines (CBMM), arXiv, 2014-06-10)
Human pose estimation is a key step to action recognition. We propose a method of estimating 3D human poses from a single image, which works in conjunction with an existing 2D pose/joint detector. 3D pose estimation is ...

The Secrets of Salient Object Segmentation 

Li, Yin; Hou, Xiaodi; Koch, Christof; Rehg, James M.; Yuille, Alan L. (Center for Brains, Minds and Machines (CBMM), arXiv, 2014-06-13)
In this paper we provide an extensive evaluation of fixation prediction and salient object segmentation algorithms as well as statistics of major datasets. Our analysis identifies serious design flaws of existing salient ...

Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN) 

Mao, Junhua; Xu, Wei; Yang, Yi; Wang, Jiang; Huang, Zhiheng; e.a. (Center for Brains, Minds and Machines (CBMM), arXiv, 2015-05-07)
In this paper, we present a multimodal Recurrent Neural Network (m-RNN) model for generating novel image captions. It directly models the probability distribution of generating a word given previous words and an image. ...

Human-Machine CRFs for Identifying Bottlenecks in Holistic Scene Understanding 

Mottaghi, Roozbeh; Fidler, Sanja; Yuille, Alan L.; Urtasun, Raquel; Parikh, Devi (Center for Brains, Minds and Machines (CBMM), arXiv, 2014-06-15)
Recent trends in image understanding have pushed for holistic scene understanding models that jointly reason about various tasks such as object detection, scene recognition, shape analysis, contextual reasoning, and local ...

Single-Shot Object Detection with Enriched Semantics 

Zhang, Zhishuai; Qiao, Siyuan; Xie, Cihang; Shen, Wei; Wang, Bo; e.a. (Center for Brains, Minds and Machines (CBMM), 2018-06-19)
We propose a novel single shot object detection network named Detection with Enriched Semantics (DES). Our motivation is to enrich the semantics of object detection features within a typical deep detector, by a semantic ...

DeepVoting: A Robust and Explainable Deep Network for Semantic Part Detection under Partial Occlusion 

Zhang, Zhishuai; Xie, Cihang; Wang, Jianyu; Xie, Lingxi; Yuille, Alan L. (Center for Brains, Minds and Machines (CBMM), 2018-06-19)
In this paper, we study the task of detecting semantic parts of an object, e.g., a wheel of a car, under partial occlusion. We propose that all models should be trained without seeing occlusions while being able to transfer ...

Semantic Part Segmentation using Compositional Model combining Shape and Appearance 

Wang, Jianyu; Yuille, Alan L. (Center for Brains, Minds and Machines (CBMM), arXiv, 2015-06-08)
In this paper, we study the problem of semantic part segmentation for animals. This is more challenging than standard object detection, object segmentation and pose estimation tasks because semantic parts of animals often ...

Detecting Semantic Parts on Partially Occluded Objects 

Wang, Jianyu; Xie, Cihang; Zhang, Zhishuai; Zhu, Jun; Xie, Lingxi; e.a. (Center for Brains, Minds and Machines (CBMM), 2017-09-04)
In this paper, we address the task of detecting semantic parts on partially occluded objects. We consider a scenario where the model is trained using non-occluded images but tested on occluded images. The motivation is ...

Deep Nets: What have they ever done for Vision? 

Yuille, Alan L.; Liu, Chenxi (Center for Brains, Minds and Machines (CBMM), 2018-05-10)
This is an opinion paper about the strengths and weaknesses of Deep Nets. They are at the center of recent progress on Artificial Intelligence and are of growing importance in Cognitive Science and Neuroscience since they ...

Detect What You Can: Detecting and Representing Objects using Holistic Models and Body Parts 

Chen, Xianjie; Mottaghi, Roozbeh; Liu, Xiaobai; Fidler, Sanja; Urtasun, Raquel; e.a. (Center for Brains, Minds and Machines (CBMM), arXiv, 2014-06-10)
Detecting objects becomes difficult when we need to deal with large shape deformation, occlusion and low resolution. We propose a novel approach to i) handle large deformations and partial occlusions in animals (as examples ...

