Real-time large object category recognition using robust RGB-D segmentation features

Finman, Ross Edward

dc.contributor.advisor	Seth Teller.	en_US
dc.contributor.author	Finman, Ross Edward	en_US
dc.contributor.other	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science.	en_US
dc.date.accessioned	2013-06-17T19:48:23Z
dc.date.available	2013-06-17T19:48:23Z
dc.date.copyright	2012	en_US
dc.date.issued	2013	en_US
dc.identifier.uri	http://hdl.handle.net/1721.1/79218
dc.description	Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, February 2013.	en_US
dc.description	"February 2013." Cataloged from PDF version of thesis.	en_US
dc.description	Includes bibliographical references (p. 77-80).	en_US
dc.description.abstract	This thesis looks at the problem of large object category recognition for use in robotic systems. While many algorithms exist for object recognition, category recognition remains a challenge within robotics, particularly with the robustness and real-time constraints within robotics. Our system addresses category recognition by treating it as a segmentation problem and using the resulting segments to learn and detect large objects based on their 3D characteristics. The first part of this thesis examines how to efficiently do unsupervised segmentation of an RGB-D image in a way that is consistent across wide viewpoint and scale variance, and creating features from the resulting segments. The second part of this thesis explores how to do robust data association to keep temporally consistent segments between frames. Our higher-level module filters and matches relevant segments to a learned database of categories and outputs a pixel-accurate, labeled object mask. Our system has a run time that is nearly linear with the number of RGB-D samples and we evaluate it in a real-time robotic application.	en_US
dc.description.statementofresponsibility	by Ross Edward Finman.	en_US
dc.format.extent	80 p.	en_US
dc.language.iso	eng	en_US
dc.publisher	Massachusetts Institute of Technology	en_US
dc.rights	M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission.	en_US
dc.rights.uri	http://dspace.mit.edu/handle/1721.1/7582	en_US
dc.subject	Electrical Engineering and Computer Science.	en_US
dc.title	Real-time large object category recognition using robust RGB-D segmentation features	en_US
dc.type	Thesis	en_US
dc.description.degree	S.M.	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
dc.identifier.oclc	844753203	en_US

Files in this item

Name:: 844753203-MIT.pdf
Size:: 8.979Mb
Format:: PDF
Description:: Full printable version

View/Open

This item appears in the following Collection(s)

Graduate Theses

Show simple item record