Creating the Perception-based LADDER sketch recognition language

Hammond, Tracy A.; Davis, Randall

dc.contributor.author	Hammond, Tracy A.
dc.contributor.author	Davis, Randall
dc.date.accessioned	2012-06-08T15:25:56Z
dc.date.available	2012-06-08T15:25:56Z
dc.date.issued	2010-08
dc.date.submitted	2010-02
dc.identifier.isbn	978-1-4503-0103-9
dc.identifier.uri	http://hdl.handle.net/1721.1/71125
dc.description.abstract	Sketch recognition is automated understanding of hand-drawn diagrams. Current sketch recognition systems exist for only a handful of domains, which contain on the order of 10--20 shapes. Our goal was to create a generalized method for recognition that could work for many domains, increasing the number of shapes that could be recognized in real-time, while maintaining a high accuracy. In an effort to effectively recognize shapes while allowing drawing freedom (both drawing-style freedom and perceptually-valid variations), we created the shape description language modeled after the way people naturally describe shapes to 1) create an intuitive and easy to understand description, providing transparency to the underlying recognition process, and 2) to improve recognition by providing recognition flexibility (drawing freedom) that is aligned with how humans perceive shapes. This paper describes the results of a study performed to see how users naturally describe shapes. A sample of 35 subjects described or drew approximately 16 shapes each. Results show a common vocabulary related to Gestalt grouping and singularities. Results also show that perception, similarity, and context play an important role in how people describe shapes. This study resulted in a language (LADDER) that allows shape recognizers for any domain to be automatically generated from a single hand-drawn example of each shape. Sketch systems for over 30 different domains have been automatically generated based on this language. The largest domain contained 923 distinct shapes, and achieved a recognition accuracy of 83% (and a top-3 accuracy of 87%) on a corpus of over 11,000 sketches, which recognizes almost two orders of magnitude more shapes than any other existing system.	en_US
dc.description.sponsorship	National Science Foundation (U.S.) (grant 0757557)	en_US
dc.description.sponsorship	National Science Foundation (U.S.) (grant 0943499)	en_US
dc.language.iso	en_US
dc.publisher	Association for Computing Machinery (ACM)	en_US
dc.relation.isversionof	http://dx.doi.org/10.1145/1858171.1858197	en_US
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike 3.0	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/3.0/	en_US
dc.source	Davis via Amy Stout	en_US
dc.title	Creating the Perception-based LADDER sketch recognition language	en_US
dc.type	Article	en_US
dc.identifier.citation	Tracy Hammond and Randall Davis. 2010. Creating the perception-based LADDER sketch recognition language. In Proceedings of the 8th ACM Conference on Designing Interactive Systems (DIS '10). ACM, New York, NY, USA, 141-150. DOI=10.1145/1858171.1858197 http://doi.acm.org/10.1145/1858171.1858197	en_US
dc.contributor.department	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science	en_US
dc.contributor.approver	Davis, Randall
dc.contributor.mitauthor	Davis, Randall
dc.relation.journal	Proceedings of the 8th ACM Conference on Designing Interactive Systems	en_US
dc.eprint.version	Author's final manuscript	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
dspace.orderedauthors	Hammond, Tracy; Davis, Randall	en
dc.identifier.orcid	https://orcid.org/0000-0001-5232-7281
mit.license	OPEN_ACCESS_POLICY	en_US
mit.metadata.status	Complete

Files in this item

Name:: 2010HammondDIS.pdf
Size:: 909.9Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record