Show simple item record

dc.contributor.authorHammond, Tracy A.
dc.contributor.authorDavis, Randall
dc.date.accessioned2012-06-08T15:25:56Z
dc.date.available2012-06-08T15:25:56Z
dc.date.issued2010-08
dc.date.submitted2010-02
dc.identifier.isbn978-1-4503-0103-9
dc.identifier.urihttp://hdl.handle.net/1721.1/71125
dc.description.abstractSketch recognition is automated understanding of hand-drawn diagrams. Current sketch recognition systems exist for only a handful of domains, which contain on the order of 10--20 shapes. Our goal was to create a generalized method for recognition that could work for many domains, increasing the number of shapes that could be recognized in real-time, while maintaining a high accuracy. In an effort to effectively recognize shapes while allowing drawing freedom (both drawing-style freedom and perceptually-valid variations), we created the shape description language modeled after the way people naturally describe shapes to 1) create an intuitive and easy to understand description, providing transparency to the underlying recognition process, and 2) to improve recognition by providing recognition flexibility (drawing freedom) that is aligned with how humans perceive shapes. This paper describes the results of a study performed to see how users naturally describe shapes. A sample of 35 subjects described or drew approximately 16 shapes each. Results show a common vocabulary related to Gestalt grouping and singularities. Results also show that perception, similarity, and context play an important role in how people describe shapes. This study resulted in a language (LADDER) that allows shape recognizers for any domain to be automatically generated from a single hand-drawn example of each shape. Sketch systems for over 30 different domains have been automatically generated based on this language. The largest domain contained 923 distinct shapes, and achieved a recognition accuracy of 83% (and a top-3 accuracy of 87%) on a corpus of over 11,000 sketches, which recognizes almost two orders of magnitude more shapes than any other existing system.en_US
dc.description.sponsorshipNational Science Foundation (U.S.) (grant 0757557)en_US
dc.description.sponsorshipNational Science Foundation (U.S.) (grant 0943499)en_US
dc.language.isoen_US
dc.publisherAssociation for Computing Machinery (ACM)en_US
dc.relation.isversionofhttp://dx.doi.org/10.1145/1858171.1858197en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alike 3.0en_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/3.0/en_US
dc.sourceDavis via Amy Stouten_US
dc.titleCreating the Perception-based LADDER sketch recognition languageen_US
dc.typeArticleen_US
dc.identifier.citationTracy Hammond and Randall Davis. 2010. Creating the perception-based LADDER sketch recognition language. In Proceedings of the 8th ACM Conference on Designing Interactive Systems (DIS '10). ACM, New York, NY, USA, 141-150. DOI=10.1145/1858171.1858197 http://doi.acm.org/10.1145/1858171.1858197en_US
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.contributor.approverDavis, Randall
dc.contributor.mitauthorDavis, Randall
dc.relation.journalProceedings of the 8th ACM Conference on Designing Interactive Systemsen_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
dspace.orderedauthorsHammond, Tracy; Davis, Randallen
dc.identifier.orcidhttps://orcid.org/0000-0001-5232-7281
mit.licenseOPEN_ACCESS_POLICYen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record