Show simple item record

dc.contributor.authorChang, Tsung-Hsiang
dc.contributor.authorYeh, Tom
dc.contributor.authorMiller, Robert C.
dc.date.accessioned2019-06-27T16:21:20Z
dc.date.available2019-06-27T16:21:20Z
dc.date.issued2011-10
dc.identifier.isbn9781450307161
dc.identifier.urihttps://hdl.handle.net/1721.1/121428
dc.description.abstractPixel-based methods are emerging as a new and promising way to develop new interaction techniques on top of existing user interfaces. However, in order to maintain platform independence, other available low-level information about GUI widgets, such as accessibility metadata, was neglected intentionally. In this paper, we present a hybrid framework, PAX, which associates the visual representation of user interfaces (i.e. the pixels) and their internal hierarchical metadata (i.e. the content, role, and value). We identify challenges to building such a framework. We also develop and evaluate two new algorithms for detecting text at arbitrary places on the screen, and for segmenting a text image into individual word blobs. Finally, we validate our framework in implementations of three applications. We enhance an existing pixel-based system, Sikuli Script, and preserve the readability of its script code at the same time. Further, we create two novel applications, Screen Search and Screen Copy, to demonstrate how PAX can be applied to development of desktop-level interactive systems.en_US
dc.description.sponsorshipNational Science Foundation (U.S.) (award number IIS - 0447800)en_US
dc.description.sponsorshipQuanta Computer Incorporated (as part of the TParty project)en_US
dc.language.isoen
dc.publisherAssociation for Computing Machinery (ACM)en_US
dc.relation.isversionof10.1145/2047196.2047228en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourceMIT web domainen_US
dc.titleAssociating the visual representation of user interfaces with their internal structures and metadataen_US
dc.typeArticleen_US
dc.identifier.citationChang, Tsung-Hsiang, Tom Yeh and Rob Miller. "Associating the visual representation of user interfaces with their internal structures and metadata." In UIST'11, Proceedings of the 24th annual ACM symposium on User interface software and technology, Santa Barbara, California, USA, October 16-19, 2011, pages 245-256.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.relation.journalProceedings of the 24th annual ACM symposium on User interface software and technologyen_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dc.date.updated2019-06-27T13:22:20Z
dspace.date.submission2019-06-27T13:22:35Z


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record