Simple item record

dc.contributor.author: Nguyen, Phu Tran
dc.contributor.author: Weir, Sarah
dc.contributor.author: Guo, Philip J.
dc.contributor.author: Miller, Robert C.
dc.contributor.author: Gajos, Krzysztof Z.
dc.contributor.author: Kim, Juho
dc.date.accessioned: 2014-09-26T18:02:11Z
dc.date.available: 2014-09-26T18:02:11Z
dc.date.issued: 2014-04
dc.identifier.isbn: 9781450324731
dc.identifier.uri: http://hdl.handle.net/1721.1/90410
dc.description.abstract: Millions of learners today use how-to videos to master new skills in a variety of domains. But browsing such videos is often tedious and inefficient because video player interfaces are not optimized for the unique step-by-step structure of such videos. This research aims to improve the learning experience of existing how-to videos with step-by-step annotations. We first performed a formative study to verify that annotations are actually useful to learners. We created ToolScape, an interactive video player that displays step descriptions and intermediate result thumbnails in the video timeline. Learners in our study performed better and gained more self-efficacy using ToolScape versus a traditional video player. To add the needed step annotations to existing how-to videos at scale, we introduce a novel crowdsourcing workflow. It extracts step-by-step structure from an existing video, including step times, descriptions, and before and after images. We introduce the Find-Verify-Expand design pattern for temporal and visual annotation, which applies clustering, text processing, and visual analysis algorithms to merge crowd output. The workflow does not rely on domain-specific customization, works on top of existing videos, and recruits untrained crowd workers. We evaluated the workflow with Mechanical Turk, using 75 cooking, makeup, and Photoshop videos on YouTube. Results show that our workflow can extract steps with a quality comparable to that of trained annotators across all three domains, with 77% precision and 81% recall. [en_US]
dc.language.iso: en_US
dc.publisher: Association for Computing Machinery (ACM) [en_US]
dc.relation.isversionof: http://dx.doi.org/10.1145/2556288.2556986 [en_US]
dc.rights: Creative Commons Attribution-Noncommercial-Share Alike [en_US]
dc.rights.uri: http://creativecommons.org/licenses/by-nc-sa/4.0/ [en_US]
dc.source: Other univ. web domain [en_US]
dc.title: Crowdsourcing step-by-step information extraction to enhance existing how-to videos [en_US]
dc.type: Article [en_US]
dc.identifier.citation: Juho Kim, Phu Tran Nguyen, Sarah Weir, Philip J. Guo, Robert C. Miller, and Krzysztof Z. Gajos. 2014. Crowdsourcing step-by-step information extraction to enhance existing how-to videos. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '14). ACM, New York, NY, USA, 4017-4026. [en_US]
dc.contributor.department: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory [en_US]
dc.contributor.department: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science [en_US]
dc.contributor.mitauthor: Kim, Juho [en_US]
dc.contributor.mitauthor: Nguyen, Phu Tran [en_US]
dc.contributor.mitauthor: Weir, Sarah [en_US]
dc.contributor.mitauthor: Miller, Robert C. [en_US]
dc.relation.journal: Proceedings of the 32nd Annual ACM Conference on Human Factors in Computing Systems (CHI '14) [en_US]
dc.eprint.version: Author's final manuscript [en_US]
dc.type.uri: http://purl.org/eprint/type/ConferencePaper [en_US]
eprint.status: http://purl.org/eprint/status/NonPeerReviewed [en_US]
dspace.orderedauthors: Kim, Juho; Nguyen, Phu Tran; Weir, Sarah; Guo, Philip J.; Miller, Robert C.; Gajos, Krzysztof Z. [en_US]
dc.identifier.orcid: https://orcid.org/0000-0001-6348-4127
dc.identifier.orcid: https://orcid.org/0000-0002-0442-691X
mit.license: OPEN_ACCESS_POLICY [en_US]
mit.metadata.status: Complete
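
The abstract above notes that the Find-Verify-Expand pattern "applies clustering, text processing, and visual analysis algorithms to merge crowd output." As a minimal sketch of the temporal-merging idea only, here is an illustrative Python example that clusters crowd-submitted step timestamps; the function name, the 10-second tolerance, and the 3-worker support threshold are assumptions for illustration, not values or code from the paper.

    # Illustrative sketch only -- not the paper's implementation. Merges
    # crowd-submitted step timestamps (in seconds) by grouping nearby
    # points and keeping groups that enough workers agree on. The
    # tolerance and min_support defaults below are assumptions.

    def merge_step_times(timestamps, tolerance=10.0, min_support=3):
        clusters = []  # each cluster: timestamps judged to be one step
        for t in sorted(timestamps):
            if clusters and t - clusters[-1][-1] <= tolerance:
                clusters[-1].append(t)  # near the previous point: same step
            else:
                clusters.append([t])    # large gap: start a new candidate step
        # Keep only steps reported by at least min_support workers,
        # and return each surviving step's mean time.
        return [sum(c) / len(c) for c in clusters if len(c) >= min_support]

    # Example: three workers agree on steps near 12s and 45s; the lone
    # report at 80s lacks support and is dropped.
    print(merge_step_times([11.0, 12.5, 13.0, 44.0, 45.5, 46.0, 80.0]))
    # -> [12.166666666666666, 45.166666666666664]

Requiring agreement from several workers before accepting a step is one simple way untrained crowd output can approach trained-annotator quality, which is the effect the abstract reports (77% precision, 81% recall across cooking, makeup, and Photoshop videos).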

