
dc.contributor.author: Roy, Brandon C.
dc.contributor.author: Roy, Deb K
dc.date.accessioned: 2011-12-05T16:41:57Z
dc.date.available: 2011-12-05T16:41:57Z
dc.date.issued: 2009-09
dc.identifier.uri: http://hdl.handle.net/1721.1/67363
dc.description: URL to conference session list. Title is under heading: Wed-Ses1-P1: Phonetics, Phonology, cross-language comparisons, pathology [en_US]
dc.description.abstract: We introduce a new method for human-machine collaborative speech transcription that is significantly faster than existing transcription methods. In this approach, automatic audio processing algorithms are used to robustly detect speech in audio recordings and split speech into short, easy to transcribe segments. Sequences of speech segments are loaded into a transcription interface that enables a human transcriber to simply listen and type, obviating the need for manually finding and segmenting speech or explicitly controlling audio playback. As a result, playback stays synchronized to the transcriber's speed of transcription. In evaluations using naturalistic audio recordings made in everyday home situations, the new method is up to 6 times faster than other popular transcription tools while preserving transcription quality. [en_US]
dc.language.iso: en_US
dc.publisher: International Speech Communication Association [en_US]
dc.relation.isversionof: http://www.interspeech2009.org/conference/programme/sessionlist.php [en_US]
dc.rights: Creative Commons Attribution-Noncommercial-Share Alike 3.0 [en_US]
dc.rights.uri: http://creativecommons.org/licenses/by-nc-sa/3.0/ [en_US]
dc.source: MIT web domain [en_US]
dc.title: Fast transcription of unstructured audio recordings [en_US]
dc.type: Article [en_US]
dc.identifier.citation: Roy, Brandon C., Deb Roy. "Fast Transcription of Unstructured Audio Recordings." in Proceedings of the 10th Annual Conference of the International Speech Communication Association, INTERSPEECH 2009, Brighton, UK, Sept. 6-10, 2009. [en_US]
dc.contributor.department: Massachusetts Institute of Technology. Media Laboratory [en_US]
dc.contributor.department: Program in Media Arts and Sciences (Massachusetts Institute of Technology) [en_US]
dc.contributor.approver: Roy, Deb K.
dc.contributor.mitauthor: Roy, Brandon Cain
dc.contributor.mitauthor: Roy, Deb K.
dc.relation.journal: Proceedings of the 10th Annual Conference of the International Speech Communication Association (INTERSPEECH 2009) [en_US]
dc.eprint.version: Author's final manuscript [en_US]
dc.type.uri: http://purl.org/eprint/type/ConferencePaper [en_US]
dspace.orderedauthors: Roy, Brandon C.; Roy, Deb [en_US]
dc.identifier.orcid: https://orcid.org/0000-0002-4333-7194
mit.license: OPEN_ACCESS_POLICY [en_US]
mit.metadata.status: Complete


