Automatic Estimation of Transcription Accuracy and Difficulty

Roy, Brandon Cain; Vosoughi, Soroush; Roy, Deb K.

dc.contributor.author	Vosoughi, Soroush
dc.contributor.author	Roy, Brandon C.
dc.contributor.author	Roy, Deb K
dc.date.accessioned	2012-02-13T18:06:22Z
dc.date.available	2012-02-13T18:06:22Z
dc.date.issued	2010-09
dc.identifier.uri	http://hdl.handle.net/1721.1/69094
dc.description.abstract	Managing a large-scale speech transcription task with a team of human transcribers requires effective quality control and workload distribution. As it becomes easier and cheaper to collect massive audio corpora, the problem is magnified. Relying on expert review or transcribing all speech multiple times is impractical. Furthermore, speech that is difficult to transcribe may be better handled by a more experienced transcriber or skipped entirely. We present a fully automatic system to address these issues. First, we use the system to estimate transcription accuracy from a single transcript and show that it correlates well with intertranscriber agreement. Second, we use the system to estimate the transcription “difficulty” of a speech segment and show that it is strongly correlated with transcriber effort. This system can help a transcription manager determine when speech segments may require review, track transcriber performance, and efficiently manage the transcription process.	en_US
dc.language.iso	en_US
dc.publisher	International Speech Communication Association	en_US
dc.relation.isversionof	http://www.isca-speech.org/archive/interspeech_2010/i10_1902.html	en_US
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike 3.0	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/3.0/	en_US
dc.source	Soroush Vosoughi	en_US
dc.title	Automatic Estimation of Transcription Accuracy and Difficulty	en_US
dc.type	Article	en_US
dc.identifier.citation	Roy, Brandon C. / Vosoughi, Soroush / Roy, Deb (2010): "Automatic estimation of transcription accuracy and difficulty", In INTERSPEECH-2010, 1902-1905.	en_US
dc.contributor.department	Program in Media Arts and Sciences (Massachusetts Institute of Technology)	en_US
dc.contributor.approver	Vosoughi, Soroush
dc.contributor.mitauthor	Vosoughi, Soroush
dc.contributor.mitauthor	Roy, Brandon Cain
dc.contributor.mitauthor	Roy, Deb K.
dc.relation.journal	Proceedings of Interspeech 2010	en_US
dc.eprint.version	Author's final manuscript	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
dspace.orderedauthors	Roy, Brandon Cain; Vosoughi, Soroush; Roy, Deb K.
dc.identifier.orcid	https://orcid.org/0000-0002-2564-8909
dc.identifier.orcid	https://orcid.org/0000-0002-4333-7194
mit.license	OPEN_ACCESS_POLICY	en_US
mit.metadata.status	Complete

Files in this item

Name: roy-vosoughi-roy_interspeech20 ...
Size: 1.689Mb
Format: PDF

This item appears in the following Collection(s)

MIT Open Access Articles
