Automatic Estimation of Transcription Accuracy and Difficulty
Author(s)
Vosoughi, Soroush; Roy, Brandon C.; Roy, Deb K
Downloadroy-vosoughi-roy_interspeech2010.pdf (1.689Mb)
OPEN_ACCESS_POLICY
Open Access Policy
Creative Commons Attribution-Noncommercial-Share Alike
Terms of use
Metadata
Show full item recordAbstract
Managing a large-scale speech transcription task with a team of
human transcribers requires effective quality control and workload
distribution. As it becomes easier and cheaper to collect
massive audio corpora the problem is magnified. Relying on
expert review or transcribing all speech multiple times is impractical.
Furthermore, speech that is difficult to transcribe may
be better handled by a more experienced transcriber or skipped
entirely.
We present a fully automatic system to address these issues.
First, we use the system to estimate transcription accuracy from
a a single transcript and show that it correlates well with intertranscriber
agreement. Second, we use the system to estimate
the transcription “difficulty” of a speech segment and show that
it is strongly correlated with transcriber effort. This system
can help a transcription manager determine when speech segments
may require review, track transcriber performance, and
efficiently manage the transcription process.
Date issued
2010-09Department
Program in Media Arts and Sciences (Massachusetts Institute of Technology)Journal
Proceedings of Interspeech 2010
Publisher
International Speech Communication Association
Citation
Roy, Brandon C. / Vosoughi, Soroush / Roy, Deb (2010): "Automatic estimation of transcription accuracy and difficulty", In INTERSPEECH-2010, 1902-1905.
Version: Author's final manuscript