dc.contributor.author | Vosoughi, Soroush | |
dc.contributor.author | Roy, Brandon C. | |
dc.contributor.author | Roy, Deb K. | |
dc.date.accessioned | 2012-02-13T18:06:22Z | |
dc.date.available | 2012-02-13T18:06:22Z | |
dc.date.issued | 2010-09 | |
dc.identifier.uri | http://hdl.handle.net/1721.1/69094 | |
dc.description.abstract | Managing a large-scale speech transcription task with a team of
human transcribers requires effective quality control and workload
distribution. As it becomes easier and cheaper to collect
massive audio corpora, the problem is magnified. Relying on
expert review or transcribing all speech multiple times is impractical.
Furthermore, speech that is difficult to transcribe may
be better handled by a more experienced transcriber or skipped
entirely.
We present a fully automatic system to address these issues.
First, we use the system to estimate transcription accuracy from
a single transcript and show that it correlates well with inter-transcriber
agreement. Second, we use the system to estimate
the transcription “difficulty” of a speech segment and show that
it is strongly correlated with transcriber effort. This system
can help a transcription manager determine when speech segments
may require review, track transcriber performance, and
efficiently manage the transcription process. | en_US |
dc.language.iso | en_US | |
dc.publisher | International Speech Communication Association | en_US |
dc.relation.isversionof | http://www.isca-speech.org/archive/interspeech_2010/i10_1902.html | en_US |
dc.rights | Creative Commons Attribution-Noncommercial-Share Alike 3.0 | en_US |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/3.0/ | en_US |
dc.source | Soroush Vosoughi | en_US |
dc.title | Automatic Estimation of Transcription Accuracy and Difficulty | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Roy, Brandon C. / Vosoughi, Soroush / Roy, Deb (2010): "Automatic estimation of transcription accuracy and difficulty", In INTERSPEECH-2010, 1902-1905. | en_US |
dc.contributor.department | Program in Media Arts and Sciences (Massachusetts Institute of Technology) | en_US |
dc.contributor.approver | Vosoughi, Soroush | |
dc.contributor.mitauthor | Vosoughi, Soroush | |
dc.contributor.mitauthor | Roy, Brandon Cain | |
dc.contributor.mitauthor | Roy, Deb K. | |
dc.relation.journal | Proceedings of Interspeech 2010 | en_US |
dc.eprint.version | Author's final manuscript | en_US |
dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
dspace.orderedauthors | Roy, Brandon Cain; Vosoughi, Soroush; Roy, Deb K. | |
dc.identifier.orcid | https://orcid.org/0000-0002-2564-8909 | |
dc.identifier.orcid | https://orcid.org/0000-0002-4333-7194 | |
mit.license | OPEN_ACCESS_POLICY | en_US |
mit.metadata.status | Complete | |