Show simple item record

dc.contributor.authorCan, Dogan
dc.contributor.authorCooper, Erica
dc.contributor.authorSethy, Abhinav
dc.contributor.authorWhite, Chris
dc.contributor.authorRamabhadran, Bhuvana
dc.contributor.authorSaraclar, Murat
dc.date.accessioned2010-10-07T15:33:00Z
dc.date.available2010-10-07T15:33:00Z
dc.date.issued2009-05
dc.date.submitted2009-04
dc.identifier.isbn978-1-4244-2353-8
dc.identifier.issn1520-6149
dc.identifier.otherINSPEC Accession Number: 10700575
dc.identifier.urihttp://hdl.handle.net/1721.1/58936
dc.description.abstractThe spoken term detection (STD) task aims to return relevant segments from a spoken archive that contain the query terms whether or not they are in the system vocabulary. This paper focuses on pronunciation modeling for out-of-vocabulary (OOV) terms which frequently occur in STD queries. The STD system described in this paper indexes word-level and sub-word level lattices or confusion networks produced by an LVCSR system using weighted finite state transducers (WFST).We investigate the inclusion of n-best pronunciation variants for OOV terms (obtained from letter-to-sound rules) into the search and present the results obtained by indexing confusion networks as well as lattices. The following observations are worth mentioning: phone indexes generated from sub-words represent OOVs well and too many variants for the OOV terms degrade performance if pronunciations are not weighted.en_US
dc.description.sponsorshipBogazici University Research Funden_US
dc.description.sponsorshipScientific and Technical Research Council of Turkey (TUBITAK) (BIDEB)en_US
dc.language.isoen_US
dc.publisherInstitute of Electrical and Electronics Engineersen_US
dc.relation.isversionofhttp://dx.doi.org/10.1109/ICASSP.2009.4960494en_US
dc.rightsArticle is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.en_US
dc.sourceIEEEen_US
dc.subjectWeighted Finite State Transducersen_US
dc.subjectSpoken Term Detectionen_US
dc.subjectSpeech Recognitionen_US
dc.subjectSpeech Indexing and Retrievalen_US
dc.titleEffect of pronunciations on OOV queries in spoken term detectionen_US
dc.typeArticleen_US
dc.identifier.citationCan, D. et al. “Effect of pronounciations on OOV queries in spoken term detection.” Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on. 2009. 3957-3960. Can, D. et al. “Effect of pronounciations on OOV queries in spoken term detection.” Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on. 2009. 3957-3960. © Copyright 2009 IEEEen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
dc.contributor.approverCooper, Erica
dc.contributor.mitauthorCooper, Erica
dc.relation.journalProceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2009en_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dspace.orderedauthorsCan, Dogan; Cooper, Erica; Sethy, Abhinav; White, Chris; Ramabhadran, Bhuvana; Saraclar, Muraten
mit.licensePUBLISHER_POLICYen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record