| dc.contributor.author | Can, Dogan | |
| dc.contributor.author | Cooper, Erica | |
| dc.contributor.author | Sethy, Abhinav | |
| dc.contributor.author | White, Chris | |
| dc.contributor.author | Ramabhadran, Bhuvana | |
| dc.contributor.author | Saraclar, Murat | |
| dc.date.accessioned | 2010-10-07T15:33:00Z | |
| dc.date.available | 2010-10-07T15:33:00Z | |
| dc.date.issued | 2009-05 | |
| dc.date.submitted | 2009-04 | |
| dc.identifier.isbn | 978-1-4244-2353-8 | |
| dc.identifier.issn | 1520-6149 | |
| dc.identifier.other | INSPEC Accession Number: 10700575 | |
| dc.identifier.uri | http://hdl.handle.net/1721.1/58936 | |
| dc.description.abstract | The spoken term detection (STD) task aims to return relevant segments from a spoken archive that contain the query terms whether or not they are in the system vocabulary. This paper focuses on pronunciation modeling for out-of-vocabulary (OOV) terms which frequently occur in STD queries. The STD system described in this paper indexes word-level and sub-word level lattices or confusion networks produced by an LVCSR system using weighted finite state transducers (WFST).We investigate the inclusion of n-best pronunciation variants for OOV terms (obtained from letter-to-sound rules) into the search and present the results obtained by indexing confusion networks as well as lattices. The following observations are worth mentioning: phone indexes generated from sub-words represent OOVs well and too many variants for the OOV terms degrade performance if pronunciations are not weighted. | en_US |
| dc.description.sponsorship | Bogazici University Research Fund | en_US |
| dc.description.sponsorship | Scientific and Technical Research Council of Turkey (TUBITAK) (BIDEB) | en_US |
| dc.language.iso | en_US | |
| dc.publisher | Institute of Electrical and Electronics Engineers | en_US |
| dc.relation.isversionof | http://dx.doi.org/10.1109/ICASSP.2009.4960494 | en_US |
| dc.rights | Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use. | en_US |
| dc.source | IEEE | en_US |
| dc.subject | Weighted Finite State Transducers | en_US |
| dc.subject | Spoken Term Detection | en_US |
| dc.subject | Speech Recognition | en_US |
| dc.subject | Speech Indexing and Retrieval | en_US |
| dc.title | Effect of pronunciations on OOV queries in spoken term detection | en_US |
| dc.type | Article | en_US |
| dc.identifier.citation | Can, D. et al. “Effect of pronounciations on OOV queries in spoken term detection.” Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on. 2009. 3957-3960. Can, D. et al. “Effect of pronounciations on OOV queries in spoken term detection.” Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on. 2009. 3957-3960. © Copyright 2009 IEEE | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | |
| dc.contributor.approver | Cooper, Erica | |
| dc.contributor.mitauthor | Cooper, Erica | |
| dc.relation.journal | Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2009 | en_US |
| dc.eprint.version | Final published version | en_US |
| dc.type.uri | http://purl.org/eprint/type/JournalArticle | en_US |
| eprint.status | http://purl.org/eprint/status/PeerReviewed | en_US |
| dspace.orderedauthors | Can, Dogan; Cooper, Erica; Sethy, Abhinav; White, Chris; Ramabhadran, Bhuvana; Saraclar, Murat | en |
| mit.license | PUBLISHER_POLICY | en_US |
| mit.metadata.status | Complete | |