Effect of pronunciations on OOV queries in spoken term detection

Can, Dogan; Cooper, Erica; Sethy, Abhinav; White, Chris; Ramabhadran, Bhuvana; Saraclar, Murat

dc.contributor.author	Can, Dogan
dc.contributor.author	Cooper, Erica
dc.contributor.author	Sethy, Abhinav
dc.contributor.author	White, Chris
dc.contributor.author	Ramabhadran, Bhuvana
dc.contributor.author	Saraclar, Murat
dc.date.accessioned	2010-10-07T15:33:00Z
dc.date.available	2010-10-07T15:33:00Z
dc.date.issued	2009-05
dc.date.submitted	2009-04
dc.identifier.isbn	978-1-4244-2353-8
dc.identifier.issn	1520-6149
dc.identifier.other	INSPEC Accession Number: 10700575
dc.identifier.uri	http://hdl.handle.net/1721.1/58936
dc.description.abstract	The spoken term detection (STD) task aims to return relevant segments from a spoken archive that contain the query terms, whether or not they are in the system vocabulary. This paper focuses on pronunciation modeling for out-of-vocabulary (OOV) terms, which frequently occur in STD queries. The STD system described in this paper indexes word-level and sub-word level lattices or confusion networks produced by an LVCSR system using weighted finite state transducers (WFST). We investigate the inclusion of n-best pronunciation variants for OOV terms (obtained from letter-to-sound rules) into the search and present the results obtained by indexing confusion networks as well as lattices. The following observations are worth mentioning: phone indexes generated from sub-words represent OOVs well, and too many variants for the OOV terms degrade performance if pronunciations are not weighted.	en_US
dc.description.sponsorship	Bogazici University Research Fund	en_US
dc.description.sponsorship	Scientific and Technical Research Council of Turkey (TUBITAK) (BIDEB)	en_US
dc.language.iso	en_US
dc.publisher	Institute of Electrical and Electronics Engineers	en_US
dc.relation.isversionof	http://dx.doi.org/10.1109/ICASSP.2009.4960494	en_US
dc.rights	Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.	en_US
dc.source	IEEE	en_US
dc.subject	Weighted Finite State Transducers	en_US
dc.subject	Spoken Term Detection	en_US
dc.subject	Speech Recognition	en_US
dc.subject	Speech Indexing and Retrieval	en_US
dc.title	Effect of pronunciations on OOV queries in spoken term detection	en_US
dc.type	Article	en_US
dc.identifier.citation	Can, D. et al. “Effect of pronunciations on OOV queries in spoken term detection.” Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on. 2009. 3957-3960. © Copyright 2009 IEEE	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
dc.contributor.approver	Cooper, Erica
dc.contributor.mitauthor	Cooper, Erica
dc.relation.journal	Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2009	en_US
dc.eprint.version	Final published version	en_US
dc.type.uri	http://purl.org/eprint/type/JournalArticle	en_US
eprint.status	http://purl.org/eprint/status/PeerReviewed	en_US
dspace.orderedauthors	Can, Dogan; Cooper, Erica; Sethy, Abhinav; White, Chris; Ramabhadran, Bhuvana; Saraclar, Murat	en
mit.license	PUBLISHER_POLICY	en_US
mit.metadata.status	Complete
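
The abstract's closing observation, that many pronunciation variants hurt unless they are weighted, can be illustrated with a toy sketch. This is not the paper's WFST-based system: the index layout, the function name `score_segments`, the phone strings, and all numbers below are invented for illustration, and the combination rule (summing variant probability times hit posterior) is only one plausible assumption.

```python
from collections import defaultdict


def score_segments(index, variants, weighted=True):
    """Combine index hits over the pronunciation variants of one query term.

    index:    dict mapping a phone string to a list of (segment_id, posterior)
    variants: list of (phone string, pronunciation probability) pairs
    weighted: if False, every variant counts with weight 1.0
    """
    scores = defaultdict(float)
    for phones, prob in variants:
        weight = prob if weighted else 1.0
        for segment_id, posterior in index.get(phones, []):
            scores[segment_id] += weight * posterior
    return dict(scores)


# Toy data, invented for illustration only (hypothetical phone index).
toy_index = {
    "k ae n": [("seg1", 0.9)],
    "k aa n": [("seg2", 0.8)],
}
# Hypothetical 2-best pronunciations with letter-to-sound probabilities.
toy_variants = [("k ae n", 0.75), ("k aa n", 0.25)]

weighted = score_segments(toy_index, toy_variants, weighted=True)
unweighted = score_segments(toy_index, toy_variants, weighted=False)
```

In this toy setting the unweighted scores of the two segments are nearly tied, while the weighted scores keep the hit from the unlikely variant well below the hit from the likely one, which is the intuition behind weighting n-best pronunciations.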

Files in this item

Name: Can-2009-Effect of pronunciations ...
Size: 220.2Kb
Format: PDF


This item appears in the following Collection(s)

MIT Open Access Articles
