Effect of pronunciations on OOV queries in spoken term detection

Can, Dogan; Cooper, Erica; Sethy, Abhinav; White, Chris; Ramabhadran, Bhuvana; Saraclar, Murat

Author(s)

Can, Dogan; Cooper, Erica; Sethy, Abhinav; White, Chris; Ramabhadran, Bhuvana; ... Show more

DownloadCan-2009-Effect of pronunciations on OOV queries in spoken term detection.pdf (220.2Kb)

PUBLISHER_POLICY

Terms of use

Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.

Metadata

Show full item record

Abstract

The spoken term detection (STD) task aims to return relevant segments from a spoken archive that contain the query terms whether or not they are in the system vocabulary. This paper focuses on pronunciation modeling for out-of-vocabulary (OOV) terms which frequently occur in STD queries. The STD system described in this paper indexes word-level and sub-word level lattices or confusion networks produced by an LVCSR system using weighted finite state transducers (WFST).We investigate the inclusion of n-best pronunciation variants for OOV terms (obtained from letter-to-sound rules) into the search and present the results obtained by indexing confusion networks as well as lattices. The following observations are worth mentioning: phone indexes generated from sub-words represent OOVs well and too many variants for the OOV terms degrade performance if pronunciations are not weighted.

Date issued

2009-05

URI

http://hdl.handle.net/1721.1/58936

Department

Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science

Journal

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2009

Publisher

Institute of Electrical and Electronics Engineers

Citation

Can, D. et al. “Effect of pronounciations on OOV queries in spoken term detection.” Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on. 2009. 3957-3960. Can, D. et al. “Effect of pronounciations on OOV queries in spoken term detection.” Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on. 2009. 3957-3960. © Copyright 2009 IEEE

Version: Final published version

Other identifiers

INSPEC Accession Number: 10700575

ISBN

978-1-4244-2353-8

ISSN

1520-6149

Keywords

Weighted Finite State Transducers, Spoken Term Detection, Speech Recognition, Speech Indexing and Retrieval

Collections

MIT Open Access Articles

DSpace@MIT