Rule based learning of word pronunciations from training corpora

Molnár, Lajos, 1975-

dc.contributor.advisor	Christopher M. Schmandt.	en_US
dc.contributor.author	Molnár, Lajos, 1975-	en_US
dc.contributor.other	Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science.	en_US
dc.date.accessioned	2009-10-01T16:01:42Z
dc.date.available	2009-10-01T16:01:42Z
dc.date.copyright	1998	en_US
dc.date.issued	1998	en_US
dc.identifier.uri	http://hdl.handle.net/1721.1/47906
dc.description	Thesis (M.Eng. and S.B.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 1998.	en_US
dc.description	Includes bibliographical references (leaves 83-85).	en_US
dc.description.abstract	This paper describes a text-to-pronunciation system using transformation-based error-driven learning for speech-recognition purposes. Efforts have been made to make the system language independent, automatic, robust and able to generate multiple pronunciations. The learner proposes initial pronunciations for the words and finds transformations that bring the pronunciations closer to the correct pronunciations. The pronunciation generator works by applying the transformations to a similar initial pronunciation. A dynamic aligner is used for the necessary alignment of phonemes and graphemes. The pronunciations are scored using a weighed string edit distance. Optimizations were made to make the learner and the rule applier fast. The system achieves 73.9% exact word accuracy with multiple pronunciations, 82.3% word accuracy with one correct pronunciation, and 95.3% phoneme accuracy for English words. For proper names, it achieves 50.5% exact word accuracy, 69.2% word accuracy, and 92.0% phoneme accuracy, which outperforms the compared neural network approach.	en_US
dc.description.statementofresponsibility	Lajos Molnár.	en_US
dc.format.extent	85 leaves	en_US
dc.language.iso	eng	en_US
dc.publisher	Massachusetts Institute of Technology	en_US
dc.rights	M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission.	en_US
dc.rights.uri	http://dspace.mit.edu/handle/1721.1/7582	en_US
dc.subject	Electrical Engineering and Computer Science.	en_US
dc.title	Rule based learning of word pronunciations from training corpora	en_US
dc.type	Thesis	en_US
dc.description.degree	M.Eng.and S.B.	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
dc.identifier.oclc	48205509	en_US

Files in this item

Name:: 48205509-MIT.pdf
Size:: 6.369Mb
Format:: PDF
Description:: Full printable version

View/Open

This item appears in the following Collection(s)

Graduate Theses

Show simple item record