dc.contributor.advisor | Christopher M. Schmandt. | en_US |
dc.contributor.author | Molnár, Lajos, 1975- | en_US |
dc.contributor.other | Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. | en_US |
dc.date.accessioned | 2009-10-01T16:01:42Z | |
dc.date.available | 2009-10-01T16:01:42Z | |
dc.date.copyright | 1998 | en_US |
dc.date.issued | 1998 | en_US |
dc.identifier.uri | http://hdl.handle.net/1721.1/47906 | |
dc.description | Thesis (M.Eng. and S.B.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 1998. | en_US |
dc.description | Includes bibliographical references (leaves 83-85). | en_US |
dc.description.abstract | This paper describes a text-to-pronunciation system using transformation-based error-driven learning for speech-recognition purposes. Efforts have been made to make the system language independent, automatic, robust and able to generate multiple pronunciations. The learner proposes initial pronunciations for the words and finds transformations that bring the pronunciations closer to the correct pronunciations. The pronunciation generator works by applying the transformations to a similar initial pronunciation. A dynamic aligner is used for the necessary alignment of phonemes and graphemes. The pronunciations are scored using a weighted string edit distance. Optimizations were made to make the learner and the rule applier fast. The system achieves 73.9% exact word accuracy with multiple pronunciations, 82.3% word accuracy with one correct pronunciation, and 95.3% phoneme accuracy for English words. For proper names, it achieves 50.5% exact word accuracy, 69.2% word accuracy, and 92.0% phoneme accuracy, which outperforms the compared neural network approach. | en_US |
dc.description.statementofresponsibility | Lajos Molnár. | en_US |
dc.format.extent | 85 leaves | en_US |
dc.language.iso | eng | en_US |
dc.publisher | Massachusetts Institute of Technology | en_US |
dc.rights | M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. | en_US |
dc.rights.uri | http://dspace.mit.edu/handle/1721.1/7582 | en_US |
dc.subject | Electrical Engineering and Computer Science. | en_US |
dc.title | Rule based learning of word pronunciations from training corpora | en_US |
dc.type | Thesis | en_US |
dc.description.degree | M.Eng. and S.B. | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | |
dc.identifier.oclc | 48205509 | en_US |