Show simple item record

dc.contributor.advisorJames Glass and Suwon Shon.en_US
dc.contributor.authorRivera, Gabrielle Cristina.en_US
dc.contributor.otherMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Science.en_US
dc.date.accessioned2019-12-05T18:06:15Z
dc.date.available2019-12-05T18:06:15Z
dc.date.copyright2019en_US
dc.date.issued2019en_US
dc.identifier.urihttps://hdl.handle.net/1721.1/123151
dc.descriptionThis electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.en_US
dc.descriptionThesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2019en_US
dc.descriptionCataloged from student-submitted PDF version of thesis.en_US
dc.descriptionIncludes bibliographical references (pages 61-65).en_US
dc.description.abstractMultilingual and multidialectal speakers commonly switch between languages and dialects while speaking, leading to the linguistic phenomenon known as code-switching. Most acoustic systems, such as automatic speech recognition systems, are unable to robustly handle input with unexpected language or dialect switching. Generally, this results from both a lack of available corpora and an increase in the difficulty of the task when applied to code-switching data. This thesis focuses on constructing an acoustic-based model to gather code-switching information from utterances containing Modern Standard Arabic and dialectal Arabic. We utilize the multidialectal GALE Arabic dataset to classify the code-switching style of an utterance and later to detect the location of code-switching within an utterance. We discuss the failed classification schemes and detection methods, providing analysis for why these approaches were unsuccessful. We also present an alignment-free classification scheme which is able to detect locations within an utterance where dialectal Arabic is likely being spoken. This method presents a marked improvement over the proposed baseline in average detection miss rate. By utilizing this information, Arabic acoustic systems will be more robust to dialectal shifts within a given input.en_US
dc.description.statementofresponsibilityby Gabrielle Cristina Rivera.en_US
dc.format.extent65 pagesen_US
dc.language.isoengen_US
dc.publisherMassachusetts Institute of Technologyen_US
dc.rightsMIT theses are protected by copyright. They may be viewed, downloaded, or printed from this source but further reproduction or distribution in any format is prohibited without written permission.en_US
dc.rights.urihttp://dspace.mit.edu/handle/1721.1/7582en_US
dc.subjectElectrical Engineering and Computer Science.en_US
dc.titleAutomatic detection of code-switching in Arabic dialectsen_US
dc.typeThesisen_US
dc.description.degreeM. Eng.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.identifier.oclc1128868510en_US
dc.description.collectionM.Eng. Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Scienceen_US
dspace.imported2019-12-05T18:06:14Zen_US
mit.thesis.degreeMasteren_US
mit.thesis.departmentEECSen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record