dc.contributor.author: Futrell, Richard Landy Jones
dc.contributor.author: Albright, Adam
dc.contributor.author: Graff, Peter
dc.contributor.author: O’Donnell, Timothy J.
dc.date.accessioned: 2020-11-18T22:54:02Z
dc.date.available: 2020-11-18T22:54:02Z
dc.date.issued: 2017-12
dc.identifier.issn: 2307-387X
dc.identifier.uri: https://hdl.handle.net/1721.1/128532
dc.description.abstract: We present a probabilistic model of phonotactics, the set of well-formed phoneme sequences in a language. Unlike most computational models of phonotactics (Hayes and Wilson, 2008; Goldsmith and Riggle, 2012), we take a fully generative approach, modeling a process where forms are built up out of subparts by phonologically-informed structure building operations. We learn an inventory of subparts by applying stochastic memoization (Johnson et al., 2007; Goodman et al., 2008) to a generative process for phonemes structured as an and-or graph, based on concepts of feature hierarchy from generative phonology (Clements, 1985; Dresher, 2009). Subparts are combined in a way that allows tier-based feature interactions. We evaluate our models’ ability to capture phonotactic distributions in the lexicons of 14 languages drawn from the WOLEX corpus (Graff, 2012). Our full model robustly assigns higher probabilities to held-out forms than a sophisticated N-gram model for all languages. We also present novel analyses that probe model behavior in more detail. [en_US]
dc.description.sponsorship: National Science Foundation (Grant 1551543) [en_US]
dc.language.iso: en
dc.publisher: MIT Press [en_US]
dc.relation.isversionof: http://dx.doi.org/10.1162/tacl_a_00047 [en_US]
dc.rights: Creative Commons Attribution 4.0 International license [en_US]
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/ [en_US]
dc.source: MIT Press [en_US]
dc.title: A Generative Model of Phonotactics [en_US]
dc.type: Article [en_US]
dc.identifier.citation: Futrell, Richard et al. "A Generative Model of Phonotactics." Transactions of the Association for Computational Linguistics 5 (December 2017): 73-86 © 2017 Association for Computational Linguistics [en_US]
dc.contributor.department: Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences [en_US]
dc.contributor.department: Massachusetts Institute of Technology. Department of Linguistics and Philosophy [en_US]
dc.relation.journal: Transactions of the Association for Computational Linguistics [en_US]
dc.eprint.version: Final published version [en_US]
dc.type.uri: http://purl.org/eprint/type/JournalArticle [en_US]
eprint.status: http://purl.org/eprint/status/PeerReviewed [en_US]
dc.date.updated: 2019-09-25T17:19:31Z
dspace.date.submission: 2019-09-25T17:19:32Z
mit.journal.volume: 5 [en_US]
mit.metadata.status: Complete

