| dc.contributor.advisor | James Glass. | en_US |
| dc.contributor.author | Nadeem, Moin. | en_US |
| dc.contributor.other | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science. | en_US |
| dc.date.accessioned | 2021-05-24T19:52:28Z | |
| dc.date.available | 2021-05-24T19:52:28Z | |
| dc.date.copyright | 2021 | en_US |
| dc.date.issued | 2021 | en_US |
| dc.identifier.uri | https://hdl.handle.net/1721.1/130705 | |
| dc.description | Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, February, 2021 | en_US |
| dc.description | Cataloged from the official PDF of thesis. | en_US |
| dc.description | Includes bibliographical references (pages 107-108). | en_US |
| dc.description.abstract | In the past several years, language modeling has made significant advances on artificial benchmarks. However, despite these advancements, language models still face significant issues when deployed in real-world settings. In particular, these models tend to hallucinate facts and demonstrate significant harmful societal biases that render them impractical in the real-world. This thesis introduces datasets, models, and methodologies for studying how language models incorporate world factuality into their decision making processes. First, I study how neural language models can be used to prove or disprove facts. Motivated by the results, I subsequently study how the choice of training tasks affects the stance detection model. In order to study the acquisition of harmful knowledge, I build a dataset to probe models for their societal stereotypes. Finally, I extend this evaluation to a generative setting, and study how the choice of sampling algorithm affects model factuality. Taken together, this thesis provides a comprehensive analysis of how language models capture world factuality via the pre-training process. | en_US |
| dc.description.statementofresponsibility | by Moin Nadeem. | en_US |
| dc.format.extent | 118 pages | en_US |
| dc.language.iso | eng | en_US |
| dc.publisher | Massachusetts Institute of Technology | en_US |
| dc.rights | MIT theses may be protected by copyright. Please reuse MIT thesis content according to the MIT Libraries Permissions Policy, which is available through the URL provided. | en_US |
| dc.rights.uri | http://dspace.mit.edu/handle/1721.1/7582 | en_US |
| dc.subject | Electrical Engineering and Computer Science. | en_US |
| dc.title | On factuality in neural language models | en_US |
| dc.type | Thesis | en_US |
| dc.description.degree | M. Eng. | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
| dc.identifier.oclc | 1251800584 | en_US |
| dc.description.collection | M.Eng. Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science | en_US |
| dspace.imported | 2021-05-24T19:52:28Z | en_US |
| mit.thesis.degree | Master | en_US |
| mit.thesis.department | EECS | en_US |