| dc.contributor.advisor | Tenenbaum, Joshua B. | |
| dc.contributor.advisor | Mansinghka, Vikash | |
| dc.contributor.author | Loula Guimarães de Campos, João | |
| dc.date.accessioned | 2026-03-16T15:44:03Z | |
| dc.date.available | 2026-03-16T15:44:03Z | |
| dc.date.issued | 2025-09 | |
| dc.date.submitted | 2025-10-15T16:19:49.308Z | |
| dc.identifier.uri | https://hdl.handle.net/1721.1/165128 | |
| dc.description.abstract | This thesis addresses these challenges for the field of data science, developing probabilistic programming methods that enable rational AI agents in that domain. The work is organized into two parts: Part I introduces GenLM and Adaptive Weighted Rejection Sampling for translating natural language instructions into structured programs with both syntactic and semantic constraints, outperforming existing approaches across a number of domains. Part II develops Bayesian generative models for tabular data that can answer a wide range of questions, yield stable inferences across subpopulations of different sizes, and scale to hundreds of millions of rows on GPUs; as well as early work on Large Population Models that unify heterogeneous datasets. Together, these contributions provide first steps towards a unified framework for creating AI agents that can rationally formalize and answer questions about data. | |
| dc.publisher | Massachusetts Institute of Technology | |
| dc.rights | In Copyright - Educational Use Permitted | |
| dc.rights | Copyright retained by author(s) | |
| dc.rights.uri | https://rightsstatements.org/page/InC-EDU/1.0/ | |
| dc.title | Scaling Bayesian inference for generative models via probabilistic programming | |
| dc.type | Thesis | |
| dc.description.degree | Ph.D. | |
| dc.contributor.department | Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences | |
| dc.identifier.orcid | https://orcid.org/0000-0002-2018-2564 | |
| mit.thesis.degree | Doctoral | |
| thesis.degree.name | Doctor of Philosophy | |