Show simple item record

dc.contributor.advisorTenenbaum, Joshua B.
dc.contributor.advisorMansinghka, Vikash
dc.contributor.authorLoula Guimarães de Campos, João
dc.date.accessioned2026-03-16T15:44:03Z
dc.date.available2026-03-16T15:44:03Z
dc.date.issued2025-09
dc.date.submitted2025-10-15T16:19:49.308Z
dc.identifier.urihttps://hdl.handle.net/1721.1/165128
dc.description.abstractThis thesis addresses these challenges for the field of data science, developing probabilistic programming methods that enable rational AI agents in that domain. The work is organized into two parts: Part I introduces GenLM and Adaptive Weighted Rejection Sampling for translating natural language instructions into structured programs with both syntactic and semantic constraints, outperforming existing approaches across a number of domains. Part II develops Bayesian generative models for tabular data that can answer a wide range of questions, yield stable inferences across subpopulations of different sizes, and scale to hundreds of millions of rows on GPUs; as well as early work on Large Population Models that unify heterogeneous datasets. Together, these contributions provide first steps towards a unified framework for creating AI agents that can rationally formalize and answer questions about data.
dc.publisherMassachusetts Institute of Technology
dc.rightsIn Copyright - Educational Use Permitted
dc.rightsCopyright retained by author(s)
dc.rights.urihttps://rightsstatements.org/page/InC-EDU/1.0/
dc.titleScaling Bayesian inference for generative models via probabilistic programming
dc.typeThesis
dc.description.degreePh.D.
dc.contributor.departmentMassachusetts Institute of Technology. Department of Brain and Cognitive Sciences
dc.identifier.orcidhttps://orcid.org/0000-0002-2018-2564
mit.thesis.degreeDoctoral
thesis.degree.nameDoctor of Philosophy


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record