Show simple item record

dc.contributor.authorZhang, Jiaqi
dc.contributor.authorCammarata, Louis
dc.contributor.authorSquires, Chandler
dc.contributor.authorSapsis, Themistoklis P.
dc.contributor.authorUhler, Caroline
dc.date.accessioned2024-04-18T17:10:24Z
dc.date.available2024-04-18T17:10:24Z
dc.date.issued2023-10-02
dc.identifier.issn2522-5839
dc.identifier.urihttps://hdl.handle.net/1721.1/154216
dc.description.abstractSequential experimental design to discover interventions that achieve a desired outcome is a key problem in various domains including science, engineering and public policy. When the space of possible interventions is large, making an exhaustive search infeasible, experimental design strategies are needed. In this context, encoding the causal relationships between the variables, and thus the effect of interventions on the system, is critical for identifying desirable interventions more efficiently. Here we develop a causal active learning strategy to identify interventions that are optimal, as measured by the discrepancy between the post-interventional mean of the distribution and a desired target mean. The approach employs a Bayesian update for the causal model and prioritizes interventions using a carefully designed, causally informed acquisition function. This acquisition function is evaluated in closed form, allowing for fast optimization. The resulting algorithms are theoretically grounded with information-theoretic bounds and provable consistency results for linear causal models with known causal graph. We apply our approach to both synthetic data and single-cell transcriptomic data from Perturb–CITE-sequencing experiments to identify optimal perturbations that induce a specific cell-state transition. The causally informed acquisition function generally outperforms existing criteria, allowing for optimal intervention design with fewer but carefully selected samples.en_US
dc.language.isoen
dc.publisherSpringer Science and Business Media LLCen_US
dc.relation.isversionof10.1038/s42256-023-00719-0en_US
dc.rightsArticle is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.en_US
dc.sourceSpringer Science and Business Media LLCen_US
dc.subjectArtificial Intelligenceen_US
dc.subjectComputer Networks and Communicationsen_US
dc.subjectComputer Vision and Pattern Recognitionen_US
dc.subjectHuman-Computer Interactionen_US
dc.subjectSoftwareen_US
dc.titleActive learning for optimal intervention design in causal modelsen_US
dc.typeArticleen_US
dc.identifier.citationZhang, J., Cammarata, L., Squires, C. et al. Active learning for optimal intervention design in causal models. Nat Mach Intell 5, 1066–1075 (2023).en_US
dc.contributor.departmentMassachusetts Institute of Technology. Laboratory for Information and Decision Systems
dc.contributor.departmentStatistics and Data Science Center (Massachusetts Institute of Technology)
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
dc.contributor.departmentMassachusetts Institute of Technology. Institute for Data, Systems, and Society
dc.relation.journalNature Machine Intelligenceen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2024-04-18T17:05:58Z
dspace.orderedauthorsZhang, J; Cammarata, L; Squires, C; Sapsis, TP; Uhler, Cen_US
dspace.date.submission2024-04-18T17:06:01Z
mit.journal.volume5en_US
mit.journal.issue10en_US
mit.licensePUBLISHER_POLICY
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record