Show simple item record

dc.contributor.advisorAviv Regev and Eric S. Lander.en_US
dc.contributor.authorCleary, Brian(Brian Lowman)en_US
dc.contributor.otherMassachusetts Institute of Technology. Computational and Systems Biology Program.en_US
dc.date.accessioned2019-11-04T20:20:59Z
dc.date.available2019-11-04T20:20:59Z
dc.date.issued2019en_US
dc.identifier.urihttps://hdl.handle.net/1721.1/122720
dc.descriptionThesis: Ph. D., Massachusetts Institute of Technology, Computational and Systems Biology Program, 2019en_US
dc.descriptionCataloged from PDF version of thesis. "June 2019."en_US
dc.descriptionIncludes bibliographical references.en_US
dc.description.abstractThe development of high-throughput techniques to observe and perturb biological systems has led to remarkable progress in the last several decades. From the tremendous amounts of data being accumulated, new opportunities have emerged, including the possibility of finding latent patterns in high-dimensional variables that are reflective of underlying biological processes. While these methods have led to countless discoveries and innovations, it is clear there is much more we could learn by measuring and perturbing at far greater scales. Here, I advance methods to understand and utilize latent patterns in new types of high-dimensional data. I devise a method of analyzing networks of 'frequency interactions' in 16S/18S time series data, showing that these can be used to identify microbial communities and associated environmental factors.en_US
dc.description.abstractThen, as part of a highly collaborative project, I show how latent patterns in single cell RNA-Seq can be used together with optimal transport analysis to identify cell types and cell type trajectories, regulatory pathways, and cell-cell interactions in a time-course of developmental reprogramming. I then step back to ask a fundamental question: how do we choose which observations and perturbations to make, and how many of each are necessary? I approach this question on the basis of the inherency of latent structure in biology, and on foundational mathematical results concerning the analysis of highly-structured data. I present the beginnings of a framework to formalize how random composite experiments can make biological discovery more efficient by leveraging latent patterns. I first show how to recover individual genomes using covariance patterns in a series of composite (meta-) genomic data.en_US
dc.description.abstractI then describe how random composite measurements and compressed sensing can be used to make gene expression profiling more efficient. Finally, I apply this idea to in situ imaging transcriptomics, demonstrating how many individual gene images can be efficiently recovered from a small number of composite gene images.en_US
dc.description.statementofresponsibilityby Brian Cleary.en_US
dc.format.extent349 pagesen_US
dc.language.isoengen_US
dc.publisherMassachusetts Institute of Technologyen_US
dc.rightsMIT theses are protected by copyright. They may be viewed, downloaded, or printed from this source but further reproduction or distribution in any format is prohibited without written permission.en_US
dc.rights.urihttp://dspace.mit.edu/handle/1721.1/7582en_US
dc.subjectComputational and Systems Biology Program.en_US
dc.titleLeveraging latent patterns in the study of living systemsen_US
dc.typeThesisen_US
dc.description.degreePh. D.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Computational and Systems Biology Programen_US
dc.identifier.oclc1124074039en_US
dc.description.collectionPh.D. Massachusetts Institute of Technology, Computational and Systems Biology Programen_US
dspace.imported2019-11-04T20:20:58Zen_US
mit.thesis.degreeDoctoralen_US
mit.thesis.departmentCSBen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record