dc.contributor.advisor | Tommi S. Jaakkola and David K. Gifford. | en_US |
dc.contributor.author | Mueller, Jonas Weylin | en_US |
dc.contributor.other | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science. | en_US |
dc.date.accessioned | 2015-11-09T19:53:26Z | |
dc.date.available | 2015-11-09T19:53:26Z | |
dc.date.copyright | 2015 | en_US |
dc.date.issued | 2015 | en_US |
dc.identifier.uri | http://hdl.handle.net/1721.1/99857 | |
dc.description | Thesis: S.M. in Computer Science and Engineering, Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2015. | en_US |
dc.description | Cataloged from PDF version of thesis. | en_US |
dc.description | Includes bibliographical references (pages 75-79). | en_US |
dc.description.abstract | We present a nonparametric framework for modeling an evolving sequence of (estimated) probability distributions which distinguishes the effects of sequential progression on the observed distribution from extraneous sources of noise (i.e. latent variables which perturb the distributions independently of the sequence-index). To discriminate between these two types of variation, our methods leverage the underlying assumption that the effects of sequential-progression follow a consistent trend. Our methods are motivated by the recent rise of single-cell RNA-sequencing time course experiments, in which an important analytic goal is the identification of genes relevant to the progression of a biological process of interest at cellular resolution. As existing statistical tools are not suited for this task, we introduce a new regression model for (ordinal-value , univariate-distribution) covariate-response pairs where the class of regression-functions reflects coherent changes to the distributions over increasing levels of the covariate, a concept we refer to as trends in distributions. Through simulation study and extensive application of our ideas to data from recent single-cell gene-expression time course experiments, we demonstrate numerous strengths of our framework. Finally, we characterize both theoretical properties of the proposed estimators and the generality of our trend-assumption across diverse types of underlying sequential-progression effects, thus highlighting the utility of our framework for a wide variety of other applications involving the analysis of distributions with associated ordinal labels. | en_US |
dc.description.statementofresponsibility | by Jonas Weylin Mueller. | en_US |
dc.format.extent | 97 pages | en_US |
dc.language.iso | eng | en_US |
dc.publisher | Massachusetts Institute of Technology | en_US |
dc.rights | M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. | en_US |
dc.rights.uri | http://dspace.mit.edu/handle/1721.1/7582 | en_US |
dc.subject | Electrical Engineering and Computer Science. | en_US |
dc.title | Modeling temporally-regulated effects on distributions | en_US |
dc.type | Thesis | en_US |
dc.description.degree | S.M. in Computer Science and Engineering | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | |
dc.identifier.oclc | 927701697 | en_US |