Show simple item record

dc.contributor.advisorPhilippe Rigollet.en_US
dc.contributor.authorWeed, Jonathan Daniel.en_US
dc.contributor.otherMassachusetts Institute of Technology. Department of Mathematics.en_US
dc.date.accessioned2019-09-16T22:34:56Z
dc.date.available2019-09-16T22:34:56Z
dc.date.copyright2019en_US
dc.date.issued2019en_US
dc.identifier.urihttps://hdl.handle.net/1721.1/122183
dc.descriptionThesis: Ph. D. in Mathematics and Statistics, Massachusetts Institute of Technology, Department of Mathematics, 2019en_US
dc.descriptionCataloged from PDF version of thesis.en_US
dc.descriptionIncludes bibliographical references (pages 169-181).en_US
dc.description.abstractHow should you analyze complicated data? Faced with scans of handwritten digits, noisy snapshots of large biomolecules, three-dimensional LIDAR data, or corrupted social networks, what practical techniques and theoretical guarantees does the statistician have at her disposal? This thesis develops new theory for statistical problems involving data with geometric structure of this kind. First, we study the Wasserstein distance, a metric on the space of probability measures on an arbitrary metric space. We prove sharp rates of convergence for empirical measures in Wasserstein distance on sufficiently regular compact metric spaces, improving on a line of work going back to Dudley (1969). We give the first nearly-optimal minimax lower bounds for the problem of estimating the Wasserstein distance between two measures, and we prove much better rates can be obtained under three different structural assumptions on the measures. These assumptions, inspired by practice and theory, reveal novel statistical features of the Wasserstein distance. Second, we consider data corrupted by group transformations. These problems are motivated by cryo-electron microscopy, an important technique in structural biology, the use of which requires reconstructing the structure of biological macromolecules on the basis of noisy, randomly rotated images. We prove the first minimax rates of estimation for a two-dimensional version of this problem. Along the way, we develop a general theory for problems of this kind, applicable to arbitrary compact groups acting on R[superscript d], and provide a novel analysis of the maximum-likelihood estimator for Gaussian mixtures with algebraic structure.en_US
dc.description.statementofresponsibilityby Jonathan Daniel Weed.en_US
dc.format.extent181 pagesen_US
dc.language.isoengen_US
dc.publisherMassachusetts Institute of Technologyen_US
dc.rightsMIT theses are protected by copyright. They may be viewed, downloaded, or printed from this source but further reproduction or distribution in any format is prohibited without written permission.en_US
dc.rights.urihttp://dspace.mit.edu/handle/1721.1/7582en_US
dc.subjectMathematics.en_US
dc.titleStatistical problems in transport and alignmenten_US
dc.typeThesisen_US
dc.description.degreePh. D. in Mathematics and Statisticsen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Mathematicsen_US
dc.identifier.oclc1117775024en_US
dc.description.collectionPh.D.inMathematicsandStatistics Massachusetts Institute of Technology, Department of Mathematicsen_US
dspace.imported2019-09-16T22:34:54Zen_US
mit.thesis.degreeDoctoralen_US
mit.thesis.departmentMathen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record