Show simple item record

dc.contributor.authorSusiluoto, Jouni
dc.contributor.authorSpantini, Alessio
dc.contributor.authorHaario, Heikki
dc.contributor.authorHärkönen, Teemu
dc.contributor.authorMarzouk, Youssef
dc.date.accessioned2021-10-27T20:23:39Z
dc.date.available2021-10-27T20:23:39Z
dc.date.issued2020
dc.identifier.urihttps://hdl.handle.net/1721.1/135484
dc.description.abstract© 2020 Author(s). Satellite remote sensing provides a global view to processes on Earth that has unique benefits compared to making measurements on the ground, such as global coverage and enormous data volume. The typical downsides are spatial and temporal gaps and potentially low data quality. Meaningful statistical inference from such data requires overcoming these problems and developing efficient and robust computational tools. We design and implement a computationally efficient multi-scale Gaussian process (GP) software package, satGP, geared towards remote sensing applications. The software is able to handle problems of enormous sizes and to compute marginals and sample from the random field conditioning on at least hundreds of millions of observations. This is achieved by optimizing the computation by, e.g., randomization and splitting the problem into parallel local subproblems which aggressively discard uninformative data. We describe the mean function of the Gaussian process by approximating marginals of a Markov random field (MRF). Variability around the mean is modeled with a multi-scale covariance kernel, which consists of Matérn, exponential, and periodic components. We also demonstrate how winds can be used to inform covariances locally. The covariance kernel parameters are learned by calculating an approximate marginal maximum likelihood estimate, and the validity of both the multi-scale approach and the method used to learn the kernel parameters is verified in synthetic experiments. We apply these techniques to a moderate size ozone data set produced by an atmospheric chemistry model and to the very large number of observations retrieved from the Orbiting Carbon Observatory 2 (OCO-2) satellite. The satGP software is released under an open-source license.
dc.language.isoen
dc.publisherCopernicus GmbH
dc.relation.isversionof10.5194/GMD-13-3439-2020
dc.rightsCreative Commons Attribution 4.0 International license
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.sourceCopernicus Publications
dc.titleEfficient multi-scale Gaussian process regression for massive remote sensing data with satGP v0.1.2
dc.typeArticle
dc.contributor.departmentMassachusetts Institute of Technology. Department of Aeronautics and Astronautics
dc.relation.journalGeoscientific Model Development
dc.eprint.versionFinal published version
dc.type.urihttp://purl.org/eprint/type/JournalArticle
eprint.statushttp://purl.org/eprint/status/PeerReviewed
dc.date.updated2021-05-03T15:01:46Z
dspace.orderedauthorsSusiluoto, J; Spantini, A; Haario, H; Härkönen, T; Marzouk, Y
dspace.date.submission2021-05-03T15:01:53Z
mit.journal.volume13
mit.journal.issue7
mit.licensePUBLISHER_CC
mit.metadata.statusAuthority Work and Publication Information Needed


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record