PASS-GLM: Polynomial approximate sufficient statistics for scalable Bayesian GLM inference

Huggins, Jonathan H.; Broderick, Tamara A

dc.contributor.author	Huggins, Jonathan H.
dc.contributor.author	Broderick, Tamara A
dc.date.accessioned	2020-12-10T18:13:44Z
dc.date.available	2020-12-10T18:13:44Z
dc.date.issued	2017-12
dc.identifier.issn	1049-5258
dc.identifier.uri	https://hdl.handle.net/1721.1/128777
dc.description.abstract	Generalized linear models (GLMs) - such as logistic regression, Poisson regression, and robust regression - provide interpretable models for diverse data types. Probabilistic approaches, particularly Bayesian ones, allow coherent estimates of uncertainty, incorporation of prior information, and sharing of power across experiments via hierarchical models. In practice, however, the approximate Bayesian methods necessary for inference have either failed to scale to large data sets or failed to provide theoretical guarantees on the quality of inference. We propose a new approach based on constructing polynomial approximate sufficient statistics for GLMs (PASS-GLM). We demonstrate that our method admits a simple algorithm as well as trivial streaming and distributed extensions that do not compound error across computations. We provide theoretical guarantees on the quality of point (MAP) estimates, the approximate posterior, and posterior mean and uncertainty estimates. We validate our approach empirically in the case of logistic regression using a quadratic approximation and show competitive performance with stochastic gradient descent, MCMC, and the Laplace approximation in terms of speed and multiple measures of accuracy - including on an advertising data set with 40 million data points and 20,000 covariates.	en_US
dc.description.sponsorship	United States. Office of Naval Research (Grant N00014-17-1-2072)	en_US
dc.description.sponsorship	United States. Office of Naval Research. Multidisciplinary University Research Initiative (Grant N00014-11-1-0688)	en_US
dc.language.iso	en
dc.relation.isversionof	https://papers.nips.cc/paper/2017/hash/07811dc6c422334ce36a09ff5cd6fe71-Abstract.html	en_US
dc.rights	Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.	en_US
dc.source	Neural Information Processing Systems (NIPS)	en_US
dc.title	PASS-GLM: Polynomial approximate sufficient statistics for scalable Bayesian GLM inference	en_US
dc.type	Article	en_US
dc.identifier.citation	Huggins, Jonathan H., Ryan P. Adams and Tamara Broderick. “PASS-GLM: Polynomial approximate sufficient statistics for scalable Bayesian GLM inference.” Advances in Neural Information Processing Systems, 2017-December (December 2017) © 2017 The Author(s)	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science	en_US
dc.contributor.department	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory	en_US
dc.relation.journal	Advances in Neural Information Processing Systems	en_US
dc.eprint.version	Final published version	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dc.date.updated	2020-12-03T17:53:56Z
dspace.orderedauthors	Huggins, JH; Adams, RP; Broderick, T	en_US
dspace.date.submission	2020-12-03T17:53:58Z
mit.journal.volume	2017-December	en_US
mit.license	PUBLISHER_POLICY
mit.metadata.status	Complete
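
Illustrative sketch (not part of the record): the quadratic approximation for logistic regression mentioned in the abstract can be pictured roughly as follows. This is a minimal sketch under stated assumptions, not the authors' implementation: labels are taken to be in {-1, +1}, the prior is assumed to be an isotropic Gaussian, the quadratic is obtained by a least-squares fit at Chebyshev nodes (standing in for whatever polynomial construction the paper uses), and the function names, the interval radius of 4.0, and the prior variance are all hypothetical. With phi(s) = -log(1 + exp(-s)) approximated by b0 + b1*s + b2*s^2, the log-likelihood depends on the data only through sum(y_n * x_n) and sum(x_n x_n^T), which can be accumulated block by block and added together.

```python
import numpy as np

def quadratic_coeffs(radius=4.0, num_nodes=200):
    """Least-squares quadratic fit to phi(s) = -log(1 + exp(-s)) on [-radius, radius].

    Assumption: radius and num_nodes are illustrative choices, not values from the paper.
    """
    k = np.arange(num_nodes)
    # Chebyshev nodes rescaled to [-radius, radius]
    nodes = radius * np.cos(np.pi * (2 * k + 1) / (2 * num_nodes))
    phi = -np.logaddexp(0.0, -nodes)          # numerically stable logistic log-likelihood
    b2, b1, b0 = np.polyfit(nodes, phi, 2)    # polyfit returns highest degree first
    return b0, b1, b2

def summarize(X, y):
    """Approximate sufficient statistics for one block of data (y in {-1, +1})."""
    return y @ X, X.T @ X, len(y)

def merge(a, b):
    """Combine statistics from two blocks; plain addition, so no error compounds."""
    return a[0] + b[0], a[1] + b[1], a[2] + b[2]

def gaussian_posterior(stats, coeffs, prior_var=1.0):
    """Gaussian approximate posterior under an assumed N(0, prior_var * I) prior."""
    t1, t2, _ = stats
    _, b1, b2 = coeffs
    d = t1.shape[0]
    # log posterior ~ -theta'theta/(2 prior_var) + b1 * t1'theta + b2 * theta' t2 theta
    precision = np.eye(d) / prior_var - 2.0 * b2 * t2
    cov = np.linalg.inv(precision)
    mean = cov @ (b1 * t1)
    return mean, cov

# Usage on synthetic data processed as two "streamed" blocks.
rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 5))
theta_true = rng.normal(size=5)
y = np.where(rng.random(1000) < 1.0 / (1.0 + np.exp(-X @ theta_true)), 1.0, -1.0)

stats = merge(summarize(X[:400], y[:400]), summarize(X[400:], y[400:]))
post_mean, post_cov = gaussian_posterior(stats, quadratic_coeffs())
```

Because the per-block statistics are simply summed, the same three functions cover the single-machine, streaming, and distributed cases in this sketch; only the order in which blocks arrive changes.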

Files in this item

Name: NIPS-2017-pass-glm-polynomial- ...
Size: 627.5Kb
Format: PDF
Description: Published version

This item appears in the following Collection(s)

MIT Open Access Articles