Learning a Mixture of Gaussians via Mixed-Integer Optimization

Bandi, Hari; Bertsimas, Dimitris; Mazumder, Rahul

dc.contributor.author	Bandi, Hari
dc.contributor.author	Bertsimas, Dimitris
dc.contributor.author	Mazumder, Rahul
dc.date.accessioned	2021-10-27T20:35:50Z
dc.date.available	2021-10-27T20:35:50Z
dc.date.issued	2019
dc.identifier.uri	https://hdl.handle.net/1721.1/136540
dc.description.abstract	We consider the problem of estimating the parameters of a multivariate Gaussian mixture model (GMM) given access to n samples that are believed to have come from a mixture of multiple subpopulations. State-of-the-art algorithms for recovering these parameters use heuristics either to maximize the log-likelihood of the sample or to fit the first few moments of the GMM to the sample moments. In contrast, we present here a novel mixed-integer optimization (MIO) formulation that optimally recovers the parameters of the GMM by minimizing a discrepancy measure (either the Kolmogorov–Smirnov or the total variation distance) between the empirical distribution function and the distribution function of the GMM whenever the mixture component weights are known. We also present an algorithm for multidimensional data that optimally recovers the corresponding means and covariance matrices. We show that the MIO approaches are practically solvable in minutes for data sets with n in the tens of thousands and achieve an average improvement of 60%–70% and 50%–60% on mean absolute percentage error in estimating the means and the covariance matrices, respectively, over the expectation–maximization (EM) algorithm, independent of the sample size n. As the separation of the Gaussians decreases and, correspondingly, the problem becomes more difficult, the edge in performance in favor of the MIO methods widens. Finally, we also show that the MIO methods outperform the EM algorithm with an average improvement of 4%–5% on the out-of-sample accuracy for real-world data sets.
dc.language.iso	en
dc.publisher	Institute for Operations Research and the Management Sciences (INFORMS)
dc.relation.isversionof	10.1287/IJOO.2018.0009
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/4.0/
dc.source	Other repository
dc.title	Learning a Mixture of Gaussians via Mixed-Integer Optimization
dc.type	Article
dc.contributor.department	Massachusetts Institute of Technology. Operations Research Center
dc.contributor.department	Sloan School of Management
dc.relation.journal	INFORMS Journal on Optimization
dc.eprint.version	Original manuscript
dc.type.uri	http://purl.org/eprint/type/JournalArticle
eprint.status	http://purl.org/eprint/status/NonPeerReviewed
dc.date.updated	2021-02-05T19:33:00Z
dspace.orderedauthors	Bandi, H; Bertsimas, D; Mazumder, R
dspace.date.submission	2021-02-05T19:33:02Z
mit.journal.volume	1
mit.journal.issue	3
mit.license	OPEN_ACCESS_POLICY
mit.metadata.status	Authority Work and Publication Information Needed

Files in this item

Name: 942258eee96954f29de26f32132240 ...
Size: 542.3Kb
Format: PDF
Description: Submitted version

This item appears in the following Collection(s)

MIT Open Access Articles