Pareto-Optimal Clustering with the Primal Deterministic Information Bottleneck

Tan, Andrew K.; Tegmark, Max; Chuang, Isaac L.

dc.contributor.author	Tan, Andrew K.
dc.contributor.author	Tegmark, Max
dc.contributor.author	Chuang, Isaac L.
dc.date.accessioned	2022-06-10T13:07:43Z
dc.date.available	2022-06-10T13:07:43Z
dc.date.issued	2022-05-30
dc.identifier.uri	https://hdl.handle.net/1721.1/142922
dc.description.abstract	At the heart of both lossy compression and clustering is a trade-off between the fidelity and size of the learned representation. Our goal is to map out and study the Pareto frontier that quantifies this trade-off. We focus on the optimization of the Deterministic Information Bottleneck (DIB) objective over the space of hard clusterings. To this end, we introduce the <i>primal</i> DIB problem, which we show results in a much richer frontier than its previously studied Lagrangian relaxation when optimized over discrete search spaces. We present an algorithm for mapping out the Pareto frontier of the primal DIB trade-off that is also applicable to other two-objective clustering problems. We study general properties of the Pareto frontier, and we give both analytic and numerical evidence for logarithmic sparsity of the frontier in general. We provide evidence that our algorithm has polynomial scaling despite the super-exponential search space, and additionally, we propose a modification to the algorithm that can be used where sampling noise is expected to be significant. Finally, we use our algorithm to map the DIB frontier of three different tasks: compressing the English alphabet, extracting informative color classes from natural images, and compressing a group theory-inspired dataset, revealing interesting features of frontier, and demonstrating how the structure of the frontier can be used for model selection with a focus on points previously hidden by the cloak of the convex hull.	en_US
dc.publisher	Multidisciplinary Digital Publishing Institute	en_US
dc.relation.isversionof	http://dx.doi.org/10.3390/e24060771	en_US
dc.rights	Creative Commons Attribution	en_US
dc.rights.uri	https://creativecommons.org/licenses/by/4.0	en_US
dc.source	Multidisciplinary Digital Publishing Institute	en_US
dc.title	Pareto-Optimal Clustering with the Primal Deterministic Information Bottleneck	en_US
dc.type	Article	en_US
dc.identifier.citation	Entropy 24 (6): 771 (2022)	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Physics
dc.contributor.department	Massachusetts Institute of Technology. Research Laboratory of Electronics
dc.identifier.mitlicense	PUBLISHER_CC
dc.eprint.version	Final published version	en_US
dc.type.uri	http://purl.org/eprint/type/JournalArticle	en_US
eprint.status	http://purl.org/eprint/status/PeerReviewed	en_US
dc.date.updated	2022-06-09T13:40:40Z
dspace.date.submission	2022-06-09T13:40:40Z
mit.license	PUBLISHER_CC
mit.metadata.status	Authority Work and Publication Information Needed	en_US

Files in this item

Name:: entropy-24-00771-v2.pdf
Size:: 9.770Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record