Summarized Causal Explanations For Aggregate Views

Youngmann, Brit; Cafarella, Michael; Gilad, Amir; Roy, Sudeepa

dc.contributor.author	Youngmann, Brit
dc.contributor.author	Cafarella, Michael
dc.contributor.author	Gilad, Amir
dc.contributor.author	Roy, Sudeepa
dc.date.accessioned	2024-04-04T17:40:42Z
dc.date.available	2024-04-04T17:40:42Z
dc.date.issued	2024-03-12
dc.identifier.issn	2836-6573
dc.identifier.uri	https://hdl.handle.net/1721.1/154070
dc.description.abstract	SQL queries with group-by and average are frequently used and plotted as bar charts in several data analysis applications. Understanding the reasons behind the results in such an aggregate view may be a highly non-trivial and time-consuming task, especially for large datasets with multiple attributes. Hence, generating automated explanations for aggregate views can allow users to gain better insights into the results while saving time in data analysis. When providing explanations for such views, it is paramount to ensure that they are succinct yet comprehensive, reveal different types of insights that hold for different aggregate answers in the view, and, most importantly, they reflect reality and arm users to make informed data-driven decisions, i.e., the explanations do not only consider correlations but are causal. In this paper, we present CauSumX, a framework for generating summarized causal explanations for the entire aggregate view. Using background knowledge captured in a causal DAG, CauSumX finds the most effective causal treatments for different groups in the view. We formally define the framework and the optimization problem, study its complexity, and devise an efficient algorithm using the Apriori algorithm, LP rounding, and several optimizations. We experimentally show that our system generates useful summarized causal explanations compared to prior work and scales well for large high-dimensional data.	en_US
dc.publisher	Association for Computing Machinery	en_US
dc.relation.isversionof	10.1145/3639328	en_US
dc.rights	Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.	en_US
dc.source	ACM	en_US
dc.title	Summarized Causal Explanations For Aggregate Views	en_US
dc.type	Article	en_US
dc.identifier.citation	Brit Youngmann, Michael Cafarella, Amir Gilad, and Sudeepa Roy. 2024. Summarized Causal Explanations For Aggregate Views. Proc. ACM Manag. Data 2, 1 (SIGMOD), Article 71 (February 2024), 27 pages.	en_US
dc.contributor.department	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
dc.relation.journal	Proceedings of the ACM on Management of Data	en_US
dc.identifier.mitlicense	PUBLISHER_POLICY
dc.eprint.version	Final published version	en_US
dc.type.uri	http://purl.org/eprint/type/JournalArticle	en_US
eprint.status	http://purl.org/eprint/status/PeerReviewed	en_US
dc.date.updated	2024-04-01T07:49:27Z
dc.language.rfc3066	en
dc.rights.holder	The author(s)
dspace.date.submission	2024-04-01T07:49:28Z
mit.journal.volume	2	en_US
mit.journal.issue	1	en_US
mit.license	PUBLISHER_POLICY
mit.metadata.status	Authority Work and Publication Information Needed	en_US

Files in this item

Name:: 3639328.pdf
Size:: 1.911Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record