Learning Theory Analysis for Association Rules and Sequential Event Prediction
Author(s)
Rudin, Cynthia; Letham, Benjamin; Madigan, David
DownloadRudin-2013-Learning Theory Analysis.pdf (1.878Mb)
PUBLISHER_POLICY
Publisher Policy
Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.
Terms of use
Metadata
Show full item recordAbstract
We present a theoretical analysis for prediction algorithms based on association rules. As part of this analysis, we introduce a problem for which rules are particularly natural, called “sequential event prediction." In sequential event prediction, events in a sequence are revealed one by one, and the goal is to determine which event will next be revealed. The training set is a collection of past sequences of events. An example application is to predict which item will next be placed into a customer's online shopping cart, given his/her past purchases. In the context of this problem, algorithms based on association rules have distinct advantages over classical statistical and machine learning methods: they look at correlations based on subsets of co-occurring past events (items a and b imply item c), they can be applied to the sequential event prediction problem in a natural way, they can potentially handle the “cold start" problem where the training set is small, and they yield interpretable predictions. In this work, we present two algorithms that incorporate association rules. These algorithms can be used both for sequential event prediction and for supervised classification, and they are simple enough that they can possibly be understood by users, customers, patients, managers, etc. We provide generalization guarantees on these algorithms based on algorithmic stability analysis from statistical learning theory. We include a discussion of the strict minimum support threshold often used in association rule mining, and introduce an “adjusted confidence" measure that provides a weaker minimum support condition that has advantages over the strict minimum support. The paper brings together ideas from statistical learning theory, association rule mining and Bayesian analysis.
Date issued
2013-11Department
Massachusetts Institute of Technology. Operations Research Center; Sloan School of ManagementJournal
Journal of Machine Learning Research
Publisher
Association for Computing Machinery (ACM)
Citation
Rudin, Cynthia, Benjamin Letham, and David Madigan. "Learning Theory Analysis for Association Rules and Sequential Event Prediction." Journal of Machine Learning Research 14 (2013): 3441-3492.
Version: Final published version
ISSN
1532-4435
1533-7928