Targeting Prospective Customers: Robustness of Machine-Learning Methods to Typical Data Challenges
Author(s)
Simester, Duncan; Timoshenko, Artem; Zoumpoulis, Spyros I.
DownloadSubmitted version (2.693Mb)
Open Access Policy
Open Access Policy
Creative Commons Attribution-Noncommercial-Share Alike
Terms of use
Metadata
Show full item recordAbstract
We investigate how firms can use the results of field experiments to optimize the targeting of promotions when prospecting for new customers. We evaluate seven widely used machine-learning methods using a series of two large-scale field experiments. The first field experiment generates a common pool of training data for each of the seven methods. We then validate the seven optimized policies provided by each method together with uniform benchmark policies in a second field experiment. The findings not only compare the performance of the targeting methods, but also demonstrate how well the methods address common data challenges. Our results reveal that when the training data are ideal, model-driven methods perform better than distance-driven methods and classification methods. However, the performance advantage vanishes in the presence of challenges that affect the quality of the training data, including the extent to which the training data captures details of the implementation setting. The challenges we study are covariate shift, concept shift, information loss through aggregation, and imbalanced data. Intuitively, the model-driven methods make better use of the information available in the training data, but the performance of these methods is more sensitive to deterioration in the quality of this information. The classification methods we tested performed relatively poorly. We explain the poor performance of the classification methods in our setting and describe how the performance of these methods could be improved.
Date issued
2019-11Department
Sloan School of ManagementJournal
Management Science
Publisher
Institute for Operations Research and the Management Sciences (INFORMS)
Citation
Simester, Duncan et al. "Targeting Prospective Customers: Robustness of Machine-Learning Methods to Typical Data Challenges." Management Science 66, 6 (June 2020): 2291-2334. © 2019 INFORMS
Version: Original manuscript
ISSN
0025-1909
1526-5501