Targeting Prospective Customers: Robustness of Machine-Learning Methods to Typical Data Challenges

Simester, Duncan; Timoshenko, Artem; Zoumpoulis, Spyros I.

Terms of use

Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/

Abstract

We investigate how firms can use the results of field experiments to optimize the targeting of promotions when prospecting for new customers. We evaluate seven widely used machine-learning methods using a series of two large-scale field experiments. The first field experiment generates a common pool of training data for each of the seven methods. We then validate the seven optimized policies, one from each method, together with uniform benchmark policies in a second field experiment. The findings not only compare the performance of the targeting methods, but also demonstrate how well the methods address common data challenges. Our results reveal that when the training data are ideal, model-driven methods perform better than distance-driven methods and classification methods. However, the performance advantage vanishes in the presence of challenges that affect the quality of the training data, including the extent to which the training data capture details of the implementation setting. The challenges we study are covariate shift, concept shift, information loss through aggregation, and imbalanced data. Intuitively, the model-driven methods make better use of the information available in the training data, but the performance of these methods is more sensitive to deterioration in the quality of this information. The classification methods we tested performed relatively poorly. We explain the poor performance of the classification methods in our setting and describe how the performance of these methods could be improved.

Date issued

2019-11

URI

https://hdl.handle.net/1721.1/130508

Department

Sloan School of Management

Journal

Management Science

Publisher

Institute for Operations Research and the Management Sciences (INFORMS)

Version

Original manuscript

ISSN

0025-1909

1526-5501

Collections

MIT Open Access Articles