Treatment Policy Learning in Multiobjective Settings with Fully Observed Outcomes

Boominathan, Soorajnath; Oberst, Michael; Zhou, Helen; Kanjilal, Sanjat; Sontag, David

dc.contributor.author	Boominathan, Soorajnath
dc.contributor.author	Oberst, Michael
dc.contributor.author	Zhou, Helen
dc.contributor.author	Kanjilal, Sanjat
dc.contributor.author	Sontag, David
dc.date.accessioned	2021-11-08T16:46:37Z
dc.date.available	2021-11-08T16:46:37Z
dc.date.issued	2020-08
dc.identifier.uri	https://hdl.handle.net/1721.1/137708
dc.description.abstract	© 2020 Owner/Author. In several medical decision-making problems, such as antibiotic prescription, laboratory testing can provide precise indications for how a patient will respond to different treatment options. This enables us to "fully observe" all potential treatment outcomes, but while present in historical data, these results are infeasible to produce in real-time at the point of the initial treatment decision. Moreover, treatment policies in these settings often need to trade off between multiple competing objectives, such as effectiveness of treatment and harmful side effects. We present, compare, and evaluate three approaches for learning individualized treatment policies in this setting: First, we consider two indirect approaches, which use predictive models of treatment response to construct policies optimal for different trade-offs between objectives. Second, we consider a direct approach that constructs such a set of policies without intermediate models of outcomes. Using a medical dataset of Urinary Tract Infection (UTI) patients, we show that all approaches learn policies that achieve strictly better performance on all outcomes than clinicians, while also trading off between different objectives. We demonstrate additional benefits of the direct approach, including flexibly incorporating other goals such as deferral to physicians on simple cases.	en_US
dc.language.iso	en
dc.publisher	ACM	en_US
dc.relation.isversionof	10.1145/3394486.3403245	en_US
dc.rights	Creative Commons Attribution 4.0 International license	en_US
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/	en_US
dc.source	ACM	en_US
dc.title	Treatment Policy Learning in Multiobjective Settings with Fully Observed Outcomes	en_US
dc.type	Article	en_US
dc.identifier.citation	Boominathan, Soorajnath, Oberst, Michael, Zhou, Helen, Kanjilal, Sanjat and Sontag, David. 2020. "Treatment Policy Learning in Multiobjective Settings with Fully Observed Outcomes." Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.
dc.contributor.department	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
dc.contributor.department	Massachusetts Institute of Technology. Institute for Medical Engineering & Science
dc.relation.journal	Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining	en_US
dc.eprint.version	Final published version	en_US
dc.type.uri	http://purl.org/eprint/type/JournalArticle	en_US
eprint.status	http://purl.org/eprint/status/PeerReviewed	en_US
dc.date.updated	2021-01-26T18:52:35Z
dspace.orderedauthors	Boominathan, S; Oberst, M; Zhou, H; Kanjilal, S; Sontag, D	en_US
dspace.date.submission	2021-01-26T18:52:39Z
mit.license	PUBLISHER_CC
mit.metadata.status	Authority Work and Publication Information Needed	en_US

Files in this item

Name:: 3394486.3403245.pdf
Size:: 1.803Mb
Format:: PDF
Description:: Published version

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record