Show simple item record

dc.contributor.authorUnhelkar, Vaibhav Vasant
dc.contributor.authorShah, Julie A
dc.date.accessioned2020-06-19T17:59:13Z
dc.date.available2020-06-19T17:59:13Z
dc.date.issued2019
dc.identifier.issn2374-3468
dc.identifier.urihttps://hdl.handle.net/1721.1/125889
dc.description.abstractArtificial agents that interact with other (human or artificial) agents require models in order to reason about those other agents’ behavior. In addition to the predictive utility of these models, maintaining a model that is aligned with an agent’s true generative model of behavior is critical for effective human-agent interaction. In applications wherein observations and partial specification of the agent’s behavior are available, achieving model alignment is challenging for a variety of reasons. For one, the agent’s decision factors are often not completely known; further, prior approaches that rely upon observations of agents’ behavior alone can fail to recover the true model, since multiple models can explain observed behavior equally well. To achieve better model alignment, we provide a novel approach capable of learning aligned models that conform to partial knowledge of the agent’s behavior. Central to our approach are a factored model of behavior (AMM), along with Bayesian nonparametric priors, and an inference approach capable of incorporating partial specifications as constraints for model learning. We evaluate our approach in experiments and demonstrate improvements in metrics of model alignment.en_US
dc.language.isoen
dc.relation.isversionof10.1609/aaai.v33i01.33012522en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourceMIT web domainen_US
dc.titleLearning models of sequential decision-making with partial specification of agent behavioren_US
dc.typeArticleen_US
dc.identifier.citationUnhelkar, Vaibhav V., and Julie A. Shah, "Learning models of sequential decision-making with partial specification of agent behavior." Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, Jan. 27–Feb. 1, 2019, Honolulu, Hawai'i, AAAI Press, 2019: doi 10.1609/aaai.v33i01.33012522 ©2019 Author(s)en_US
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.relation.journalProceedings of the AAAI Conference on Artificial Intelligenceen_US
dc.eprint.versionOriginal manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dc.date.updated2019-11-01T12:53:18Z
dspace.date.submission2019-11-01T12:53:20Z
mit.journal.volume33en_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record