Latent Space Alignment Using Adversarially Guided Self-Play

Tucker, Mycal; Zhou, Yilun; Shah, Julie A

Author(s)

Tucker, Mycal; Zhou, Yilun; Shah, Julie A

DownloadPublished version (2.887Mb)

Publisher with Creative Commons License

Terms of use

Creative Commons Attribution-NonCommercial-NoDerivatives https://creativecommons.org/licenses/by-nc-nd/4.0/

Metadata

Show full item record

Abstract

We envision a world in which robots serve as capable partners in heterogeneous teams composed of other robots or humans. A crucial step towards such a world is enabling robots to learn to use the same representations as their partners; with a shared representation scheme, information may be passed among teammates. We define the problem of learning a fixed partner’s representation scheme as that of latent space alignment and propose metrics for evaluating the quality of alignment. While techniques from prior art in other fields may be applied to the latent space alignment problem, they often require interaction with partners during training time or large amounts of training data. We developed a technique, Adversarially Guided Self-Play (ASP), that trains agents to solve the latent space alignment problem with little training data and no access to their pre-trained partners. Simulation results confirmed that, despite using less training data, agents trained by ASP aligned better with other agents than agents trained by other techniques. Subsequent human-participant studies involving hundreds of Amazon Mechanical Turk workers showed how laypeople understood our machines enough to perform well on team tasks and anticipate their machine partner’s successes or failures.

Date issued

2022-08-26

URI

https://hdl.handle.net/1721.1/163618

Department

Massachusetts Institute of Technology. Department of Aeronautics and Astronautics

Journal

International Journal of Human–Computer Interaction

Publisher

Taylor & Francis

Citation

Tucker, M., Zhou, Y., & Shah, J. A. (2022). Latent Space Alignment Using Adversarially Guided Self-Play. International Journal of Human–Computer Interaction, 38(18–20), 1753–1771.

Version: Final published version

Collections

MIT Open Access Articles