Show simple item record

dc.contributor.author    Aloor, Jasmine
dc.contributor.author    Nayak, Siddharth Nagar
dc.contributor.author    Dolan, Sydney
dc.contributor.author    Balakrishnan, Hamsa
dc.date.accessioned    2024-11-14T21:09:42Z
dc.date.available    2024-11-14T21:09:42Z
dc.date.issued    2024-10-29
dc.identifier.issn    2833-0528
dc.identifier.uri    https://hdl.handle.net/1721.1/157544
dc.description.abstract    Multi-agent systems are trained to maximize shared cost objectives, which typically reflect system-level efficiency. However, in the resource-constrained environments of mobility and transportation systems, efficiency may be achieved at the expense of fairness --- certain agents may incur significantly greater costs or lower rewards compared to others. Tasks could be distributed inequitably, leading to some agents receiving an unfair advantage while others incur disproportionately high costs. It is, therefore, important to consider the tradeoffs between efficiency and fairness in such settings. We consider the problem of fair multi-agent navigation for a group of decentralized agents using multi-agent reinforcement learning (MARL). We consider the reciprocal of the coefficient of variation of the distances traveled by different agents as a measure of fairness and investigate whether agents can learn to be fair without significantly sacrificing efficiency (i.e., increasing the total distance traveled). We find that by training agents using min-max fair distance goal assignments along with a reward term that incentivizes fairness as they move towards their goals, the agents (1) learn a fair assignment of goals and (2) achieve almost perfect goal coverage in navigation scenarios using only local observations. For goal coverage scenarios, we find that, on average, the proposed model yields a 14% improvement in efficiency and a 5% improvement in fairness over a baseline model that is trained using random assignments. Furthermore, an average of 21% improvement in fairness can be achieved by the proposed model as compared to a model trained on optimally efficient assignments; this increase in fairness comes at the expense of only a 7% decrease in efficiency. Finally, we extend our method to environments in which agents must complete coverage tasks in prescribed formations and show that it is possible to do so without tailoring the models to specific formation shapes.    en_US
dc.publisher    ACM    en_US
dc.relation.isversionof    http://dx.doi.org/10.1145/3702012    en_US
dc.rights    Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.    en_US
dc.source    Association for Computing Machinery    en_US
dc.title    Cooperation and Fairness in Multi-Agent Reinforcement Learning    en_US
dc.type    Article    en_US
dc.identifier.citation    Aloor, Jasmine, Nayak, Siddharth Nagar, Dolan, Sydney and Balakrishnan, Hamsa. 2024. "Cooperation and Fairness in Multi-Agent Reinforcement Learning." ACM Journal on Autonomous Transportation Systems.
dc.contributor.department    Massachusetts Institute of Technology. Department of Aeronautics and Astronautics    en_US
dc.relation.journal    ACM Journal on Autonomous Transportation Systems    en_US
dc.identifier.mitlicense    PUBLISHER_CC
dc.eprint.version    Final published version    en_US
dc.type.uri    http://purl.org/eprint/type/JournalArticle    en_US
eprint.status    http://purl.org/eprint/status/PeerReviewed    en_US
dc.date.updated    2024-11-01T07:45:59Z
dc.language.rfc3066    en
dc.rights.holder    The author(s)
dspace.date.submission    2024-11-01T07:45:59Z
mit.license    PUBLISHER_POLICY
mit.metadata.status    Authority Work and Publication Information Needed    en_US
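Note: the abstract above measures fairness as the reciprocal of the coefficient of variation of the distances traveled by the agents. As a minimal illustrative sketch (not the authors' code), assuming this metric is simply the mean of the per-agent distances divided by their standard deviation, it could be computed as below; the function name fairness_metric and the example distances are hypothetical.

    import numpy as np

    def fairness_metric(distances):
        # Reciprocal of the coefficient of variation: mean / std of the
        # per-agent distances traveled. Larger values indicate that the
        # agents traveled more similar distances (a fairer outcome).
        d = np.asarray(distances, dtype=float)
        std = d.std()
        return d.mean() / std if std > 0 else float("inf")

    # Hypothetical example: nearly equal distances score higher than
    # highly unequal ones.
    print(fairness_metric([10.0, 11.0, 10.5]))  # high fairness
    print(fairness_metric([2.0, 10.0, 25.0]))   # low fairness

Per the abstract, the paper combines such a fairness term with min-max fair goal assignments and a fairness-incentivizing reward during MARL training; the snippet only illustrates the metric itself.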

