A Multi-Objective Framework for Balancing Fairness and Accuracy in Debiasing Machine Learning Models
Author(s)
Nagpal, Rashmi; Khan, Ariba; Borkar, Mihir; Gupta, Amar
Download
make-06-00105-v2.pdf (3.481Mb)
Publisher with Creative Commons License
Creative Commons Attribution
Terms of use
Abstract
Machine learning algorithms significantly impact decision-making in high-stakes domains, necessitating a balance between fairness and accuracy. This study introduces an in-processing, multi-objective framework that leverages the Reject Option Classification (ROC) algorithm to simultaneously optimize fairness and accuracy while safeguarding protected attributes such as age and gender. Our approach seeks a multi-objective optimization solution that balances accuracy, group fairness loss, and individual fairness loss. The framework integrates fairness objectives without relying on a weighted summation method, instead focusing on directly optimizing the trade-offs. Empirical evaluations on publicly available datasets, including German Credit, Adult Income, and COMPAS, show that the ROC-based approach performs best: on the German Credit dataset it achieves an accuracy of 94.29%, an individual fairness loss of 0.04, and a group fairness loss of 0.06. These results underscore the effectiveness of our framework, particularly the ROC component, in enhancing both the fairness and performance of machine learning models.
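One common way to compare models on several objectives without collapsing them into a weighted sum is to keep only the Pareto-optimal candidates. The sketch below is a minimal illustration of that idea, not the authors' procedure: the Candidate type, the dominates and pareto_front helpers, and every number in it are hypothetical and are not taken from the paper.

```python
from typing import List, NamedTuple


class Candidate(NamedTuple):
    """A hypothetical trained model scored on the three objectives."""
    name: str
    accuracy: float         # higher is better
    group_loss: float       # group fairness loss, lower is better
    individual_loss: float  # individual fairness loss, lower is better


def dominates(a: Candidate, b: Candidate) -> bool:
    """True if a is no worse than b on every objective and strictly better on one."""
    no_worse = (
        a.accuracy >= b.accuracy
        and a.group_loss <= b.group_loss
        and a.individual_loss <= b.individual_loss
    )
    strictly_better = (
        a.accuracy > b.accuracy
        or a.group_loss < b.group_loss
        or a.individual_loss < b.individual_loss
    )
    return no_worse and strictly_better


def pareto_front(candidates: List[Candidate]) -> List[Candidate]:
    """Keep the candidates that no other candidate dominates."""
    return [
        c for c in candidates
        if not any(dominates(other, c) for other in candidates)
    ]


# Illustrative scores only (assumed for this sketch, not results from the paper).
candidates = [
    Candidate("A", 0.90, 0.10, 0.08),
    Candidate("B", 0.85, 0.05, 0.06),
    Candidate("C", 0.84, 0.12, 0.09),  # worse than A on all three objectives
    Candidate("D", 0.92, 0.20, 0.15),
]

front = pareto_front(candidates)
print([c.name for c in front])  # ['A', 'B', 'D']
```

Each model left on the front represents a different trade-off between accuracy and the two fairness losses, and choosing among them is a separate decision that needs no objective weights.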
Date issued
2024-09-20
Department
Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Journal
Machine Learning and Knowledge Extraction
Publisher
Multidisciplinary Digital Publishing Institute
Citation
Nagpal, R.; Khan, A.; Borkar, M.; Gupta, A. A Multi-Objective Framework for Balancing Fairness and Accuracy in Debiasing Machine Learning Models. Mach. Learn. Knowl. Extr. 2024, 6, 2130-2148.
Version: Final published version