Show simple item record

dc.contributor.authorBalagopalan, Aparna
dc.contributor.authorMadras, David
dc.contributor.authorYang, David H.
dc.contributor.authorHadfield-Menell, Dylan
dc.contributor.authorHadfield, Gillian K.
dc.contributor.authorGhassemi, Marzyeh
dc.date.accessioned2024-02-09T20:48:29Z
dc.date.available2024-02-09T20:48:29Z
dc.date.issued2023-05-12
dc.identifier.issn2375-2548
dc.identifier.urihttps://hdl.handle.net/1721.1/153492
dc.description.abstractAs governments and industry turn to increased use of automated decision systems, it becomes essential to consider how closely such systems can reproduce human judgment. We identify a core potential failure, finding that annotators label objects differently depending on whether they are being asked a factual question or a normative question. This challenges a natural assumption maintained in many standard machine-learning (ML) data acquisition procedures: that there is no difference between predicting the factual classification of an object and an exercise of judgment about whether an object violates a rule premised on those facts. We find that using factual labels to train models intended for normative judgments introduces a notable measurement error. We show that models trained using factual labels yield significantly different judgments than those trained using normative labels and that the impact of this effect on model performance can exceed that of other factors (e.g., dataset size) that routinely attract attention from ML researchers and practitioners.en_US
dc.language.isoen_US
dc.publisherAmerican Association for the Advancement of Science (AAAS)en_US
dc.relation.isversionof10.1126/sciadv.abq0701en_US
dc.rightsCreative Commons Attributionen_US
dc.rightsAn error occurred on the license name.*
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en_US
dc.sourceAmerican Association for the Advancement of Scienceen_US
dc.subjectMultidisciplinaryen_US
dc.titleJudging facts, judging norms: Training machine learning models to judge humans requires a modified approach to labeling dataen_US
dc.typeArticleen_US
dc.identifier.citationAparna Balagopalan et al. ,Judging facts, judging norms: Training machine learning models to judge humans requires a modified approach to labeling data.Sci. Adv.9,eabq0701(2023).en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dspace.date.submission2024-02-09T20:46:57Z
mit.journal.volume9en_US
mit.journal.issue19en_US
mit.licensePUBLISHER_CC
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record