Show simple item record

dc.contributor.author: Picard, Rosalind W.
dc.contributor.author: Hoque, Mohammed Ehasanul
dc.contributor.author: El Kaliouby, Rana
dc.date.accessioned: 2010-07-15T19:06:40Z
dc.date.available: 2010-07-15T19:06:40Z
dc.date.issued: 2009-09
dc.identifier.isbn: 978-3-642-04379-6
dc.identifier.uri: http://hdl.handle.net/1721.1/56633
dc.description.abstract: This paper describes the challenges of getting ground-truth affective labels for spontaneous video, and presents implications for systems such as virtual agents that have automated facial analysis capabilities. We first present a dataset from an intelligent tutoring application and describe the most prevalent approach to labeling such data. We then present an alternative labeling approach, which closely models how the majority of automated facial analysis systems are designed. We show that while participants, peers, and trained judges report high inter-rater agreement on expressions of delight, confusion, flow, frustration, boredom, surprise, and neutral when shown the entire 30 minutes of video for each participant, inter-rater agreement drops below chance when human coders are asked to watch and label short 8-second clips for the same set of labels. We also perform discriminative analysis of facial action units for each affective state represented in the clips. The results emphasize that human coders rely heavily on factors such as familiarity with the person and the context of the interaction to correctly infer a person's affective state; without this information, the reliability of both humans and machines in attributing affective labels to spontaneous facial-head movements drops significantly. [en_US]
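The abstract's central quantity is inter-rater agreement among multiple coders assigning categorical affect labels to clips. The record does not say which agreement statistic the authors used; as an illustration only, the sketch below computes Fleiss' kappa, one standard chance-corrected agreement measure for more than two raters. The `clips` matrix and category set are hypothetical, not data from the paper.

```python
import numpy as np

def fleiss_kappa(ratings: np.ndarray) -> float:
    """Fleiss' kappa for a matrix where ratings[i, j] is the number of
    raters who assigned clip i to category j. Every row must sum to the
    same number of raters r."""
    n, _ = ratings.shape
    r = int(ratings.sum(axis=1)[0])          # raters per clip
    assert (ratings.sum(axis=1) == r).all(), "unequal rater counts"
    # Observed agreement per clip, averaged over clips.
    p_i = (np.square(ratings).sum(axis=1) - r) / (r * (r - 1))
    p_bar = p_i.mean()
    # Expected chance agreement from overall category proportions.
    p_j = ratings.sum(axis=0) / (n * r)
    p_e = np.square(p_j).sum()
    return (p_bar - p_e) / (1 - p_e)

# Hypothetical example: 4 clips, 3 coders, categories
# [delight, confusion, frustration].
clips = np.array([
    [3, 0, 0],   # all coders agree: delight
    [0, 2, 1],
    [1, 1, 1],   # complete disagreement
    [0, 0, 3],
])
print(f"kappa = {fleiss_kappa(clips):.3f}")
```

A kappa near 1 indicates agreement well above chance, a value near 0 indicates chance-level agreement, and negative values indicate the below-chance regime the abstract reports when coders label short 8-second clips without context.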
dc.language.iso: en_US
dc.publisher: Springer Berlin [en_US]
dc.relation.isversionof: http://dx.doi.org/10.1007/978-3-642-04380-2_37 [en_US]
dc.rights: Attribution-Noncommercial-Share Alike 3.0 Unported [en_US]
dc.rights.uri: http://creativecommons.org/licenses/by-nc-sa/3.0/ [en_US]
dc.source: Alex Khitrik [akhitrik@media.mit.edu] after request by Rosalind Picard [en_US]
dc.title: When Human Coders (and Machines) Disagree on the Meaning of Facial Affect in Spontaneous Videos [en_US]
dc.type: Article [en_US]
dc.identifier.citation: Hoque, M. E., R. El Kaliouby, and R. W. Picard. "When Human Coders (and Machines) Disagree on the Meaning of Facial Affect in Spontaneous Videos." Intelligent Virtual Agents, Proceedings 5773 (2009): 337-43. [en_US]
dc.contributor.department: Massachusetts Institute of Technology. Media Laboratory [en_US]
dc.contributor.department: Program in Media Arts and Sciences (Massachusetts Institute of Technology) [en_US]
dc.contributor.approver: Picard, Rosalind W.
dc.contributor.mitauthor: Picard, Rosalind W.
dc.contributor.mitauthor: Hoque, Mohammed Ehasanul
dc.contributor.mitauthor: El Kaliouby, Rana
dc.relation.journal: Intelligent Virtual Agents, 9th International Conference, IVA 2009, Amsterdam, The Netherlands, September 14-16, 2009, Proceedings [en_US]
dc.eprint.version: Original manuscript [en_US]
dc.type.uri: http://purl.org/eprint/type/ConferencePaper [en_US]
dspace.orderedauthors: Hoque, Mohammed E.; El Kaliouby, Rana; Picard, Rosalind W. [en]
dc.identifier.orcid: https://orcid.org/0000-0002-5661-0022
mit.license: OPEN_ACCESS_POLICY [en_US]
mit.metadata.status: Complete

