dc.contributor.author: Bao, Yujia
dc.contributor.author: Chang, Shiyu
dc.contributor.author: Yu, Mo
dc.contributor.author: Barzilay, Regina
dc.date.accessioned: 2021-02-09T22:36:48Z
dc.date.available: 2021-02-09T22:36:48Z
dc.date.issued: 2018-10
dc.identifier.uri: https://hdl.handle.net/1721.1/129732
dc.description.abstract: Attention-based models are successful when trained on large amounts of data. In this paper, we demonstrate that even in the low-resource scenario, attention can be learned effectively. To this end, we start with discrete human-annotated rationales and map them into continuous attention. Our central hypothesis is that this mapping is general across domains, and thus can be transferred from resource-rich domains to low-resource ones. Our model jointly learns a domain-invariant representation and induces the desired mapping between rationales and attention. Our empirical results validate this hypothesis and show that our approach delivers significant gains over state-of-the-art baselines, yielding over 15% average error reduction on benchmark datasets. [en_US] (See the code sketch after this record.)
dc.language.iso: en
dc.publisher: Association for Computational Linguistics (ACL) [en_US]
dc.relation.isversionof: http://dx.doi.org/10.18653/v1/d18-1216 [en_US]
dc.rights: Creative Commons Attribution 4.0 International license [en_US]
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/ [en_US]
dc.source: Association for Computational Linguistics [en_US]
dc.title: Deriving Machine Attention from Human Rationales [en_US]
dc.type: Article [en_US]
dc.identifier.citation: Bao, Yujia et al. "Deriving Machine Attention from Human Rationales." Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, October-November 2018, Brussels, Belgium. Association for Computational Linguistics, 2018. © 2018 Association for Computational Linguistics [en_US]
dc.contributor.department: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory [en_US]
dc.relation.journal: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing [en_US]
dc.eprint.version: Final published version [en_US]
dc.type.uri: http://purl.org/eprint/type/ConferencePaper [en_US]
eprint.status: http://purl.org/eprint/status/NonPeerReviewed [en_US]
dc.date.updated: 2020-12-01T12:55:59Z
dspace.orderedauthors: Bao, Y; Chang, S; Yu, M; Barzilay, R [en_US]
dspace.date.submission: 2020-12-01T12:56:04Z
mit.license: PUBLISHER_CC
mit.metadata.status: Complete
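
The abstract above describes learning a mapping from discrete human-annotated rationales to continuous attention, trained on resource-rich domains and transferred to low-resource ones. Below is a minimal, hypothetical PyTorch sketch of the rationale-to-attention idea only; the class name RationaleToAttention, the layer sizes, and the feature concatenation are illustrative assumptions, not the authors' released architecture (which, per the abstract, additionally learns a domain-invariant representation to make the mapping transferable).

import torch
import torch.nn as nn
import torch.nn.functional as F

class RationaleToAttention(nn.Module):
    """Hypothetical sketch: turn a binary human rationale mask plus
    per-token encoder states into a continuous attention distribution.
    Names and sizes are illustrative, not the paper's architecture."""

    def __init__(self, hidden_dim: int):
        super().__init__()
        # Score each token from its hidden state plus its rationale bit.
        self.scorer = nn.Linear(hidden_dim + 1, 1)

    def forward(self, token_states: torch.Tensor,
                rationale_mask: torch.Tensor) -> torch.Tensor:
        # token_states:   (batch, seq_len, hidden_dim) encoder outputs
        # rationale_mask: (batch, seq_len) binary {0, 1} human annotations
        features = torch.cat(
            [token_states, rationale_mask.unsqueeze(-1).float()], dim=-1)
        scores = self.scorer(features).squeeze(-1)   # (batch, seq_len)
        # Softmax yields a continuous attention distribution over tokens.
        return F.softmax(scores, dim=-1)

# Toy usage: 2 sentences of 5 tokens with 64-dim encoder states.
mapper = RationaleToAttention(hidden_dim=64)
states = torch.randn(2, 5, 64)
mask = torch.tensor([[0, 1, 1, 0, 0],
                     [1, 0, 0, 0, 1]])
attention = mapper(states, mask)                     # (2, 5), rows sum to 1

In the transfer setting the abstract describes, such a mapper would be trained where rationale annotations are plentiful and then applied to a low-resource target domain, relying on a shared, domain-invariant encoder so that the learned mapping carries over.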

