| dc.contributor.author | Bao, Yujia | |
| dc.contributor.author | Chang, Shiyu | |
| dc.contributor.author | Yu, Mo | |
| dc.contributor.author | Barzilay, Regina | |
| dc.date.accessioned | 2021-02-09T22:36:48Z | |
| dc.date.available | 2021-02-09T22:36:48Z | |
| dc.date.issued | 2018-10 | |
| dc.identifier.uri | https://hdl.handle.net/1721.1/129732 | |
| dc.description.abstract | Attention-based models are successful when trained on large amounts of data. In this paper, we demonstrate that even in the low-resource scenario, attention can be learned effectively. To this end, we start with discrete human-annotated rationales and map them into continuous attention. Our central hypothesis is that this mapping is general across domains, and thus can be transferred from resource-rich domains to low-resource ones. Our model jointly learns a domain-invariant representation and induces the desired mapping between rationales and attention. Our empirical results validate this hypothesis and show that our approach delivers significant gains over state-of-the-art baselines, yielding over 15% average error reduction on benchmark datasets. | en_US |
| dc.language.iso | en | |
| dc.publisher | Association for Computational Linguistics (ACL) | en_US |
| dc.relation.isversionof | http://dx.doi.org/10.18653/v1/d18-1216 | en_US |
| dc.rights | Creative Commons Attribution 4.0 International license | en_US |
| dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | en_US |
| dc.source | Association for Computational Linguistics | en_US |
| dc.title | Deriving Machine Attention from Human Rationales | en_US |
| dc.type | Article | en_US |
| dc.identifier.citation | Bao, Yujia et al. "Deriving Machine Attention from Human Rationales." Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, October-November 2018, Brussels, Belgium, Association for Computational Linguistics, October 2018. © 2018 Association for Computational Linguistics | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory | en_US |
| dc.relation.journal | Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing | en_US |
| dc.eprint.version | Final published version | en_US |
| dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
| eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
| dc.date.updated | 2020-12-01T12:55:59Z | |
| dspace.orderedauthors | Bao, Y; Chang, S; Yu, M; Barzilay, R | en_US |
| dspace.date.submission | 2020-12-01T12:56:04Z | |
| mit.license | PUBLISHER_CC | |
| mit.metadata.status | Complete | |