Deriving Machine Attention from Human Rationales

Mo Yu; Regina Barzilay; Shiyu Chang; Yujia Bao

arxiv: 1808.09367 · v1 · pith:LSQ5SI6Hnew · submitted 2018-08-28 · 💻 cs.CL

Deriving Machine Attention from Human Rationales

Yujia Bao , Shiyu Chang , Mo Yu , Regina Barzilay This is my paper

classification 💻 cs.CL

keywords attentionrationalesdomainshypothesislow-resourcemappingacrossamounts

0 comments

read the original abstract

Attention-based models are successful when trained on large amounts of data. In this paper, we demonstrate that even in the low-resource scenario, attention can be learned effectively. To this end, we start with discrete human-annotated rationales and map them into continuous attention. Our central hypothesis is that this mapping is general across domains, and thus can be transferred from resource-rich domains to low-resource ones. Our model jointly learns a domain-invariant representation and induces the desired mapping between rationales and attention. Our empirical results validate this hypothesis and show that our approach delivers significant gains over state-of-the-art baselines, yielding over 15% average error reduction on benchmark datasets.

This paper has not been read by Pith yet.

Deriving Machine Attention from Human Rationales

discussion (0)