pith. sign in

arxiv: 1704.06567 · v1 · pith:66OBG34Xnew · submitted 2017-04-21 · 💻 cs.CL · cs.NE

Attention Strategies for Multi-Source Sequence-to-Sequence Learning

classification 💻 cs.CL cs.NE
keywords attentionmethodstaskslearningmulti-sourceproposedresultssequence-to-sequence
0
0 comments X
read the original abstract

Modeling attention in neural multi-source sequence-to-sequence learning remains a relatively unexplored area, despite its usefulness in tasks that incorporate multiple source languages or modalities. We propose two novel approaches to combine the outputs of attention mechanisms over each source sequence, flat and hierarchical. We compare the proposed methods with existing techniques and present results of systematic evaluation of those methods on the WMT16 Multimodal Translation and Automatic Post-editing tasks. We show that the proposed methods achieve competitive results on both tasks.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.