Improving Cross-Domain Performance for Relation Extraction via Dependency Prediction and Information Flow Control

Amir Pouran Ben Veyseh; Dejing Dou; Thien Huu Nguyen

arxiv: 1907.03230 · v1 · pith:ZPYF65RCnew · submitted 2019-07-07 · 💻 cs.CL

Improving Cross-Domain Performance for Relation Extraction via Dependency Prediction and Information Flow Control

Amir Pouran Ben Veyseh , Thien Huu Nguyen , Dejing Dou This is my paper

Pith reviewed 2026-05-25 01:47 UTC · model grok-4.3

classification 💻 cs.CL

keywords relation extractiondependency predictioninformation flow controlcross-domain performancedeep learningsemantic relationsentity mentions

0 comments

The pith

Jointly predicting dependency and semantic relations with entity-based flow control improves cross-domain relation extraction.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes a deep learning model for relation extraction that jointly predicts dependency trees and semantic relations. It also introduces a mechanism to control information flow based on the positions of the input entity mentions. This approach aims to capture context beyond just the syntactic structures provided by dependency trees. Experiments on benchmark datasets demonstrate significant outperformance over existing methods, particularly for cross-domain generalization.

Core claim

The model jointly predicts dependency and semantics relations together with an information-flow control mechanism based on entity mentions, allowing it to outperform existing methods for relation extraction significantly on benchmark datasets by capturing important context beyond syntactic structures.

What carries the argument

Joint prediction of dependency trees and semantic relations combined with an entity-mention-based information flow control mechanism.

If this is right

The model can better capture context information beyond syntactic structures.
It achieves improved cross-domain generalization in relation extraction.
It significantly outperforms current deep learning models on benchmark datasets.
Dependency information is used more effectively without limiting the model to syntactic paths.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

This suggests that multi-task learning with syntax and semantics can help in other information extraction tasks.
Future work could explore applying the flow control to other graph structures in NLP.
Testing the model on additional domains would further confirm the cross-domain benefits.

Load-bearing premise

Jointly predicting dependency trees and controlling information flow based on entity mentions will allow capturing important context beyond syntactic structures.

What would settle it

An experiment showing that the proposed model does not outperform standard dependency-guided models on cross-domain relation extraction benchmarks would falsify the claim.

read the original abstract

Relation Extraction (RE) is one of the fundamental tasks in Information Extraction and Natural Language Processing. Dependency trees have been shown to be a very useful source of information for this task. The current deep learning models for relation extraction has mainly exploited this dependency information by guiding their computation along the structures of the dependency trees. One potential problem with this approach is it might prevent the models from capturing important context information beyond syntactic structures and cause the poor cross-domain generalization. This paper introduces a novel method to use dependency trees in RE for deep learning models that jointly predicts dependency and semantics relations. We also propose a new mechanism to control the information flow in the model based on the input entity mentions. Our extensive experiments on benchmark datasets show that the proposed model outperforms the existing methods for RE significantly.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper claims to fix cross-domain RE but only reports in-domain benchmark results, so the central motivation stays untested.

read the letter

The headline takeaway is that this work proposes joint dependency-semantics prediction plus an entity-gated information flow mechanism for relation extraction, yet its experiments never actually measure cross-domain transfer. The abstract and title set up the problem as poor generalization from strict dependency-guided computation, but the reported results stay on standard benchmarks with no train-on-one-corpus test-on-another splits. That mismatch leaves the main claim unsupported even if the in-domain numbers look better than prior models. The joint prediction idea and the entity-based gating are presented as a new combination, and the motivation section lays out the limitation of existing dependency-guided models in a straightforward way. Those pieces are clear and could be useful to someone already working on syntactic structure in IE. The soft spot is the evaluation gap. Without domain-shift protocols the outperformance on benchmarks could simply reflect better in-domain fitting rather than the advertised generalization benefit. The paper does not appear to contain any ablation that isolates the cross-domain effect either. This is the kind of work that might interest a narrow slice of RE researchers who want to try the joint-prediction plus gating pattern on their own data. A reader could pull the architectural sketch and test it themselves, but the missing cross-domain evidence makes the paper hard to recommend as is. I would not send it for peer review until the authors add proper domain-transfer experiments that match the stated goal.

Referee Report

2 major / 2 minor

Summary. The paper claims that a novel deep learning model for relation extraction, which jointly predicts dependency and semantic relations together with an information-flow control mechanism based on entity mentions, significantly outperforms existing methods on benchmark datasets. The motivation is that strict dependency-guided computation prevents capturing important context beyond syntactic structures and leads to poor cross-domain generalization.

Significance. If the cross-domain results hold, the joint prediction of dependencies and relations plus the entity-based flow control would be a useful architectural response to a recognized limitation in dependency-guided RE models. The approach directly targets the tension between syntactic guidance and semantic flexibility.

major comments (2)

[Abstract] Abstract: the central claim concerns improved cross-domain generalization, yet the experiments are reported only on standard benchmark datasets with no domain-transfer protocols (train on one corpus, test on another). This directly undermines the title and motivation.
[Abstract] Abstract: the claim of significant outperformance is asserted without any architecture diagram, loss formulation, baseline list, metric values, or statistical tests, preventing evaluation of the result.

minor comments (2)

[Abstract] The abstract describes the method as an additive extension; a clearer statement of how the joint objective is formulated would improve readability.
[Abstract] No mention of specific evaluation metrics (e.g., F1) or statistical significance testing is supplied in the abstract.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments. We address each major comment below, indicating where revisions to the manuscript will be made.

read point-by-point responses

Referee: [Abstract] Abstract: the central claim concerns improved cross-domain generalization, yet the experiments are reported only on standard benchmark datasets with no domain-transfer protocols (train on one corpus, test on another). This directly undermines the title and motivation.

Authors: The title and introduction emphasize that avoiding overly rigid dependency-guided computation can aid cross-domain generalization. The reported results on standard benchmarks (ACE05, SemEval) already show gains over syntax-heavy baselines, which we interpret as evidence of improved flexibility. That said, the referee is correct that explicit train-on-one/test-on-another protocols are not presented. We will add such experiments (e.g., ACE05→SemEval and vice versa) with the same metrics and significance tests in the revised version. revision: yes
Referee: [Abstract] Abstract: the claim of significant outperformance is asserted without any architecture diagram, loss formulation, baseline list, metric values, or statistical tests, preventing evaluation of the result.

Authors: Abstracts are space-constrained summaries; all requested elements appear in the body: architecture diagram (Figure 1), joint loss (Eq. 3 in Section 3.2), baseline descriptions (Section 4.1), F1 scores (Table 2), and paired significance tests (Section 4.4). We therefore see no need to alter the abstract itself. revision: no

Circularity Check

0 steps flagged

No circularity detected; proposal is additive model extension without self-referential derivation

full rationale

The provided abstract and description introduce a novel joint prediction model and information-flow control as an extension to existing dependency-guided RE models. No equations, parameter fits, self-citations, or uniqueness theorems are referenced that reduce any claimed result to its own inputs by construction. The central claim is an empirical performance improvement on benchmarks rather than a first-principles derivation, making the derivation chain self-contained and non-circular.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The abstract rests on the standard NLP premise that dependency trees are useful for RE and introduces two new modeling components without listing numerical free parameters or new physical entities.

axioms (1)

domain assumption Dependency trees are a useful source of information for relation extraction
Opening sentence of the abstract.

pith-pipeline@v0.9.0 · 5665 in / 1160 out tokens · 23488 ms · 2026-05-25T01:47:04.786039+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

21 extracted references · 21 canonical work pages

[1]

A shortest path dependency kernel for relation extraction

[Bunescu and Mooney, 2005] Razvan C Bunescu and Ray- mond J Mooney. A shortest path dependency kernel for relation extraction. In EMNLP,

work page 2005
[2]

Chan and Dan Roth

[Chan and Roth, 2010 ] Y ee S. Chan and Dan Roth. Ex- ploiting background knowledge for relation extraction. In COLING,

work page 2010
[3]

Domain adaptation for rela- tion extraction with domain adversarial neural network

[Fu et al., 2017] Lisheng Fu, Thien Huu Nguyen, Bonan Min, and Ralph Grishman. Domain adaptation for rela- tion extraction with domain adversarial neural network. In IJCNLP,

work page 2017
[4]

A case study on learn- ing a uniﬁed encoder of relations

[Fu et al., 2018] Lisheng Fu, Bonan Min, Thien Huu Nguyen, and Ralph Grishman. A case study on learn- ing a uniﬁed encoder of relations. In Proceedings of the 4th W orkshop on Noisy User-generated T ext (W-NUT) at EMNLP 2018,

work page 2018
[5]

Improved relation extraction with feature- rich compositional embedding models

[Gormley et al., 2015] Matthew R Gormley, Mo Y u, and Mark Dredze. Improved relation extraction with feature- rich compositional embedding models. EMNLP,

work page 2015
[6]

Semeval-2010 task 8: Multi-way clas- siﬁcation of semantic relations between pairs of nominals

[Hendrickx et al., 2010] Iris Hendrickx, Su Nam Kim, Zor- nitsa Kozareva, Preslav Nakov, Diarmuid ´O S´ eaghdha, Se- bastian Pad´ o, Marco Pennacchiotti, Lorenza Romano, and Stan Szpakowicz. Semeval-2010 task 8: Multi-way clas- siﬁcation of semantic relations between pairs of nominals. In Proceedings of SEW-2009,

work page 2010
[7]

A dependency-based neural network for relation classiﬁcation

[Liu et al., 2015] Y ang Liu, Furu Wei, Sujian Li, Heng Ji, Ming Zhou, and Houfeng Wang. A dependency-based neural network for relation classiﬁcation. In ACL,

work page 2015
[8]

End-to-end relation extraction using lstms on sequences and tree structures

[Miwa and Bansal, 2016 ] Makoto Miwa and Mohit Bansal. End-to-end relation extraction using lstms on sequences and tree structures. ACL,

work page 2016
[9]

Employing word representations and regularization for domain adaptation of relation extraction

[Nguyen and Grishman, 2014 ] Thien Huu Nguyen and Ralph Grishman. Employing word representations and regularization for domain adaptation of relation extraction. In ACL,

work page 2014
[10]

Relation extraction: Perspective from convolutional neural networks

[Nguyen and Grishman, 2015a ] Thien Huu Nguyen and Ralph Grishman. Relation extraction: Perspective from convolutional neural networks. In The NAACL W orkshop on V ector Space Modeling for NLP (VSM), 2015a. [Nguyen and Grishman, 2016 ] Thien Huu Nguyen and Ralph Grishman. Combining neural networks and log- linear models to improve relation extraction. Pro...

work page 2016
[11]

Who is killed by police: Introducing supervised attention for hierarchical lstms

[Nguyen and Nguyen, 2018b ] Minh Nguyen and Thien Huu Nguyen. Who is killed by police: Introducing supervised attention for hierarchical lstms. In Proceedings of COL- ING, 2018b. [Nguyen et al., 2015b] Thien Huu Nguyen, Barbara Plank, and Ralph Grishman. Semantic representations for domain adaptation: A case study on the tree kernel-based method for relat...

work page 2013
[12]

Genre separation network with adversarial training for cross- genre relation extraction

[Shi et al., 2018] Ge Shi, Chong Feng, Lifu Huang, Boliang Zhang, Heng Ji, Lejian Liao, and Heyan Huang. Genre separation network with adversarial training for cross- genre relation extraction. In EMNLP,

work page 2018
[13]

Linguistically-informed self-attention for semantic rol e labeling

[Strubell et al., 2018] Emma Strubell, Patrick V erga, Daniel Andor, David Weiss, and Andrew McCallum. Linguistically-informed self-attention for semantic rol e labeling. In EMNLP,

work page 2018
[14]

Semi-supervised relation extraction with large- scale word clustering

[Sun et al., 2011] Ang Sun, Ralph Grishman, and Satoshi Sekine. Semi-supervised relation extraction with large- scale word clustering. In ACL,

work page 2011
[15]

Attention is all you need

[V aswaniet al., 2017] Ashish V aswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Ł ukasz Kaiser, and Illia Polosukhin. Attention is all you need. In NIPS,

work page 2017
[16]

Combining word embeddings and feature embed- dings for ﬁne-grained relation extraction

[Y uet al., 2015] Mo Y u, Matthew R Gormley, and Mark Dredze. Combining word embeddings and feature embed- dings for ﬁne-grained relation extraction. In NAACL-HLT,

work page 2015
[17]

Relation classiﬁcation via convolutional deep neural network

[Zeng et al., 2014] Daojian Zeng, Kang Liu, Siwei Lai, Guangyou Zhou, and Jun Zhao. Relation classiﬁcation via convolutional deep neural network. In COLING,

work page 2014
[18]

Position-aware attention and supervised data improve slot ﬁlling

[Zhang et al., 2017] Y uhao Zhang, Victor Zhong, Danqi Chen, Gabor Angeli, and Christopher D Manning. Position-aware attention and supervised data improve slot ﬁlling. In Proceedings of EMNLP, pages 35–45,

work page 2017
[19]

Graph convolution over pruned depen- dency trees improves relation extraction

[Zhang et al., 2018] Y uhao Zhang, Peng Qi, and Christo- pher D Manning. Graph convolution over pruned depen- dency trees improves relation extraction. In EMNLP,

work page 2018
[20]

Exploring various knowledge in relation ex- traction

[Zhou et al., 2005] Guodong Zhou, Jian Su, Jie Zhang, and Min Zhang. Exploring various knowledge in relation ex- traction. In ACL,

work page 2005
[21]

Attention- based bidirectional long short-term memory networks for relation classiﬁcation

[Zhou et al., 2016] Peng Zhou, Wei Shi, Jun Tian, Zhenyu Qi, Bingchen Li, Hongwei Hao, and Bo Xu. Attention- based bidirectional long short-term memory networks for relation classiﬁcation. In ACL, 2016

work page 2016

[1] [1]

A shortest path dependency kernel for relation extraction

[Bunescu and Mooney, 2005] Razvan C Bunescu and Ray- mond J Mooney. A shortest path dependency kernel for relation extraction. In EMNLP,

work page 2005

[2] [2]

Chan and Dan Roth

[Chan and Roth, 2010 ] Y ee S. Chan and Dan Roth. Ex- ploiting background knowledge for relation extraction. In COLING,

work page 2010

[3] [3]

Domain adaptation for rela- tion extraction with domain adversarial neural network

[Fu et al., 2017] Lisheng Fu, Thien Huu Nguyen, Bonan Min, and Ralph Grishman. Domain adaptation for rela- tion extraction with domain adversarial neural network. In IJCNLP,

work page 2017

[4] [4]

A case study on learn- ing a uniﬁed encoder of relations

[Fu et al., 2018] Lisheng Fu, Bonan Min, Thien Huu Nguyen, and Ralph Grishman. A case study on learn- ing a uniﬁed encoder of relations. In Proceedings of the 4th W orkshop on Noisy User-generated T ext (W-NUT) at EMNLP 2018,

work page 2018

[5] [5]

Improved relation extraction with feature- rich compositional embedding models

[Gormley et al., 2015] Matthew R Gormley, Mo Y u, and Mark Dredze. Improved relation extraction with feature- rich compositional embedding models. EMNLP,

work page 2015

[6] [6]

Semeval-2010 task 8: Multi-way clas- siﬁcation of semantic relations between pairs of nominals

[Hendrickx et al., 2010] Iris Hendrickx, Su Nam Kim, Zor- nitsa Kozareva, Preslav Nakov, Diarmuid ´O S´ eaghdha, Se- bastian Pad´ o, Marco Pennacchiotti, Lorenza Romano, and Stan Szpakowicz. Semeval-2010 task 8: Multi-way clas- siﬁcation of semantic relations between pairs of nominals. In Proceedings of SEW-2009,

work page 2010

[7] [7]

A dependency-based neural network for relation classiﬁcation

[Liu et al., 2015] Y ang Liu, Furu Wei, Sujian Li, Heng Ji, Ming Zhou, and Houfeng Wang. A dependency-based neural network for relation classiﬁcation. In ACL,

work page 2015

[8] [8]

End-to-end relation extraction using lstms on sequences and tree structures

[Miwa and Bansal, 2016 ] Makoto Miwa and Mohit Bansal. End-to-end relation extraction using lstms on sequences and tree structures. ACL,

work page 2016

[9] [9]

Employing word representations and regularization for domain adaptation of relation extraction

[Nguyen and Grishman, 2014 ] Thien Huu Nguyen and Ralph Grishman. Employing word representations and regularization for domain adaptation of relation extraction. In ACL,

work page 2014

[10] [10]

Relation extraction: Perspective from convolutional neural networks

[Nguyen and Grishman, 2015a ] Thien Huu Nguyen and Ralph Grishman. Relation extraction: Perspective from convolutional neural networks. In The NAACL W orkshop on V ector Space Modeling for NLP (VSM), 2015a. [Nguyen and Grishman, 2016 ] Thien Huu Nguyen and Ralph Grishman. Combining neural networks and log- linear models to improve relation extraction. Pro...

work page 2016

[11] [11]

Who is killed by police: Introducing supervised attention for hierarchical lstms

[Nguyen and Nguyen, 2018b ] Minh Nguyen and Thien Huu Nguyen. Who is killed by police: Introducing supervised attention for hierarchical lstms. In Proceedings of COL- ING, 2018b. [Nguyen et al., 2015b] Thien Huu Nguyen, Barbara Plank, and Ralph Grishman. Semantic representations for domain adaptation: A case study on the tree kernel-based method for relat...

work page 2013

[12] [12]

Genre separation network with adversarial training for cross- genre relation extraction

[Shi et al., 2018] Ge Shi, Chong Feng, Lifu Huang, Boliang Zhang, Heng Ji, Lejian Liao, and Heyan Huang. Genre separation network with adversarial training for cross- genre relation extraction. In EMNLP,

work page 2018

[13] [13]

Linguistically-informed self-attention for semantic rol e labeling

[Strubell et al., 2018] Emma Strubell, Patrick V erga, Daniel Andor, David Weiss, and Andrew McCallum. Linguistically-informed self-attention for semantic rol e labeling. In EMNLP,

work page 2018

[14] [14]

Semi-supervised relation extraction with large- scale word clustering

[Sun et al., 2011] Ang Sun, Ralph Grishman, and Satoshi Sekine. Semi-supervised relation extraction with large- scale word clustering. In ACL,

work page 2011

[15] [15]

Attention is all you need

[V aswaniet al., 2017] Ashish V aswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Ł ukasz Kaiser, and Illia Polosukhin. Attention is all you need. In NIPS,

work page 2017

[16] [16]

Combining word embeddings and feature embed- dings for ﬁne-grained relation extraction

[Y uet al., 2015] Mo Y u, Matthew R Gormley, and Mark Dredze. Combining word embeddings and feature embed- dings for ﬁne-grained relation extraction. In NAACL-HLT,

work page 2015

[17] [17]

Relation classiﬁcation via convolutional deep neural network

[Zeng et al., 2014] Daojian Zeng, Kang Liu, Siwei Lai, Guangyou Zhou, and Jun Zhao. Relation classiﬁcation via convolutional deep neural network. In COLING,

work page 2014

[18] [18]

Position-aware attention and supervised data improve slot ﬁlling

[Zhang et al., 2017] Y uhao Zhang, Victor Zhong, Danqi Chen, Gabor Angeli, and Christopher D Manning. Position-aware attention and supervised data improve slot ﬁlling. In Proceedings of EMNLP, pages 35–45,

work page 2017

[19] [19]

Graph convolution over pruned depen- dency trees improves relation extraction

[Zhang et al., 2018] Y uhao Zhang, Peng Qi, and Christo- pher D Manning. Graph convolution over pruned depen- dency trees improves relation extraction. In EMNLP,

work page 2018

[20] [20]

Exploring various knowledge in relation ex- traction

[Zhou et al., 2005] Guodong Zhou, Jian Su, Jie Zhang, and Min Zhang. Exploring various knowledge in relation ex- traction. In ACL,

work page 2005

[21] [21]

Attention- based bidirectional long short-term memory networks for relation classiﬁcation

[Zhou et al., 2016] Peng Zhou, Wei Shi, Jun Tian, Zhenyu Qi, Bingchen Li, Hongwei Hao, and Bo Xu. Attention- based bidirectional long short-term memory networks for relation classiﬁcation. In ACL, 2016

work page 2016