Quantifying Similarity between Relations with Fact Distribution

Hao Zhu; Maosong Sun; Weize Chen; Xu Han; Zhiyuan Liu

arxiv: 1907.08937 · v1 · pith:ODLQLIRSnew · submitted 2019-07-21 · 💻 cs.AI · cs.CL· cs.LG· stat.ML

Quantifying Similarity between Relations with Fact Distribution

Weize Chen , Hao Zhu , Xu Han , Zhiyuan Liu , Maosong Sun This is my paper

Pith reviewed 2026-05-24 18:51 UTC · model grok-4.3

classification 💻 cs.AI cs.CLcs.LGstat.ML

keywords relation similarityknowledge basesfact distributionopen information extractionrelational classificationnegative samplingdistribution divergence

0 comments

The pith

Divergence between conditional distributions over entity pairs quantifies similarity between relations in knowledge bases.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents a method to measure how similar two relations are by computing the divergence between the conditional probability distributions over the entity pairs each relation connects. These distributions are modeled by a simple neural network, with sampling used to approximate the otherwise intractable divergence value. The resulting similarity scores show strong correlation with human judgments of relation similarity. When applied to downstream tasks, the scores identify redundant relations produced by open information extraction systems, expose classification errors made by strong models on near-identical relations, and can be plugged into negative sampling and softmax training to reduce those errors.

Core claim

The similarity between two relations equals the divergence between the conditional probability distributions over entity pairs that instantiate each relation; a sampling procedure from a neural network that parameterizes these distributions yields a practical approximation whose outputs align with human similarity judgments.

What carries the argument

Divergence between conditional probability distributions over entity pairs, parameterized by a simple neural network and approximated by sampling.

If this is right

The scores can detect redundant relations extracted by open information extraction models.
Even the strongest relational classification models still confuse very similar relations.
The measure can be inserted into negative sampling and softmax classification to reduce errors on similar relations.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same distributional divergence idea could be tested on other structured prediction settings such as event or attribute similarity.
Knowledge-base merging pipelines might use these scores to decide when two relation schemas should be aligned or collapsed.
Model confusion matrices for relation extraction could be re-ranked by this measure to prioritize error analysis on the most semantically close pairs.

Load-bearing premise

Semantic similarity between relations is captured by low divergence between the distributions of entity pairs they relate.

What would settle it

If the computed divergences show no significant correlation with a fresh set of human similarity ratings collected independently of the original experiments, the central claim would be falsified.

read the original abstract

We introduce a conceptually simple and effective method to quantify the similarity between relations in knowledge bases. Specifically, our approach is based on the divergence between the conditional probability distributions over entity pairs. In this paper, these distributions are parameterized by a very simple neural network. Although computing the exact similarity is in-tractable, we provide a sampling-based method to get a good approximation. We empirically show the outputs of our approach significantly correlate with human judgments. By applying our method to various tasks, we also find that (1) our approach could effectively detect redundant relations extracted by open information extraction (Open IE) models, that (2) even the most competitive models for relational classification still make mistakes among very similar relations, and that (3) our approach could be incorporated into negative sampling and softmax classification to alleviate these mistakes. The source code and experiment details of this paper can be obtained from https://github.com/thunlp/relation-similarity.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This paper scores relation similarity by divergence between NN-parameterized distributions over entity pairs, shows human correlation, and demonstrates uses in spotting OpenIE duplicates and fixing classification errors.

read the letter

The paper defines relation similarity directly as the divergence between the conditional distributions P(entity pairs | relation). They parameterize those distributions with a simple neural network and approximate the divergence via sampling because exact computation is intractable. They report that the resulting scores correlate with human judgments and then apply the measure to three tasks: detecting redundant relations from OpenIE, exposing confusions among similar relations in strong classifiers, and improving negative sampling plus softmax training. Code is released, which helps reproducibility. The core idea is straightforward and the downstream experiments tie the measure to concrete problems in KB work and relation extraction. The main soft spot is dependence on the neural parameterization: if the network does not capture the true distributions well, the divergence may not track semantic similarity as cleanly as the human correlations suggest. The abstract gives little detail on architecture, training, or sampling quality, so those choices need checking in the full paper. This is aimed at people building or cleaning knowledge bases and training relation models. It has a clear, testable hypothesis plus released code and multiple task results, so it deserves a serious referee rather than desk rejection.

Referee Report

0 major / 3 minor

Summary. The paper introduces a method to quantify similarity between relations in knowledge bases by measuring the divergence between conditional probability distributions over entity pairs, parameterized by a simple neural network. It provides a sampling-based approximation to the intractable exact computation and reports that the resulting similarity scores significantly correlate with human judgments. The method is then applied to detect redundant relations from Open IE systems, to analyze errors among similar relations in competitive relational classification models, and to improve negative sampling and softmax-based classification.

Significance. If the reported human correlation and task improvements hold under the sampling approximation, the work supplies a practical, data-driven similarity measure for relations that can be directly integrated into KB construction and classification pipelines. The public release of code and experiment details is a clear strength that supports reproducibility and follow-on work.

minor comments (3)

[§3.1] §3.1: the neural network architecture used to parameterize P(entity-pairs | relation) is described only at a high level; an explicit diagram or layer sizes would clarify the 'very simple' claim and aid replication.
[Table 2] Table 2: the reported correlation coefficients lack confidence intervals or p-values, making it difficult to assess whether the 'significant' correlation is robust across the sampled entity pairs.
[§4.3] §4.3: when the similarity measure is incorporated into negative sampling, the paper does not state how many negative samples are drawn per positive or whether the sampling is performed once or dynamically during training.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for the positive summary, recognition of the method's practicality, and recommendation for minor revision. We appreciate the note on reproducibility via code release.

Circularity Check

0 steps flagged

No significant circularity; measure is explicitly defined and externally validated

full rationale

The paper defines relation similarity directly as divergence between NN-parameterized P(entity-pairs | relation) distributions, provides a sampling approximation, and evaluates the resulting scores via human correlation and task utility. No step reduces a claimed prediction or first-principles result to a fitted parameter or self-citation by construction; the core hypothesis is the object of empirical testing rather than an unverified premise. Code release and direct human judgments supply independent external anchors.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Review limited to abstract; no explicit free parameters, axioms, or invented entities are detailed beyond the high-level premise that distribution divergence quantifies relation similarity.

axioms (1)

domain assumption Divergence between conditional probability distributions over entity pairs captures relation similarity
Stated as the basis of the approach in the abstract.

pith-pipeline@v0.9.0 · 5700 in / 1201 out tokens · 26355 ms · 2026-05-24T18:51:37.430783+00:00 · methodology

Quantifying Similarity between Relations with Fact Distribution

Core claim

What carries the argument

If this is right

Where Pith is reading between the lines

Load-bearing premise

What would settle it

discussion (0)