LLM-based Atomic Propositions help weak extractors: Evaluation of a Propositioner for triplet extraction

2026 · cs.CL · arXiv 2604.02866

1 Pith paper cites this work. Polarity classification is still indexing.

abstract

Knowledge Graph construction from natural language requires extracting structured triplets from complex, information-dense sentences. In this paper, we investigate whether decomposing text into atomic propositions (minimal, semantically autonomous units of information) can improve triplet extraction. We introduce MPropositionneur-V2, a small multilingual model covering six European languages, trained by knowledge distillation from Qwen3-32B into a Qwen3-0.6B architecture, and we evaluate its integration into two extraction paradigms: entity-centric (GLiREL) and generative (Qwen3). Experiments on SMiLER, FewRel, DocRED and CaRB show that atomic propositions benefit weaker extractors (GLiREL, CoreNLP, 0.6B models), improving relation recall and, in the multilingual setting, overall accuracy. For stronger LLMs, a fallback combination strategy recovers entity recall losses while preserving the gains in relation extraction. These results show that atomic propositions are an interpretable intermediate data structure that complements extractors without replacing them.
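
The abstract does not spell out how the fallback combination works, so the following is only a rough sketch of one plausible reading: extract triplets from each atomic proposition first, then add sentence-level triplets for entity pairs the propositions missed. The function `extract_with_fallback`, the pair-based merge rule, and the lookup tables standing in for the propositioner and the extractor are all hypothetical illustrations; none of them reproduces MPropositionneur-V2, GLiREL, or Qwen3.

```python
from typing import Callable

Triplet = tuple[str, str, str]  # (subject, relation, object)


def extract_with_fallback(
    sentence: str,
    propositioner: Callable[[str], list[str]],
    extractor: Callable[[str], list[Triplet]],
) -> list[Triplet]:
    """Assumed merge rule: proposition-level triplets take priority;
    sentence-level triplets are kept only for unseen (subject, object) pairs."""
    merged: list[Triplet] = []
    seen_pairs: set[tuple[str, str]] = set()

    # Pass 1: run the extractor on each atomic proposition.
    for proposition in propositioner(sentence):
        for subj, rel, obj in extractor(proposition):
            if (subj, obj) not in seen_pairs:
                seen_pairs.add((subj, obj))
                merged.append((subj, rel, obj))

    # Pass 2 (fallback): run the extractor on the original sentence and
    # recover entity pairs that the decomposition lost.
    for subj, rel, obj in extractor(sentence):
        if (subj, obj) not in seen_pairs:
            seen_pairs.add((subj, obj))
            merged.append((subj, rel, obj))

    return merged


# Toy stand-ins (hypothetical data, not model output). The decomposition
# deliberately drops the marriage fact to mimic an entity recall loss.
SENTENCE = "Marie Curie, born in Warsaw and married to Pierre Curie, won the Nobel Prize."
P1 = "Marie Curie was born in Warsaw."
P2 = "Marie Curie won the Nobel Prize."

TOY_PROPOSITIONS = {SENTENCE: [P1, P2]}
TOY_TRIPLETS = {
    P1: [("Marie Curie", "born_in", "Warsaw")],
    P2: [("Marie Curie", "won", "Nobel Prize")],
    SENTENCE: [
        ("Marie Curie", "won", "Nobel Prize"),
        ("Marie Curie", "spouse_of", "Pierre Curie"),
    ],
}


def toy_propositioner(text: str) -> list[str]:
    # Unknown text is returned unchanged as a single proposition.
    return TOY_PROPOSITIONS.get(text, [text])


def toy_extractor(text: str) -> list[Triplet]:
    return TOY_TRIPLETS.get(text, [])


result = extract_with_fallback(SENTENCE, toy_propositioner, toy_extractor)
for triplet in result:
    print(triplet)
```

In this toy run the two proposition-level triplets are kept, the duplicate sentence-level `won` triplet is skipped, and the `spouse_of` triplet is recovered by the fallback pass. Keying the merge on the (subject, object) pair is a design choice of this sketch, made because the abstract frames the fallback as recovering entity recall.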

representative citing papers

LLM-based Atomic Propositions help weak extractors: Evaluation of a Propositioner for triplet extraction

cs.CL · 2026-04-03 · unverdicted · novelty 5.0

Atomic propositions improve relation recall for weak triplet extractors like GLiREL and small models across SMiLER, FewRel, DocRED and CaRB, while requiring fallback for stronger LLMs.
