hub

author Montani, I

Ines Montani, Matthew Honnibal, Adriane Boyd, Sofie Van Landeghem, Henning Peters · 2020 · Zenodo (CERN European Organization for Nuclear Research) · DOI 10.5281/zenodo.1212303

28 Pith papers cite this work, alongside 425 external citations. Polarity classification is still indexing.

28 Pith papers citing it

425 external citations · Crossref

open at publisher browse 28 citing papers

hub tools

JSON dossier citing papers JSON publisher DOI

citation-role summary

method 2

citation-polarity summary

use method 2

representative citing papers

From Circuit Evidence to Mechanistic Theory: An Inductive Logic Approach

cs.LG · 2026-05-20 · unverdicted · novelty 7.0

Introduces Causal Functional Signatures grounded in causal evidence and ILP-learned architectural signatures to enable explicit, comparable, and portable mechanistic claims across model scales.

Linked Multi-Model Data on Russian Domestic and Foreign Policy Speeches

cs.CL · 2026-05-15 · unverdicted · novelty 7.0

A new linked multimodal dataset of Russian domestic and foreign policy speeches with texts, images, captions, harmonized metadata, and expert-refined topic annotations is introduced to support analyses in political communication and LLM applications.

Matching Meaning at Scale: Evaluating Semantic Search for 18th-Century Intellectual History through the Case of Locke

cs.CL · 2026-05-10 · unverdicted · novelty 7.0 · 2 refs

Semantic search retrieves substantially more implicit receptions of Locke's work than lexical baselines in 18th-century corpora, yet remains constrained by lexical gatekeeping.

Mapping Emerging Climate Misinformation Playbooks in the Global South

cs.SI · 2026-04-27 · unverdicted · novelty 7.0

Brazilian YouTube climate videos show a transition from traditional denial of climate science to 'new denial' that undermines solutions, with the latter attracting more engagement from diverse actors.

Semantic Needles in Document Haystacks: Sensitivity Testing of LLM-as-a-Judge Similarity Scoring

cs.CL · 2026-04-20 · unverdicted · novelty 7.0

LLMs exhibit positional bias and context-dependent scoring patterns when judging document similarity, with each model showing a stable scoring fingerprint but a shared hierarchy of sensitivity to different semantic perturbations.

Saying More Than They Know: A Framework for Quantifying Epistemic-Rhetorical Miscalibration in Large Language Models

cs.CL · 2026-03-27 · unverdicted · novelty 7.0

LLMs display a consistent pattern of elevated form-meaning divergence and uniform rhetorical device use in argumentative texts compared to humans, quantified by new metrics FMD, GPR, and RDDE.

SAM 3: Segment Anything with Concepts

cs.CV · 2025-11-20 · unverdicted · novelty 7.0

SAM 3 introduces promptable concept segmentation that doubles accuracy of prior systems on images and videos while improving standard SAM segmentation performance.

TSVer: A Benchmark for Fact Verification Against Time-Series Evidence

cs.CL · 2025-11-02 · unverdicted · novelty 7.0

TSVer is a new benchmark dataset for fact verification against time-series evidence, with 304 annotated real-world claims, 400 time series, verdicts, and justifications, plus baseline results showing current models struggle.

Phonemes to the Rescue: Multilingual Tokenization Based on International Phonetic Alphabet

cs.CL · 2026-06-18 · unverdicted · novelty 6.0

IPA-based subword tokenizers trained across 24 languages improve tokenization quality and generalization to unseen languages compared to standard text tokenizers, especially for non-Latin scripts.

On The Effectiveness-Fluency Trade-Off In LLM Conditioning: A Systematic Study

cs.CL · 2026-06-10 · unverdicted · novelty 6.0

Systematic experiments reveal that activation steering trades fluency for concept control, is less effective on instruction-tuned models, and that prompting/SFT excel at injection but not removal, with textual metrics correlating to LLM judges.

Arabic Sentence Segmentation Across Genres and Punctuation Conditions

cs.CL · 2026-06-06 · unverdicted · novelty 6.0

AraSEG is a genre-diverse Arabic sentence segmentation corpus showing lightweight encoders and dependency parsers outperform LLMs under challenging punctuation while improving downstream parsing.

Isolating LLM Lexical Bias: A Curation-Free Triangulated Metric for Preference-Stage Learning

cs.CL · 2026-05-29 · unverdicted · novelty 6.0

Introduces a triangulation-based metric to quantify lexical shifts attributable to preference tuning without requiring manual curation of examples.

Probing Minimalist Phase Structure in LLMs: What Universal Dependencies Cannot Represent

cs.CL · 2026-05-26 · unverdicted · novelty 6.0

Structural probes on UD-invariant wh-movement stimuli reveal phase-count gradients and phase-internal cohesion effects in 12-13 of 13 LLMs, indicating syntactic abstractions beyond UD annotations.

Towards Continuous Sign Language Conversation from Isolated Signs

cs.CV · 2026-05-14 · unverdicted · novelty 6.0

Constructs continuous sign conversation data from isolated signs using retrieval and diffusion models to train a direct sign-to-sign conversational AI.

ATD-Trans: A Geographically Grounded Japanese-English Travelogue Translation Dataset

cs.CL · 2026-05-13 · conditional · novelty 6.0

ATD-Trans is a new geographically annotated Japanese-English travelogue dataset that reveals Japanese-enhanced models perform better on geo-entity translation while domestic Japanese locations remain harder to translate accurately.

Beyond Decodability: Reconstructing Language Model Representations with an Encoding Probe

cs.CL · 2026-05-01 · unverdicted · novelty 6.0

An encoding probe reconstructs transformer representations from acoustic, phonetic, syntactic, lexical and speaker features, showing independent syntactic/lexical contributions and training-dependent speaker effects.

MediaGraph: A Network Theoretic Framework to Analyze Reporting Preferences in Indian News Media

cs.SI · 2026-04-22 · unverdicted · novelty 6.0

MediaGraph uses co-occurrence networks from Indian news on farmer protests and a new link predictability metric to reveal source-specific reporting preferences and under-representation of farmer leaders.

A systematic framework for generating novel experimental hypotheses from language models

cs.CL · 2024-08-09 · unverdicted · novelty 6.0

A framework using language models to simulate non-existent experiments and derive novel testable hypotheses on dative verb acquisition and cross-structural generalization in children.

"Don't Say It!": Constraints, Compliance, and Communication when Language Models Play Taboo

cs.CL · 2026-07-01 · unverdicted · novelty 5.0

LLMs exhibit different trade-offs between rule compliance and communicative success across prompting, generation constraints, and representation interventions, but remain substantially weaker than humans at guessing under lexical constraints.

Revisiting Compositionality in Dual-Encoder Vision-Language Models: The Role of Inference

cs.CV · 2026-04-13 · unverdicted · novelty 5.0

Dual-encoder VLMs gain robust compositional generalization by learning localized alignments from frozen patch and token embeddings instead of using global similarity.

Contradictions in Context: Challenges for Retrieval-Augmented Generation in Healthcare

cs.IR · 2025-11-10 · unverdicted · novelty 5.0

Contradictions between highly similar medical abstracts degrade the factual accuracy and consistency of LLM responses in retrieval-augmented generation.

Reducing Redundancy in Retrieval-Augmented Generation through Chunk Filtering

cs.CL · 2026-04-27 · unverdicted · novelty 4.0

Entity-based chunk filtering reduces RAG vector index size by 25-36% with retrieval quality near baseline levels.

LLM-Redactor: An Empirical Evaluation of Eight Techniques for Privacy-Preserving LLM Requests

cs.CR · 2026-04-13 · unverdicted · novelty 4.0

No single privacy technique wins; combining local inference, redaction, and semantic rephrasing limits PII leaks to 0.6% and proprietary code leaks to 31.3% on a 1,300-sample benchmark, with code released.

Best Preprocessing Techniques for Sentiment Analysis

cs.CL · 2026-06-23 · unverdicted · novelty 3.0

Empirical comparison finds tokenization most important and recommends specific preprocessing order for Twitter sentiment analysis models.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Utility-Preserving De-Identification for Math Tutoring: Investigating Numeric Ambiguity in the MathEd-PII Benchmark Dataset cs.CL · 2026-02-18 · unreviewed · ref 20

author Montani, I

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer