author Montani, I

Ines Montani, Matthew Honnibal, Adriane Boyd, Sofie Van Landeghem, Henning Peters · 2020 · Zenodo (CERN European Organization for Nuclear Research) · DOI 10.5281/zenodo.1212303

28 Pith papers cite this work, alongside 425 external citations (per Crossref). Polarity classification is still indexing.

citation-role summary

method 2

citation-polarity summary

use method 2

representative citing papers

From Circuit Evidence to Mechanistic Theory: An Inductive Logic Approach

cs.LG · 2026-05-20 · unverdicted · novelty 7.0

Introduces Causal Functional Signatures grounded in causal evidence and ILP-learned architectural signatures to enable explicit, comparable, and portable mechanistic claims across model scales.

Linked Multi-Model Data on Russian Domestic and Foreign Policy Speeches

cs.CL · 2026-05-15 · unverdicted · novelty 7.0

A new linked multimodal dataset of Russian domestic and foreign policy speeches with texts, images, captions, harmonized metadata, and expert-refined topic annotations is introduced to support analyses in political communication and LLM applications.

Matching Meaning at Scale: Evaluating Semantic Search for 18th-Century Intellectual History through the Case of Locke

cs.CL · 2026-05-10 · unverdicted · novelty 7.0 · 2 refs

Semantic search retrieves substantially more implicit receptions of Locke's work than lexical baselines in 18th-century corpora, yet remains constrained by lexical gatekeeping.

Mapping Emerging Climate Misinformation Playbooks in the Global South

cs.SI · 2026-04-27 · unverdicted · novelty 7.0

Brazilian YouTube climate videos show a transition from traditional denial of climate science to 'new denial' that undermines solutions, with the latter attracting more engagement from diverse actors.

Semantic Needles in Document Haystacks: Sensitivity Testing of LLM-as-a-Judge Similarity Scoring

cs.CL · 2026-04-20 · unverdicted · novelty 7.0

LLMs exhibit positional bias and context-dependent scoring patterns when judging document similarity, with each model showing a stable scoring fingerprint but a shared hierarchy of sensitivity to different semantic perturbations.

Saying More Than They Know: A Framework for Quantifying Epistemic-Rhetorical Miscalibration in Large Language Models

cs.CL · 2026-03-27 · unverdicted · novelty 7.0

LLMs display a consistent pattern of elevated form-meaning divergence and uniform rhetorical device use in argumentative texts compared to humans, quantified by new metrics FMD, GPR, and RDDE.

SAM 3: Segment Anything with Concepts

cs.CV · 2025-11-20 · unverdicted · novelty 7.0

SAM 3 introduces promptable concept segmentation that doubles accuracy of prior systems on images and videos while improving standard SAM segmentation performance.

TSVer: A Benchmark for Fact Verification Against Time-Series Evidence

cs.CL · 2025-11-02 · unverdicted · novelty 7.0

TSVer is a new benchmark dataset for fact verification against time-series evidence, with 304 annotated real-world claims, 400 time series, verdicts, and justifications, plus baseline results showing current models struggle.

Phonemes to the Rescue: Multilingual Tokenization Based on International Phonetic Alphabet

cs.CL · 2026-06-18 · unverdicted · novelty 6.0

IPA-based subword tokenizers trained across 24 languages improve tokenization quality and generalization to unseen languages compared to standard text tokenizers, especially for non-Latin scripts.

On The Effectiveness-Fluency Trade-Off In LLM Conditioning: A Systematic Study

cs.CL · 2026-06-10 · unverdicted · novelty 6.0

Systematic experiments reveal that activation steering trades fluency for concept control, is less effective on instruction-tuned models, and that prompting/SFT excel at injection but not removal, with textual metrics correlating to LLM judges.

Arabic Sentence Segmentation Across Genres and Punctuation Conditions

cs.CL · 2026-06-06 · unverdicted · novelty 6.0

AraSEG is a genre-diverse Arabic sentence segmentation corpus showing lightweight encoders and dependency parsers outperform LLMs under challenging punctuation while improving downstream parsing.

Isolating LLM Lexical Bias: A Curation-Free Triangulated Metric for Preference-Stage Learning

cs.CL · 2026-05-29 · unverdicted · novelty 6.0

Introduces a triangulation-based metric to quantify lexical shifts attributable to preference tuning without requiring manual curation of examples.

Probing Minimalist Phase Structure in LLMs: What Universal Dependencies Cannot Represent

cs.CL · 2026-05-26 · unverdicted · novelty 6.0

Structural probes on UD-invariant wh-movement stimuli reveal phase-count gradients and phase-internal cohesion effects in 12-13 of 13 LLMs, indicating syntactic abstractions beyond UD annotations.

Towards Continuous Sign Language Conversation from Isolated Signs

cs.CV · 2026-05-14 · unverdicted · novelty 6.0

Constructs continuous sign conversation data from isolated signs using retrieval and diffusion models to train a direct sign-to-sign conversational AI.

ATD-Trans: A Geographically Grounded Japanese-English Travelogue Translation Dataset

cs.CL · 2026-05-13 · conditional · novelty 6.0

ATD-Trans is a new geographically annotated Japanese-English travelogue dataset that reveals Japanese-enhanced models perform better on geo-entity translation while domestic Japanese locations remain harder to translate accurately.

Beyond Decodability: Reconstructing Language Model Representations with an Encoding Probe

cs.CL · 2026-05-01 · unverdicted · novelty 6.0

An encoding probe reconstructs transformer representations from acoustic, phonetic, syntactic, lexical and speaker features, showing independent syntactic/lexical contributions and training-dependent speaker effects.

MediaGraph: A Network Theoretic Framework to Analyze Reporting Preferences in Indian News Media

cs.SI · 2026-04-22 · unverdicted · novelty 6.0

MediaGraph uses co-occurrence networks from Indian news on farmer protests and a new link predictability metric to reveal source-specific reporting preferences and under-representation of farmer leaders.

A systematic framework for generating novel experimental hypotheses from language models

cs.CL · 2024-08-09 · unverdicted · novelty 6.0

A framework using language models to simulate non-existent experiments and derive novel testable hypotheses on dative verb acquisition and cross-structural generalization in children.

"Don't Say It!": Constraints, Compliance, and Communication when Language Models Play Taboo

cs.CL · 2026-07-01 · unverdicted · novelty 5.0

LLMs exhibit different trade-offs between rule compliance and communicative success across prompting, generation constraints, and representation interventions, but remain substantially weaker than humans at guessing under lexical constraints.

Revisiting Compositionality in Dual-Encoder Vision-Language Models: The Role of Inference

cs.CV · 2026-04-13 · unverdicted · novelty 5.0

Dual-encoder VLMs gain robust compositional generalization by learning localized alignments from frozen patch and token embeddings instead of using global similarity.

Contradictions in Context: Challenges for Retrieval-Augmented Generation in Healthcare

cs.IR · 2025-11-10 · unverdicted · novelty 5.0

Contradictions between highly similar medical abstracts degrade the factual accuracy and consistency of LLM responses in retrieval-augmented generation.

Reducing Redundancy in Retrieval-Augmented Generation through Chunk Filtering

cs.CL · 2026-04-27 · unverdicted · novelty 4.0

Entity-based chunk filtering reduces RAG vector index size by 25-36% with retrieval quality near baseline levels.

LLM-Redactor: An Empirical Evaluation of Eight Techniques for Privacy-Preserving LLM Requests

cs.CR · 2026-04-13 · unverdicted · novelty 4.0

No single privacy technique wins; combining local inference, redaction, and semantic rephrasing limits PII leaks to 0.6% and proprietary code leaks to 31.3% on a 1,300-sample benchmark, with code released.

Best Preprocessing Techniques for Sentiment Analysis

cs.CL · 2026-06-23 · unverdicted · novelty 3.0

Empirical comparison finds tokenization most important and recommends specific preprocessing order for Twitter sentiment analysis models.

citing papers explorer

Showing 26 of 26 citing papers after filters.

From Circuit Evidence to Mechanistic Theory: An Inductive Logic Approach cs.LG · 2026-05-20 · unverdicted · none · ref 64
Introduces Causal Functional Signatures grounded in causal evidence and ILP-learned architectural signatures to enable explicit, comparable, and portable mechanistic claims across model scales.
Linked Multi-Model Data on Russian Domestic and Foreign Policy Speeches cs.CL · 2026-05-15 · unverdicted · none · ref 51
A new linked multimodal dataset of Russian domestic and foreign policy speeches with texts, images, captions, harmonized metadata, and expert-refined topic annotations is introduced to support analyses in political communication and LLM applications.
Matching Meaning at Scale: Evaluating Semantic Search for 18th-Century Intellectual History through the Case of Locke cs.CL · 2026-05-10 · unverdicted · none · ref 21 · 2 links
Semantic search retrieves substantially more implicit receptions of Locke's work than lexical baselines in 18th-century corpora, yet remains constrained by lexical gatekeeping.
Mapping Emerging Climate Misinformation Playbooks in the Global South cs.SI · 2026-04-27 · unverdicted · none · ref 27
Brazilian YouTube climate videos show a transition from traditional denial of climate science to 'new denial' that undermines solutions, with the latter attracting more engagement from diverse actors.
Semantic Needles in Document Haystacks: Sensitivity Testing of LLM-as-a-Judge Similarity Scoring cs.CL · 2026-04-20 · unverdicted · none · ref 49
LLMs exhibit positional bias and context-dependent scoring patterns when judging document similarity, with each model showing a stable scoring fingerprint but a shared hierarchy of sensitivity to different semantic perturbations.
Saying More Than They Know: A Framework for Quantifying Epistemic-Rhetorical Miscalibration in Large Language Models cs.CL · 2026-03-27 · unverdicted · none · ref 20
LLMs display a consistent pattern of elevated form-meaning divergence and uniform rhetorical device use in argumentative texts compared to humans, quantified by new metrics FMD, GPR, and RDDE.
SAM 3: Segment Anything with Concepts cs.CV · 2025-11-20 · unverdicted · none · ref 44
SAM 3 introduces promptable concept segmentation that doubles accuracy of prior systems on images and videos while improving standard SAM segmentation performance.
TSVer: A Benchmark for Fact Verification Against Time-Series Evidence cs.CL · 2025-11-02 · unverdicted · none · ref 31
TSVer is a new benchmark dataset for fact verification against time-series evidence, with 304 annotated real-world claims, 400 time series, verdicts, and justifications, plus baseline results showing current models struggle.
Phonemes to the Rescue: Multilingual Tokenization Based on International Phonetic Alphabet cs.CL · 2026-06-18 · unverdicted · none · ref 22
IPA-based subword tokenizers trained across 24 languages improve tokenization quality and generalization to unseen languages compared to standard text tokenizers, especially for non-Latin scripts.
On The Effectiveness-Fluency Trade-Off In LLM Conditioning: A Systematic Study cs.CL · 2026-06-10 · unverdicted · none · ref 201
Systematic experiments reveal that activation steering trades fluency for concept control, is less effective on instruction-tuned models, and that prompting/SFT excel at injection but not removal, with textual metrics correlating to LLM judges.
Arabic Sentence Segmentation Across Genres and Punctuation Conditions cs.CL · 2026-06-06 · unverdicted · none · ref 28
AraSEG is a genre-diverse Arabic sentence segmentation corpus showing lightweight encoders and dependency parsers outperform LLMs under challenging punctuation while improving downstream parsing.
Isolating LLM Lexical Bias: A Curation-Free Triangulated Metric for Preference-Stage Learning cs.CL · 2026-05-29 · unverdicted · none · ref 64
Introduces a triangulation-based metric to quantify lexical shifts attributable to preference tuning without requiring manual curation of examples.
Probing Minimalist Phase Structure in LLMs: What Universal Dependencies Cannot Represent cs.CL · 2026-05-26 · unverdicted · none · ref 13
Structural probes on UD-invariant wh-movement stimuli reveal phase-count gradients and phase-internal cohesion effects in 12-13 of 13 LLMs, indicating syntactic abstractions beyond UD annotations.
Towards Continuous Sign Language Conversation from Isolated Signs cs.CV · 2026-05-14 · unverdicted · none · ref 28
Constructs continuous sign conversation data from isolated signs using retrieval and diffusion models to train a direct sign-to-sign conversational AI.
Beyond Decodability: Reconstructing Language Model Representations with an Encoding Probe cs.CL · 2026-05-01 · unverdicted · none · ref 23
An encoding probe reconstructs transformer representations from acoustic, phonetic, syntactic, lexical and speaker features, showing independent syntactic/lexical contributions and training-dependent speaker effects.
MediaGraph: A Network Theoretic Framework to Analyze Reporting Preferences in Indian News Media cs.SI · 2026-04-22 · unverdicted · none · ref 20
MediaGraph uses co-occurrence networks from Indian news on farmer protests and a new link predictability metric to reveal source-specific reporting preferences and under-representation of farmer leaders.
A systematic framework for generating novel experimental hypotheses from language models cs.CL · 2024-08-09 · unverdicted · none · ref 56
A framework using language models to simulate non-existent experiments and derive novel testable hypotheses on dative verb acquisition and cross-structural generalization in children.
"Don't Say It!": Constraints, Compliance, and Communication when Language Models Play Taboo cs.CL · 2026-07-01 · unverdicted · none · ref 23
LLMs exhibit different trade-offs between rule compliance and communicative success across prompting, generation constraints, and representation interventions, but remain substantially weaker than humans at guessing under lexical constraints.
Revisiting Compositionality in Dual-Encoder Vision-Language Models: The Role of Inference cs.CV · 2026-04-13 · unverdicted · none · ref 14
Dual-encoder VLMs gain robust compositional generalization by learning localized alignments from frozen patch and token embeddings instead of using global similarity.
Contradictions in Context: Challenges for Retrieval-Augmented Generation in Healthcare cs.IR · 2025-11-10 · unverdicted · none · ref 11
Contradictions between highly similar medical abstracts degrade the factual accuracy and consistency of LLM responses in retrieval-augmented generation.
Reducing Redundancy in Retrieval-Augmented Generation through Chunk Filtering cs.CL · 2026-04-27 · unverdicted · none · ref 18
Entity-based chunk filtering reduces RAG vector index size by 25-36% with retrieval quality near baseline levels.
LLM-Redactor: An Empirical Evaluation of Eight Techniques for Privacy-Preserving LLM Requests cs.CR · 2026-04-13 · unverdicted · none · ref 11
No single privacy technique wins; combining local inference, redaction, and semantic rephrasing limits PII leaks to 0.6% and proprietary code leaks to 31.3% on a 1,300-sample benchmark, with code released.
Best Preprocessing Techniques for Sentiment Analysis cs.CL · 2026-06-23 · unverdicted · none · ref 6
Empirical comparison finds tokenization most important and recommends specific preprocessing order for Twitter sentiment analysis models.
Archi: Agentic Operations at the CMS Experiment hep-ex · 2026-06-03 · unverdicted · none · ref 4
Archi deploys configurable agents on ingested documentation, historical data, and live monitoring to support CMS computing operators at CERN, with positive results on real queries and competitive performance from local open-weight models.
Geolocating News about Extreme Climate Events: A Comparative Analysis of Off-the-Shelf Tools for Toponym Identification in German cs.CL · 2026-05-05 · unverdicted · none · ref 31
Off-the-shelf German NER tools produce divergent toponym sets that lead to distinct country assignments for climate event news, affecting assessments of national prominence in media coverage.
UOL@IDEM at BEA 2026 Shared Task 1: Neural Fusion and Feature-Rich Modeling for L1-Aware Vocabulary Difficulty Prediction cs.CL · 2026-06-23 · unverdicted · none · ref 27
A feature-rich regression model using multilingual embeddings and features for frequency, cognate similarity, and predictability reports RMSE scores of 1.132, 1.037, and 0.891 for L1-aware vocabulary difficulty prediction on Spanish, German, and Chinese.
