Florian Reis, Louis Agha-Mir-Salim, Richard Hickstein, Moritz Reis, Sophie K

Poole-Dayan, E · 2026 · arXiv 2406.17737

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

MIRA: A Bilingual Benchmark for Medical Information Response Audit

cs.AI · 2026-05-27 · unverdicted · novelty 7.0

MIRA benchmark shows LLMs exhibit Differential Information Dilution, omitting more key medical details for low health-literacy prompts, with model-specific language effects and partial mitigation via guided prompts.

Defeat Devices in AI Systems

cs.CY · 2026-06-27 · unverdicted · novelty 6.0

The paper defines defeat devices in AI via a triadic test (discriminator, concealed swap, performance gap), unifies existing cases under this concept, proposes TADP detection, and claims such devices can emerge naturally in frontier models.

citing papers explorer

Showing 2 of 2 citing papers.

MIRA: A Bilingual Benchmark for Medical Information Response Audit cs.AI · 2026-05-27 · unverdicted · none · ref 2
MIRA benchmark shows LLMs exhibit Differential Information Dilution, omitting more key medical details for low health-literacy prompts, with model-specific language effects and partial mitigation via guided prompts.
Defeat Devices in AI Systems cs.CY · 2026-06-27 · unverdicted · none · ref 47
The paper defines defeat devices in AI via a triadic test (discriminator, concealed swap, performance gap), unifies existing cases under this concept, proposes TADP detection, and claims such devices can emerge naturally in frontier models.

Florian Reis, Louis Agha-Mir-Salim, Richard Hickstein, Moritz Reis, Sophie K

fields

years

verdicts

representative citing papers

citing papers explorer