LLMs exhibit a Weakest Link Effect in multi-hop QA where performance collapses to the least visible evidence position; MFAI resolves recognition bottlenecks with up to 11.49% gains in low-visibility spots.
Title resolution pending
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 3verdicts
UNVERDICTED 3roles
background 1polarities
background 1representative citing papers
Passages made from high-convergence sentences improve LLM performance on inferential questions compared to cosine similarity selection.
UserGPT introduces a generative LLM framework with a behavior simulation engine, semantization module, and DF-GRPO post-training that scores 0.7325 on tag prediction and 0.7528 on summary generation on HPR-Bench while compressing records by up to 97.9%.
citing papers explorer
-
Failure Modes in Multi-Hop QA: The Weakest Link Effect and the Recognition Bottleneck
LLMs exhibit a Weakest Link Effect in multi-hop QA where performance collapses to the least visible evidence position; MFAI resolves recognition bottlenecks with up to 11.49% gains in low-visibility spots.
-
Context Convergence Improves Answering Inferential Questions
Passages made from high-convergence sentences improve LLM performance on inferential questions compared to cosine similarity selection.
-
UserGPT Technical Report
UserGPT introduces a generative LLM framework with a behavior simulation engine, semantization module, and DF-GRPO post-training that scores 0.7325 on tag prediction and 0.7528 on summary generation on HPR-Bench while compressing records by up to 97.9%.