arXiv preprint arXiv:2502.18474 (2025)

Jiayimei Wang, Tao Ni, Wei-Bin Lee, Qingchuan Zhao · 2025 · arXiv 2502.18474

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Satisfiability Solving with LLMs: A Matched-Pair Evaluation of Reasoning Capability

cs.AI · 2026-05-27 · unverdicted · novelty 7.0

A matched-pair protocol and Accurate Differentiation Rate metric reveal that conventional LLM accuracy on SAT problems is often inflated by over-predicting satisfiability, while cross-representation agreement exceeds 80 percent for most models.

Learning When to Remember: Risk-Sensitive Contextual Bandits for Abstention-Aware Memory Retrieval in LLM-Based Coding Agents

cs.CL · 2026-04-30 · unverdicted · novelty 6.0

RSCB-MC is a risk-sensitive contextual bandit memory controller for LLM coding agents that chooses safe actions including abstention, achieving 60.5% proxy success with 0% false positives and low latency in 200-case validation.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Satisfiability Solving with LLMs: A Matched-Pair Evaluation of Reasoning Capability cs.AI · 2026-05-27 · unverdicted · none · ref 45
A matched-pair protocol and Accurate Differentiation Rate metric reveal that conventional LLM accuracy on SAT problems is often inflated by over-predicting satisfiability, while cross-representation agreement exceeds 80 percent for most models.
Learning When to Remember: Risk-Sensitive Contextual Bandits for Abstention-Aware Memory Retrieval in LLM-Based Coding Agents cs.CL · 2026-04-30 · unverdicted · none · ref 1
RSCB-MC is a risk-sensitive contextual bandit memory controller for LLM coding agents that chooses safe actions including abstention, achieving 60.5% proxy success with 0% false positives and low latency in 200-case validation.

arXiv preprint arXiv:2502.18474 (2025)

fields

years

verdicts

representative citing papers

citing papers explorer