arXiv preprint arXiv:2402.13950

Paul, D · 2024 · arXiv 2402.13950

4 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

The Reasoning Trap: An Information-Theoretic Bound on Closed-System Multi-Step LLM Reasoning

cs.CL · 2026-05-03 · unverdicted · novelty 6.0

Closed-system multi-step LLM reasoning is subject to an information-theoretic bound where mutual information with evidence decreases, preserving accuracy while eroding faithfulness, with EGSR recovering it on SciFact and FEVER.

ReSS: Learning Reasoning Models for Tabular Data Prediction via Symbolic Scaffold

cs.AI · 2026-04-15 · unverdicted · novelty 6.0 · 2 refs

ReSS extracts decision paths from trees as scaffolds to guide LLM reasoning generation, fine-tunes the LLM on the resulting dataset with scaffold-invariant augmentation, and reports up to 10% gains on medical and financial tabular benchmarks with new faithfulness metrics.

Mechanism Plausibility in Generative Agent-Based Modeling

cs.MA · 2026-05-12 · unverdicted · novelty 5.0 · 2 refs

Introduces the Mechanism Plausibility Scale, a four-level framework separating generative sufficiency from mechanistic plausibility in LLM-based agent-based models.

Training Language Models to Use Prolog as a Tool

cs.CL · 2025-12-08 · unverdicted · novelty 4.0

Fine-tuning Qwen2.5-3B with GRPO on GSM8K to use Prolog yields competitive zero-shot MMLU performance but exposes an accuracy-auditability trade-off interpreted as reward hacking.

citing papers explorer

The Reasoning Trap: An Information-Theoretic Bound on Closed-System Multi-Step LLM Reasoning

cs.CL · 2026-05-03 · unverdicted · none · ref 23

ReSS: Learning Reasoning Models for Tabular Data Prediction via Symbolic Scaffold

cs.AI · 2026-04-15 · unverdicted · none · ref 26 · 2 links

Mechanism Plausibility in Generative Agent-Based Modeling

cs.MA · 2026-05-12 · unverdicted · none · ref 65 · 2 links

Training Language Models to Use Prolog as a Tool

cs.CL · 2025-12-08 · unverdicted · none · ref 7
