arXiv preprint arXiv:2506.02532 , year=

Reasoningflow: Semantic structure of complex reasoning traces , author= · 2025 · arXiv 2506.02532

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

ThinkProbe: Beyond Accuracy -- Structural Profiling of Open-Ended LLM Reasoning Traces via Non-Generative Thought Graphs

cs.CL · 2026-06-27 · unverdicted · novelty 7.0

ThinkProbe builds non-generative Thought Graphs from 4200 LLM traces across 7 models and 200 questions to extract 5D cognitive profiles, finding model-level stability in reasoning structure that exceeds domain effects in four dimensions.

Beyond Input Understanding: Diagnosing Multilingual Mathematical Reasoning with Directed Acyclic Trace Graphs

cs.CL · 2026-05-26 · unverdicted · novelty 7.0

DATG framework diagnoses that non-English reasoning in Qwen3 models shows reduced mathematical anchor coverage and dependency fidelity, with Loop-Retry and Formula-Retry improving target-language accuracy.

ReasonOps: Operator Segmentation for LLM Reasoning Traces

cs.AI · 2026-05-28 · unverdicted · novelty 6.0

Unsupervised clustering on sentence-initial 3-token pivots extracts 7 universal reasoning operators from 44k traces across 12 LLMs that enable model fingerprinting and answer-correctness prediction.

citing papers explorer

Showing 3 of 3 citing papers after filters.

ThinkProbe: Beyond Accuracy -- Structural Profiling of Open-Ended LLM Reasoning Traces via Non-Generative Thought Graphs cs.CL · 2026-06-27 · unverdicted · none · ref 3
ThinkProbe builds non-generative Thought Graphs from 4200 LLM traces across 7 models and 200 questions to extract 5D cognitive profiles, finding model-level stability in reasoning structure that exceeds domain effects in four dimensions.
Beyond Input Understanding: Diagnosing Multilingual Mathematical Reasoning with Directed Acyclic Trace Graphs cs.CL · 2026-05-26 · unverdicted · none · ref 4
DATG framework diagnoses that non-English reasoning in Qwen3 models shows reduced mathematical anchor coverage and dependency fidelity, with Loop-Retry and Formula-Retry improving target-language accuracy.
ReasonOps: Operator Segmentation for LLM Reasoning Traces cs.AI · 2026-05-28 · unverdicted · none · ref 14
Unsupervised clustering on sentence-initial 3-token pivots extracts 7 universal reasoning operators from 44k traces across 12 LLMs that enable model fingerprinting and answer-correctness prediction.

arXiv preprint arXiv:2506.02532 , year=

fields

years

verdicts

representative citing papers

citing papers explorer