THOUGHT-LIKE-PRO: Enhancing reasoning of large language models through self-driven prolog-based chain-of-thought. arXiv:2407.14562, 2024

Xiaoyu Tan, Yongxin Deng, Xihe Qiu, Weidi Xu, Chao Qu, Wei Chu, Yinghui Xu, Yuan Qi · 2024 · arXiv 2407.14562

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Training Language Models to Use Prolog as a Tool

cs.CL · 2025-12-08 · unverdicted · novelty 4.0

Fine-tuning Qwen2.5-3B with GRPO on GSM8K to use Prolog yields competitive zero-shot MMLU performance but exposes an accuracy-auditability trade-off interpreted as reward hacking.

citing papers explorer

Showing 1 of 1 citing paper.

Training Language Models to Use Prolog as a Tool cs.CL · 2025-12-08 · unverdicted · none · ref 9
