Tree of uncertain thoughts reasoning for large language models

Shentong Mo, Miao Xin · 2023 · arXiv 2309.07694

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Chain of Uncertain Rewards with Large Language Models for Reinforcement Learning

cs.LG · 2026-04-15 · unverdicted · novelty 6.0

CoUR uses LLMs for efficient RL reward design through uncertainty quantification and similarity selection, achieving better performance and lower evaluation costs on IsaacGym and Bidexterous Manipulation benchmarks.

citing papers explorer

Showing 1 of 1 citing paper.

Chain of Uncertain Rewards with Large Language Models for Reinforcement Learning

cs.LG · 2026-04-15 · unverdicted · none · ref 12
