ReAct: Synergizing reasoning and acting in language models · 2023

4 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Multi-Dimensional Behavioral Evaluation of Agentic Stock Prediction Systems Using Large Language Model Judges with Closed-Loop Reinforcement Learning Feedback

cs.LG · 2026-05-07 · unverdicted · novelty 7.0

A multi-dimensional behavioral scoring system using LLM judges evaluates agentic stock predictors and feeds scores into closed-loop RL to improve one-day MAPE by 11.5% on held-out data.

Measuring the Unmeasurable: Markov Chain Reliability for LLM Agents

cs.SE · 2026-04-27 · unverdicted · novelty 7.0

TraceToChain models LLM agent traces as absorbing DTMCs using automatic clustering and smoothed MLE, with KS and AIC validation, to reconcile pass@k, pass^k, and RDC as projections of a single first-passage success-time distribution.

PFAgent: A Tractable and Self-Evolving Power-Flow Agent for Interactive Grid Analysis

eess.SY · 2026-04-12 · unverdicted · novelty 5.0

PFAgent automates interactive power-flow analysis by combining intent parsing, tool execution, verification-driven self-evolution, and an evaluation framework, with demonstrations on IEEE benchmark systems.

Nemobot Games: Crafting Strategic AI Gaming Agents for Interactive Learning with Large Language Models

cs.AI · 2026-04-23 · unverdicted · novelty 4.0

Nemobot is an LLM-powered platform for creating and refining strategic game agents across dictionary, solvable, heuristic, and learning-based games, moving toward self-programming AI.

citing papers explorer

Showing 4 of 4 citing papers.

Multi-Dimensional Behavioral Evaluation of Agentic Stock Prediction Systems Using Large Language Model Judges with Closed-Loop Reinforcement Learning Feedback
cs.LG · 2026-05-07 · unverdicted · none · ref 9
Measuring the Unmeasurable: Markov Chain Reliability for LLM Agents
cs.SE · 2026-04-27 · unverdicted · none · ref 4
PFAgent: A Tractable and Self-Evolving Power-Flow Agent for Interactive Grid Analysis
eess.SY · 2026-04-12 · unverdicted · none · ref 19
Nemobot Games: Crafting Strategic AI Gaming Agents for Interactive Learning with Large Language Models
cs.AI · 2026-04-23 · unverdicted · none · ref 15
