ReflAct: World-Grounded Decision Making in LLM Agents via Goal-State Re- flection

Jeonghye Kim, Sojeong Rhee, Minbeom Kim, Dohyung Kim, Sangmook Lee, Youngchul Sung, Kyomin Jung · 2025 · DOI 10.18653/v1/2025.emnlp-main.1697

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open at publisher browse 3 citing papers

representative citing papers

Agentic Monte Carlo: Simulating Reinforcement Learning for Black-Box Agents

cs.LG · 2026-06-03 · unverdicted · novelty 6.0

Agentic Monte Carlo enables RL-style optimization of black-box LLM agents by sampling from the optimal policy posterior using Sequential Monte Carlo.

Unified Context Evolution for LLM Agents

cs.CL · 2026-06-01 · unverdicted · novelty 6.0

UCE builds a typed, evolving library of Memory, Strategy, Workflow and Skill units from agent trajectories, improving ALFWorld success from 75.4% to 96.3% and WebShop score from 45.1% to 61.3% while transferring to new actor models.

SMH-Bench: Benchmarking LLM Agents for Environment-Grounded Reasoning and Action in Smart Homes

cs.AI · 2026-06-01 · unverdicted · novelty 6.0

SMH-Bench supplies 1,100 stratified tasks in a verifiable smart-home simulator to measure LLM performance on explicit control, scheduling, ambiguity, and personalization as environment complexity grows.

citing papers explorer

Showing 3 of 3 citing papers.

Agentic Monte Carlo: Simulating Reinforcement Learning for Black-Box Agents cs.LG · 2026-06-03 · unverdicted · none · ref 35
Agentic Monte Carlo enables RL-style optimization of black-box LLM agents by sampling from the optimal policy posterior using Sequential Monte Carlo.
Unified Context Evolution for LLM Agents cs.CL · 2026-06-01 · unverdicted · none · ref 24
UCE builds a typed, evolving library of Memory, Strategy, Workflow and Skill units from agent trajectories, improving ALFWorld success from 75.4% to 96.3% and WebShop score from 45.1% to 61.3% while transferring to new actor models.
SMH-Bench: Benchmarking LLM Agents for Environment-Grounded Reasoning and Action in Smart Homes cs.AI · 2026-06-01 · unverdicted · none · ref 33
SMH-Bench supplies 1,100 stratified tasks in a verifiable smart-home simulator to measure LLM performance on explicit control, scheduling, ambiguity, and personalization as environment complexity grows.

ReflAct: World-Grounded Decision Making in LLM Agents via Goal-State Re- flection

fields

years

verdicts

representative citing papers

citing papers explorer