pith. sign in

super hub Canonical reference

O'Brien and Carrie Jun Cai and Meredith Ringel Morris and Percy Liang and Michael S

Canonical reference. 94% of citing Pith papers cite this work as background.

119 Pith papers citing it
Background 94% of classified citations

hub tools

citation-role summary

background 16

citation-polarity summary

claims ledger

  • background MAS decomposes high-level objectives into coordinated subtasks executed by specialized agents under the orchestration of an LLM. This paradigm has been extensively explored in recent literature [16, 42, 59]. To facilitate the development of these complex systems, a growing array of open-source frame- works, such as Microsoft AutoGen [34], CrewAI [19], Camel [2], and Praison [40], has emerged. These frameworks offer high-level modular abstractions, enabling developers to easily integrate custom t
  • background marking: we provide controlled experiments (5-12 agents, varying policies) and extended runs (2 h, 12 h) to study negotiation under sustained load. Adaptation and Feedback.Another complementary line of work focuses on improving LLM agents via instruction tuning and feedback alignment. Instruction-tuned models such as Flan- T5 [4], Alpaca [27], and GPT-4-LLM [23] democratized adaptation. RLHF and its extensions (InstructGPT[ 21],Constitutional AI[ 1], DPO[ 24], and related datasets likeSelf-Instr
  • background to mitigate SHADOWMERGE. We have responsibly disclosed our findings to affected graph-memory vendors and open sourced SHADOWMERGE at https://anonymous.4open.science/status/S hadowMerge -033C. I. INTRODUCTION LLM agents are moving from single-turn chatbots [1], [2] toward long-running systems that remember, adapt, and act across repeated interactions [3], [4], [5]. Persistent mem- ory [6], [7], [8], [9], [10] enables this shift by allowing agents to reuse past tool outcomes, maintain user prefere
  • background ing on the human behavior and language in their training data [2], and can be engaged conversationally rather than read as static arti- facts [8, 22]. Building on this, generative agents extend LLM per- sonas with memory and reflection, and can serve as believable prox- ies of individuals and communities [18]. Follow-up work shows that grounding agents in interview and survey data improves their accuracy [19]. Design and UX researchers are also actively ex- ploring AI personas in design workflow
  • background Stylette [ 22] maps styling goals to CSS edits, DynaVis [43] creates manipulable widgets for visualization editing, and DirectGPT [ 29] supports in-place modification of selected objects. These systems show that natural language can support in-situ GUI changes, but each interaction is largely self-contained. Recent works such as IRF [34] and CARE [33] explored sustained interaction by updating interface content as users refine preferences over time. Still, these systems largely position the agen
  • background 3 Provenance-Based Credit Assignment In classical TD(λ) (Sutton & Barto, 2018; Sutton, 1988), theλ-return Gλ t = (1−λ)∑∞ n=1λn−1G(n) t interpolates between the one-step bootstrapG(1) t =rt +γQ(st+1,at+1)and the Monte Carlo return G(∞) t =∑∞ k=0γkrt+k. Theλ-return advantage is standardly expressed as a discounted sum of future TD errors: Gλ t−Q(st,at) = T−t−1∑ k=0 (γλ)kδt+k,(4) where δt = rt +γQ(st+1,at+1)−Q(st,at). This telescoping decomposition underpins eligibility traces, propagating credit t

authors

co-cited works

clear filters

representative citing papers

LegalWorld: A Life-Cycle Interactive Environment for Legal Agents

cs.CL · 2026-06-17 · unverdicted · novelty 7.0

LegalWorld is a life-cycle interactive environment modeling Chinese civil litigation as five causally connected stages grounded in 75,309 judgments, paired with LongJud-Bench for cross-stage agent evaluation.

ZIPP:Zero-shot Image Personalization from Personas

cs.AI · 2026-06-07 · unverdicted · novelty 7.0

ZIPP conditions diffusion models on LLM-rewritten prompts derived from graph-mined natural-language personas to achieve zero-shot personalization, reporting 13-20% gains and 79% human preference win rate over generic outputs.

A Case for Agentic Tuning: From Documentation to Action in PostgreSQL

cs.SE · 2026-05-19 · unverdicted · novelty 7.0

PerfEvolve equips LLM agents with executable skills from expert methods to enable dynamic, version-consistent, workload-specific tuning in PostgreSQL, outperforming documentation baselines by up to 35.2% on TPC-C and TPC-H.

From Role to Person: Trust Calibration Challenges in Twin Agents

cs.HC · 2026-05-19 · unverdicted · novelty 7.0

Twin agents as personal digital representations create distinct trust calibration challenges because they dissolve the boundary between AI and human decision-makers, unlike existing frameworks designed for clear separation.

Attributing Emergence in Million-Agent Systems

cs.AI · 2026-05-12 · unverdicted · novelty 7.0

A scalable Aumann-Shapley attribution method for million-agent systems reveals that small-scale samples structurally misattribute emergence under nonlinear macro indicators, as shown by the Attribution Scaling Bias theorem.

Causal state binding predicts action control in language agents

cs.AI · 2026-05-10 · unverdicted · novelty 7.0 · 3 refs

Causal state binding is introduced as a framework that predicts action control in language agents, validated across large benchmarks and SWE-bench Lite where adding the measure raised issue-to-file hit@3 AUC from 0.873 to 0.935.

MEMOREPAIR: Barrier-First Cascade Repair in Agentic Memory

cs.AI · 2026-05-08 · unverdicted · novelty 7.0

MemoRepair formalizes the cascade update problem in agentic memory and solves it via a min-cut reduction that eliminates invalidated memory exposure to 0% while recovering 91-94% of valid successors at 57-76% of baseline repair cost.

citing papers explorer

Showing 4 of 4 citing papers after filters.