Noah Shinn
- 2 works
- 2 Pith-reviewed
- 100.0% Recognition coverage
- 0 queued
Works
- $\tau$-bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains Pith 2024 · cs.AI · verdict UNVERDICTED · 107 Pith citing
- Reflexion: Language Agents with Verbal Reinforcement Learning Pith 2023 · cs.AI · verdict CONDITIONAL · 142 Pith citing