Introduces KAPRO framework and KAware dataset to benchmark LLM agents' self-awareness in distinguishing internal knowledge from external tool needs.
InInternational Conference on Learning Representa- tions (ICLR)
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
fields
cs.AI 2years
2026 2verdicts
UNVERDICTED 2representative citing papers
A survey that maps risks along the agent workflow and consolidates metrics and benchmarks for safety, robustness, privacy, and security in agentic AI.
citing papers explorer
-
From Knowing to Acting: Benchmarking Self-Awareness Capability of LLM Agents
Introduces KAPRO framework and KAware dataset to benchmark LLM agents' self-awareness in distinguishing internal knowledge from external tool needs.
-
Towards trustworthy agentic AI: a comprehensive survey of safety, robustness, privacy, and system security
A survey that maps risks along the agent workflow and consolidates metrics and benchmarks for safety, robustness, privacy, and security in agentic AI.