FORTIS benchmark shows over-privilege is the norm in LLM agent skill selection and execution, with models reaching for higher-privilege skills and tools than required across ten frontier models and three domains.
Adaptive collaboration with humans: Metacognitive policy optimization for multi-agent LLMs with continual learning
3 Pith papers cite this work. Polarity classification is still indexing.
fields
cs.AI 3years
2026 3verdicts
UNVERDICTED 3representative citing papers
UFCOD extracts Path Energy and Dynamics Energy from diffusion trajectories to perform few-shot OOD detection across unrelated domains with one fixed model.
SBD is a bilevel optimization framework that learns context-dependent safety weights for runtime task delegation in hierarchical multi-agent systems, with continuous authority transfer alpha and theoretical guarantees on safety monotonicity, policy convergence, and accountability propagation.
citing papers explorer
-
FORTIS: Benchmarking Over-Privilege in Agent Skills
FORTIS benchmark shows over-privilege is the norm in LLM agent skill selection and execution, with models reaching for higher-privilege skills and tools than required across ten frontier models and three domains.
-
Geometry over Density: Few-Shot Cross-Domain OOD Detection
UFCOD extracts Path Energy and Dynamics Energy from diffusion trajectories to perform few-shot OOD detection across unrelated domains with one fixed model.
-
Safe Bilevel Delegation (SBD): A Formal Framework for Runtime Delegation Safety in Multi-Agent Systems
SBD is a bilevel optimization framework that learns context-dependent safety weights for runtime task delegation in hierarchical multi-agent systems, with continuous authority transfer alpha and theoretical guarantees on safety monotonicity, policy convergence, and accountability propagation.