HERA evolves query-specific agent topologies via reward-guided sampling and refines role-specific prompts via credit assignment, yielding 38.69% average gains on six knowledge-intensive benchmarks.
Title resolution pending
5 Pith papers cite this work. Polarity classification is still indexing.
5
Pith papers citing it
citation-role summary
other 1
citation-polarity summary
years
2026 5roles
other 1polarities
unclear 1representative citing papers
WorldCup is a new multi-bit LLM watermarking framework that models token sampling as a communication channel and uses hierarchical competition with entropy-aware modulation for robust message embedding and recovery.
SALLIE detects jailbreaks in text and vision-language models by extracting residual stream activations, scoring maliciousness per layer with k-NN, and ensembling predictions, outperforming baselines on multiple datasets.