arXiv:2601.22881 [cs.SE] https://arxiv.org/abs/2601.22881

· 2026 · arXiv 2601.22881

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

read on arXiv browse 3 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

CUJBench: Benchmarking LLM-Agent on Cross-Modal Failure Diagnosis from Browser to Backend

cs.SE · 2026-04-25 · unverdicted · novelty 8.0

CUJBench is the first benchmark for cross-modal LLM-agent failure diagnosis, reporting 19.7% accuracy and identifying evidence attribution as the core bottleneck across six models.

Multi-Agent Systems for Root Cause Analysis in Microservices

cs.SE · 2026-05-05 · unverdicted · novelty 6.0

LATS-RCA applies multi-agent Language Agent Tree Search to automate root cause analysis in microservices, reporting high accuracy on a small open-source Java system but lower accuracy in a complex production environment.

Security Considerations for Multi-agent Systems

cs.CR · 2026-03-09 · unverdicted · novelty 6.0

No existing AI security framework covers a majority of the 193 identified multi-agent system threats in any category, with OWASP Agentic Security Initiative achieving the highest overall coverage at 65.3%.

citing papers explorer

Showing 3 of 3 citing papers.

CUJBench: Benchmarking LLM-Agent on Cross-Modal Failure Diagnosis from Browser to Backend cs.SE · 2026-04-25 · unverdicted · none · ref 13
CUJBench is the first benchmark for cross-modal LLM-agent failure diagnosis, reporting 19.7% accuracy and identifying evidence attribution as the core bottleneck across six models.
Multi-Agent Systems for Root Cause Analysis in Microservices cs.SE · 2026-05-05 · unverdicted · none · ref 9
LATS-RCA applies multi-agent Language Agent Tree Search to automate root cause analysis in microservices, reporting high accuracy on a small open-source Java system but lower accuracy in a complex production environment.
Security Considerations for Multi-agent Systems cs.CR · 2026-03-09 · unverdicted · none · ref 275
No existing AI security framework covers a majority of the 193 identified multi-agent system threats in any category, with OWASP Agentic Security Initiative achieving the highest overall coverage at 65.3%.

arXiv:2601.22881 [cs.SE] https://arxiv.org/abs/2601.22881

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer