pith. sign in

WAInjectBench: Benchmarking prompt injection detections for web agents

9 Pith papers cite this work. Polarity classification is still indexing.

9 Pith papers citing it

citation-role summary

background 2 dataset 1

citation-polarity summary

fields

cs.CR 8 cs.CL 1

years

2026 9

verdicts

UNVERDICTED 9

clear filters

representative citing papers

Formal Security Analysis of Agent Protocol Composition

cs.CR · 2026-06-27 · unverdicted · novelty 7.0

AgentThread analyzes five agent protocols with formal TLA+ invariants and SDK tests, reporting 35 specification findings, 80 implementation tests, 30 composition-only failures, and a cross-protocol responsibility gap in security enforcement.

Same-Origin Policy for Agentic Browsers

cs.CR · 2026-06-12 · unverdicted · novelty 7.0

The paper builds SOPBench showing frequent SOP violations in agentic browsers and introduces SOPGuard to enforce the policy with low overhead in BrowserOS.

Web Agents Should Adopt the Plan-Then-Execute Paradigm

cs.CR · 2026-05-14 · unverdicted · novelty 6.0

Web agents should default to planning a complete task program before observing live web content to reduce prompt injection exposure, since WebArena tasks are compatible and 80% need no runtime LLM calls.

citing papers explorer

Showing 9 of 9 citing papers.