Ensure you don’t mix them up with other numbers (e.g

Focus on the numerical labels in the TOP LEFT corner of each rectangle (element)

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

browse 3 citing papers

representative citing papers

InteractWeb-Bench: Can Multimodal Agent Escape Blind Execution in Interactive Website Generation?

cs.AI · 2026-04-30 · unverdicted · novelty 7.0

InteractWeb-Bench shows that frontier multimodal AI agents remain trapped in blind execution when generating websites from perturbed, low-quality non-expert instructions.

WebSP-Eval: Evaluating Web Agents on Website Security and Privacy Tasks

cs.CR · 2026-04-07 · unverdicted · novelty 7.0

WebSP-Eval shows that multimodal LLM-based web agents fail more than 45% of the time on security and privacy tasks involving stateful UI elements such as toggles and checkboxes.

Learning to Learn from Multimodal Experience

cs.AI · 2026-05-16 · unverdicted · novelty 5.0

Agents learn to dynamically construct and organize memory from multimodal experiences, improving performance over static designs in task-dependent settings.

citing papers explorer

Showing 3 of 3 citing papers.

InteractWeb-Bench: Can Multimodal Agent Escape Blind Execution in Interactive Website Generation? cs.AI · 2026-04-30 · unverdicted · none · ref 17
InteractWeb-Bench shows that frontier multimodal AI agents remain trapped in blind execution when generating websites from perturbed, low-quality non-expert instructions.
WebSP-Eval: Evaluating Web Agents on Website Security and Privacy Tasks cs.CR · 2026-04-07 · unverdicted · none · ref 65
WebSP-Eval shows that multimodal LLM-based web agents fail more than 45% of the time on security and privacy tasks involving stateful UI elements such as toggles and checkboxes.
Learning to Learn from Multimodal Experience cs.AI · 2026-05-16 · unverdicted · none · ref 64
Agents learn to dynamically construct and organize memory from multimodal experiences, improving performance over static designs in task-dependent settings.

Ensure you don’t mix them up with other numbers (e.g

fields

years

verdicts

representative citing papers

citing papers explorer