Title resolution pending

Delete existing content in a textbox, then type content

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

browse 3 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

InteractWeb-Bench: Can Multimodal Agent Escape Blind Execution in Interactive Website Generation?

cs.AI · 2026-04-30 · unverdicted · novelty 7.0

InteractWeb-Bench shows that frontier multimodal AI agents remain trapped in blind execution when generating websites from perturbed, low-quality non-expert instructions.

WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models

cs.CL · 2024-01-25 · unverdicted · novelty 6.0

WebVoyager uses a large multimodal model to complete real-world web tasks end-to-end and reaches 59.1 percent success on a new benchmark of 15 live sites, with an automatic GPT-4V evaluator that matches human judgments 85 percent of the time.

Learning to Learn from Multimodal Experience

cs.AI · 2026-05-16 · unverdicted · novelty 5.0

Agents learn to dynamically construct and organize memory from multimodal experiences, improving performance over static designs in task-dependent settings.

citing papers explorer

Showing 3 of 3 citing papers.

InteractWeb-Bench: Can Multimodal Agent Escape Blind Execution in Interactive Website Generation? cs.AI · 2026-04-30 · unverdicted · none · ref 5
InteractWeb-Bench shows that frontier multimodal AI agents remain trapped in blind execution when generating websites from perturbed, low-quality non-expert instructions.
WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models cs.CL · 2024-01-25 · unverdicted · none · ref 6
WebVoyager uses a large multimodal model to complete real-world web tasks end-to-end and reaches 59.1 percent success on a new benchmark of 15 live sites, with an automatic GPT-4V evaluator that matches human judgments 85 percent of the time.
Learning to Learn from Multimodal Experience cs.AI · 2026-05-16 · unverdicted · none · ref 51
Agents learn to dynamically construct and organize memory from multimodal experiences, improving performance over static designs in task-dependent settings.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer