arXiv preprint arXiv:2511.19477
2 Pith papers cite this work. Polarity classification is still indexing.
fields: cs.AI (2)
years: 2026 (2)
verdicts: UNVERDICTED (2)
representative citing papers
SUPERBROWSER reaches 89.47% success on Mind2Web Hard by implementing a human-like perception-cognition-action system with vision-first bounding boxes, a three-role brain, and an evicting ledger.
citing papers explorer
- Rehearsed Multi-Agent Live Product Demonstrations with Real-Time Voice Question Answering
Rhetor automates rehearsed live web-app demos with segment-synchronized narration and real-time voice QA using cross-modal UI-plus-code features, a grounded scripter, rehearsal loops, and timing invariants, with case-study metrics on four applications.
- RunAgent SuperBrowser: A Theory of Autonomous Web Navigation Grounded in Human Browsing Behaviour
SUPERBROWSER reaches 89.47% success on Mind2Web Hard by implementing a human-like perception-cognition-action system with vision-first bounding boxes, a three-role brain, and an evicting ledger.