4 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
years: 2026 (4)
roles: background (4)
citing papers explorer
- Do LLMs Need to See Everything? A Benchmark and Study of Failures in LLM-driven Smartphone Automation using Screentext vs. Screenshots
  A new benchmark shows LLM smartphone agents achieve comparable success with screen text alone as with screenshots, but both fail often due to UI accessibility and reasoning gaps.
- Polite But Boring? Trade-offs Between Engagement and Psychological Reactance to Chatbot Feedback Styles
  Polite chatbot feedback lowers psychological reactance and boosts behavioral intentions but lacks engagement, whereas verbal leakage heightens surprise and engagement at the expense of increased reactance.
- HeartSway: Exploring Biodata as Poetic Traces in Public Space
  An interactive public hammock captures and replays biodata as embodied traces, with a field study of ten users indicating it fosters anonymous connection and appreciation for shared vitality.
- Beyond Explainable AI (XAI): An Overdue Paradigm Shift and Post-XAI Research Directions