A gaze-aware LLM assistant using egocentric video with gaze overlays outperforms text-only LLMs in accuracy of reading behavior assessment, personalization, information recall, and interaction efficiency in a 36-person study.
Title resolution pending
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
fields
cs.HC 3roles
background 1polarities
background 1representative citing papers
VisionClaw couples continuous egocentric vision on smart glasses with speech-driven AI agents to enable hands-free real-world tasks, with lab and field studies showing faster completion and a shift toward opportunistic delegation.
XARP provides a WebSocket-based remote-procedure system that lets Python code and AI agents control Unity XR clients, with benchmarks and user studies showing faster iteration than conventional XR workflows.
citing papers explorer
-
From Gaze to Guidance: Interpreting and Adapting to Users' Cognitive Needs with Multimodal Gaze-Aware AI Assistants
A gaze-aware LLM assistant using egocentric video with gaze overlays outperforms text-only LLMs in accuracy of reading behavior assessment, personalization, information recall, and interaction efficiency in a 36-person study.
-
VisionClaw: Always-On AI Agents through Smart Glasses
VisionClaw couples continuous egocentric vision on smart glasses with speech-driven AI agents to enable hands-free real-world tasks, with lab and field studies showing faster completion and a shift toward opportunistic delegation.
-
XARP Tools: An Extended Reality Platform for Humans and AI Agents
XARP provides a WebSocket-based remote-procedure system that lets Python code and AI agents control Unity XR clients, with benchmarks and user studies showing faster iteration than conventional XR workflows.