IMPACT is a synchronized five-view RGB-D dataset of 112 real industrial assembly trials with multi-granularity annotations, anomaly taxonomy, and compliance tracking.
4 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026: 4
roles
background: 1
polarities
background: 1
citing papers explorer
- IMPACT: A Dataset for Multi-Granularity Human Procedural Action Understanding in Industrial Assembly
  IMPACT is a synchronized five-view RGB-D dataset of 112 real industrial assembly trials with multi-granularity annotations, anomaly taxonomy, and compliance tracking.
- From Gaze to Guidance: Interpreting and Adapting to Users' Cognitive Needs with Multimodal Gaze-Aware AI Assistants
  A gaze-aware LLM assistant using egocentric video with gaze overlays outperforms text-only LLMs in accuracy of reading behavior assessment, personalization, information recall, and interaction efficiency in a 36-person study.
- Substantial, Decomposable, and Invisible: Visual Context Misalignment in Instructional Videos for Physical Tasks
  Fully aligned instructional videos for physical tasks yield 11.1% better completion quality and 15.5% faster times, with four decomposable visual attributes whose isolated misalignments degrade performance without users noticing.
- VisionClaw: Always-On AI Agents through Smart Glasses
  VisionClaw couples continuous egocentric vision on smart glasses with speech-driven AI agents to enable hands-free real-world tasks, with lab and field studies showing faster completion and a shift toward opportunistic delegation.