A two-stage VLM-LM system that infers actions from screen recordings to detect inefficient workflows and generate tailored recommendations.
InProceedings of the 28th Annual ACM Symposium on User Interface Software & Technology
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.HC 1years
2025 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
The Invisible Mentor: Inferring User Actions from Screen Recordings to Recommend Better Workflows
A two-stage VLM-LM system that infers actions from screen recordings to detect inefficient workflows and generate tailored recommendations.