A new dataset IFLLM shows implicit feedback from mouse and eye movements boosts LLM reward model accuracy from 55% to 64% and nearly triples DPO response quality gains across eight models.
Title resolution pending
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Your Mouse and Eyes Secretly Leak Your Preference: LLM Alignment using Implicit Feedback from Users
A new dataset IFLLM shows implicit feedback from mouse and eye movements boosts LLM reward model accuracy from 55% to 64% and nearly triples DPO response quality gains across eight models.