2507.09016

Total: 1

#1 Enhancing RLHF with Human Gaze Modeling [PDF1] [Copy] [Kimi] [REL]

Authors: Karim Galliamov, Ivan Titov, Ilya Pershin

Reinforcement Learning from Human Feedback (RLHF) aligns language models with human preferences but is computationally expensive. We explore two approaches that leverage human gaze modeling to enhance RLHF: (1) gaze-aware reward models and (2) gaze-based distribution of sparse rewards at token level. Our experiments demonstate that gaze-informed RLHF achieves faster convergence while maintaining or slightly improving performance, thus, reducing computational costs during policy optimization. These results show that human gaze provides a valuable and underused signal for policy optimization, pointing to a promising direction for improving RLHF efficiency.

Subject: Machine Learning

Publish: 2025-07-11 20:49:04 UTC