GazePrior: Zero-Shot AR/VR Eye Tracking via Learned 3D Gaze Reconstruction

Alexander Fix; Ali Behrooz; Corentin Dumery; David Colmenares; Jogendra Kundu; Pascal Fua

arxiv: 2605.22359 · v1 · pith:UTI32JA7new · submitted 2026-05-21 · 💻 cs.CV

GazePrior: Zero-Shot AR/VR Eye Tracking via Learned 3D Gaze Reconstruction

Corentin Dumery , David Colmenares , Alexander Fix , Pascal Fua , Ali Behrooz , Jogendra Kundu This is my paper

Pith reviewed 2026-05-22 06:20 UTC · model grok-4.3

classification 💻 cs.CV

keywords eye trackingzero-shotAR/VR3D reconstructionsynthetic datagaze estimationprior model

0 comments

The pith

A learned 3D prior on eye appearance and gaze lets researchers synthesize realistic training data for eye trackers on any new AR/VR device without collecting real examples from it.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper sets out to remove the need for costly real data collection every time a new eye-tracking hardware design appears in AR or VR. It does so by first training GazePrior, a statistical model of how human eyes look and move under varied identities, directions, and lighting. The prior then takes existing annotated recordings from older devices, reconstructs their 3D geometry and appearance, and re-renders the eyes exactly as they would appear to the cameras of a completely different target device. A sympathetic reader would care because eye tracking is required for natural interaction in headsets, yet the current practice of gathering fresh real data for each new camera layout remains slow and expensive.

Core claim

GazePrior is a data-driven 3D prior that models the distribution of human eyes across diverse identities, gaze directions, and light settings. It enables sparse-input 3D reconstruction of annotated data collected with previous ET devices, which can then be rendered from the cameras of any target ET device, producing synthetic training data that combines the realism, diversity, and ground-truth accuracy of real collection without its prohibitive costs.

What carries the argument

GazePrior, a learned 3D prior on eye identity, gaze direction, and illumination that supports reconstruction from limited views followed by novel-view rendering for arbitrary camera geometries.

If this is right

Eye-tracking models trained on the synthesized data outperform previous zero-shot methods in both accuracy and robustness.
Ground-truth gaze labels remain available because they are carried through the 3D reconstruction and rendering steps.
The same prior can be used to generate data for any target device simply by specifying its camera parameters, removing the need for new real recordings.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The reconstruction-plus-render pipeline may prove more robust to domain shift than purely image-based synthesis techniques when device optics change substantially.
Extending the same prior to additional facial landmarks could support joint eye-and-face tracking without separate data collection campaigns.

Load-bearing premise

The 3D prior learned from data collected with previous eye-tracking devices will generalize across new device camera geometries, lighting conditions, and user populations without requiring any real data from the target device.

What would settle it

Train an eye-tracking model on GazePrior-synthesized images for a specific new device, then compare its gaze-estimation error on real test images from that same device against the error of a model trained on actual recordings from the device; a large gap in favor of the real-data model would falsify the central claim.

read the original abstract

Eye tracking (ET) is a foundational technology for advanced AR/VR applications. However, training ET models for every new ET device is challenging: real data collection is costly and time-consuming, while existing synthetic data generation methods lack realism. To remove the need for additional data collection while maintaining data quality, we introduce a data-driven 3D prior that models the distribution of human eyes across diverse identities, gaze directions, and light settings. This model, which we coin GazePrior, then enables sparse-input 3D reconstruction of annotated data collected with previous ET devices, which can in turn be rendered from the cameras of any target ET device. Our approach synthesizes data with the realism, diversity and ground-truth accuracy of real data collection without its prohibitive costs. Our experiments demonstrate that ET models trained with our synthesized data outperform previous zero-shot methods, achieving higher accuracy and robustness.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper introduces GazePrior, a learned 3D prior over human eye appearance, shape, and gaze that is trained on data from existing eye-tracking devices. This prior is used to perform sparse-input 3D reconstruction of annotated eye images collected on prior hardware; the resulting 3D models are then re-rendered from the camera geometries, intrinsics, and illumination of a new target device to produce synthetic training data. Eye-tracking models trained on this synthesized data are reported to outperform previous zero-shot baselines in accuracy and robustness.

Significance. If the central claim holds, the work would provide a practical route to zero-shot deployment of eye tracking on new AR/VR hardware, removing the need for per-device real-data collection while retaining the photometric and geometric fidelity required for high-accuracy gaze estimation. The approach directly addresses a recurring deployment bottleneck in the field.

major comments (2)

[§4 and Table 2] §4 (Experiments) and Table 2: the reported outperformance over prior zero-shot methods is stated without accompanying quantitative metrics, error distributions, dataset sizes, or ablation studies on the number of input views or lighting conditions; this makes it impossible to assess whether the gains are robust or sensitive to post-hoc hyper-parameter choices.
[§3.2] §3.2 (3D Reconstruction and Rendering): the claim that the GazePrior captures sufficient variation to generalize across novel camera intrinsics, lens distortion, and IR illumination patterns is load-bearing for the zero-shot claim, yet the manuscript provides no explicit test (e.g., cross-device reconstruction error or photometric consistency metrics) that isolates the domain gap introduced by re-rendering under unseen spectral responses and distortion models.

minor comments (2)

[Figure 3] Figure 3: the caption does not specify the number of input views or the exact camera parameters used for the qualitative re-rendering examples, making it difficult to reproduce the visualization.
[§3.1] Notation in §3.1: the symbol for the learned prior distribution is introduced without an explicit statement of its dimensionality or conditioning variables (identity, gaze, lighting).

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive comments on our work. We address each of the major comments in detail below and have updated the manuscript accordingly to improve clarity and provide additional supporting evidence.

read point-by-point responses

Referee: [§4 and Table 2] §4 (Experiments) and Table 2: the reported outperformance over prior zero-shot methods is stated without accompanying quantitative metrics, error distributions, dataset sizes, or ablation studies on the number of input views or lighting conditions; this makes it impossible to assess whether the gains are robust or sensitive to post-hoc hyper-parameter choices.

Authors: We agree that providing more detailed quantitative analysis would enhance the evaluation section. In the revised version, we have expanded Table 2 to include mean and standard deviation of angular errors, as well as dataset sizes used for training and testing. Additionally, we have included error distribution plots in Section 4 and performed ablations on the number of input views (1, 2, 3, and 5 views) and different lighting conditions. These ablations confirm that the performance improvements are consistent and not sensitive to specific hyper-parameter choices. The gains over baselines remain significant across all tested configurations. revision: yes
Referee: [§3.2] §3.2 (3D Reconstruction and Rendering): the claim that the GazePrior captures sufficient variation to generalize across novel camera intrinsics, lens distortion, and IR illumination patterns is load-bearing for the zero-shot claim, yet the manuscript provides no explicit test (e.g., cross-device reconstruction error or photometric consistency metrics) that isolates the domain gap introduced by re-rendering under unseen spectral responses and distortion models.

Authors: We recognize the importance of explicitly validating the generalization capability of GazePrior to unseen device parameters. To address this, we have added new experiments in the revised Section 3.2, including cross-device reconstruction error metrics and photometric consistency measures (such as PSNR between synthesized and real images from the target device). These results demonstrate that the prior effectively captures the necessary variations, with low reconstruction errors even when re-rendering under novel intrinsics, distortions, and illumination patterns. This supports the robustness of our zero-shot approach. revision: yes

Circularity Check

0 steps flagged

No significant circularity; derivation is self-contained

full rationale

The paper introduces a learned 3D prior (GazePrior) trained on data from prior eye-tracking devices, then uses sparse-input reconstruction and novel-view rendering to synthesize training data for unseen target devices. No equations, fitted parameters, or self-citations are presented that reduce the claimed outperformance to a quantity defined by construction from the same inputs. The central claims rest on empirical generalization of the prior across device geometries and the results of downstream experiments, which remain independently falsifiable and do not collapse into tautology or renaming of known patterns.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Based solely on the abstract, the central claim rests on the unstated assumption that a single learned 3D prior can faithfully capture eye variation across identities and devices; no explicit free parameters, axioms, or invented entities are described.

pith-pipeline@v0.9.0 · 5701 in / 1071 out tokens · 37583 ms · 2026-05-22T06:20:28.038183+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Foundation/RealityFromDistinction.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

We introduce a data-driven 3D prior that models the distribution of human eyes across diverse identities, gaze directions, and light settings... conditional radiance field... latent-vector representation
IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Gaze Latent Vector... sinusoidal encoding... Subject Latent Vector... Lighting Latent Vector

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

95 extracted references · 95 canonical work pages · 1 internal anchor

[1]

Vive pro eye

HTC. Vive pro eye. https://www.vive.com/sea/product/vive-pro-eye/overview/, 2019. VR headset with integrated eye tracking

work page 2019
[2]

Quest pro.https://www.meta.com/quest/quest-pro/, 2022

Meta. Quest pro.https://www.meta.com/quest/quest-pro/, 2022. Eye-tracking VR headset

work page 2022
[3]

Hololens 2

Microsoft. Hololens 2. https://www.microsoft.com/en-us/hololens/buy, 2019. Mixed reality headset with eye tracking

work page 2019
[4]

Apple vision pro.https://www.apple.com/apple-vision-pro/, 2024

Apple Inc. Apple vision pro.https://www.apple.com/apple-vision-pro/, 2024. Mixed reality headset with eye and hand tracking

work page 2024
[5]

Project Aria: A New Tool for Egocentric Multi-Modal AI Research

Jakob Engel, Kiran Somasundaram, Michael Goesele, Albert Sun, Alexander Gamino, Andrew Turner, Arjang Talattof, Arnie Yuan, Bilal Souti, Brighid Meredith, et al. Project aria: A new tool for egocentric multi-modal ai research.arXiv preprint arXiv:2308.13561, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023
[6]

Preface: A data-driven volumetric prior for few-shot ultra high-resolution face synthesis

Marcel C Bühler, Kripasindhu Sarkar, Tanmay Shah, Gengyan Li, Daoye Wang, Leonhard Helminger, Sergio Orts-Escolano, Dmitry Lagun, Otmar Hilliges, Thabo Beeler, et al. Preface: A data-driven volumetric prior for few-shot ultra high-resolution face synthesis. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 3402–3413, 2023

work page 2023
[7]

Cafca: High-quality novel view synthesis of expressive faces from casual few-shot captures

Marcel C Buehler, Gengyan Li, Erroll Wood, Leonhard Helminger, Xu Chen, Tanmay Shah, Daoye Wang, Stephan Garbin, Sergio Orts-Escolano, Otmar Hilliges, et al. Cafca: High-quality novel view synthesis of expressive faces from casual few-shot captures. InSIGGRAPH Asia 2024 Conference Papers, pages 1–12, 2024

work page 2024
[8]

Nonrigid Structure from Motion in Trajectory Space

Jonas Kulhanek and Torsten Sattler. Nonrigid Structure from Motion in Trajectory Space. InAdvances in Neural Information Processing Systems, 2025

work page 2025
[9]

Gazenerf: 3d-aware gaze redirection with neural radiance fields

Alessandro Ruzzi, Xiangwei Shi, Xi Wang, Gengyan Li, Shalini De Mello, Hyung Jin Chang, Xucong Zhang, and Otmar Hilliges. Gazenerf: 3d-aware gaze redirection with neural radiance fields. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 9676–9685, 2023

work page 2023
[10]

What we need is explicit controllability: Training 3d gaze estimator using only facial images

Tingwei Li, Jun Bao, Zhenzhong Kuang, and Buyu Liu. What we need is explicit controllability: Training 3d gaze estimator using only facial images. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 11414–11424, 2025

work page 2025
[11]

Dig- itally prototype your eye tracker: Simulating hardware performance using 3d synthetic data.arXiv preprint arXiv:2503.16742, 2025

Esther YH Lin, Yimin Ding, Jogendra Kundu, Yatong An, Mohamed T El-Haddad, and Alexander Fix. Dig- itally prototype your eye tracker: Simulating hardware performance using 3d synthetic data.arXiv preprint arXiv:2503.16742, 2025. 11

work page arXiv 2025
[12]

Gazegaussian: High-fidelity gaze redirection with 3d gaussian splatting

Xiaobao Wei, Peng Chen, Guangyu Li, Ming Lu, Hui Chen, and Feng Tian. Gazegaussian: High-fidelity gaze redirection with 3d gaussian splatting. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 13293–13303, 2025

work page 2025
[13]

Learning to find eye region landmarks for remote gaze estimation in unconstrained settings

Seonwook Park, Xucong Zhang, Andreas Bulling, and Otmar Hilliges. Learning to find eye region landmarks for remote gaze estimation in unconstrained settings. InProceedings of the 2018 ACM symposium on eye tracking research & applications, pages 1–10, 2018

work page 2018
[14]

3d model-based gaze tracking via iris features with a single camera and a single light source.IEEE Transactions on Human-Machine Systems, 51(2):75–86, 2020

Jiahui Liu, Jiannan Chi, Wenxue Hu, and Zhiliang Wang. 3d model-based gaze tracking via iris features with a single camera and a single light source.IEEE Transactions on Human-Machine Systems, 51(2):75–86, 2020

work page 2020
[15]

Neural 3d gaze: 3d pupil localization and gaze tracking based on anatomical eye model and neural refraction correction

Conny Lu, Praneeth Chakravarthula, Kaihao Liu, Xixiang Liu, Siyuan Li, and Henry Fuchs. Neural 3d gaze: 3d pupil localization and gaze tracking based on anatomical eye model and neural refraction correction. In2022 IEEE international symposium on mixed and augmented reality (ISMAR), pages 375–383. IEEE, 2022

work page 2022
[16]

Appearance-based gaze estimation in the wild

Xucong Zhang, Yusuke Sugano, Mario Fritz, and Andreas Bulling. Appearance-based gaze estimation in the wild. InProceedings of the IEEE conference on computer vision and pattern recognition, pages 4511–4520, 2015

work page 2015
[17]

Person independent 3d gaze estimation from remote rgb-d cameras

Kenneth Alberto Funes Mora and Jean-Marc Odobez. Person independent 3d gaze estimation from remote rgb-d cameras. In2013 IEEE International Conference on Image Processing, pages 2787–2791. IEEE, 2013

work page 2013
[18]

Gaze estimation in the 3d space using rgb-d sensors: towards head-pose and user invariance.International Journal of Computer Vision, 118(2):194–216, 2016

Kenneth A Funes-Mora and Jean-Marc Odobez. Gaze estimation in the 3d space using rgb-d sensors: towards head-pose and user invariance.International Journal of Computer Vision, 118(2):194–216, 2016

work page 2016
[19]

Eyediap: A database for the development and evaluation of gaze estimation algorithms from rgb and rgb-d cameras

Kenneth Alberto Funes Mora, Florent Monay, and Jean-Marc Odobez. Eyediap: A database for the development and evaluation of gaze estimation algorithms from rgb and rgb-d cameras. InProceedings of the symposium on eye tracking research and applications, pages 255–258, 2014

work page 2014
[20]

A 3d morphable eye region model for gaze estimation

Erroll Wood, Tadas Baltrušaitis, Louis-Philippe Morency, Peter Robinson, and Andreas Bulling. A 3d morphable eye region model for gaze estimation. InEuropean conference on computer vision, pages 297–313. Springer, 2016

work page 2016
[21]

Gazedirector: Fully articulated eye gaze redirection in video

Erroll Wood, Tadas Baltrušaitis, Louis-Philippe Morency, Peter Robinson, and Andreas Bulling. Gazedirector: Fully articulated eye gaze redirection in video. InComputer Graphics Forum. Wiley Online Library, 2018

work page 2018
[22]

3dgazenet: Generalizing 3d gaze estimation with weak-supervision from synthetic views

Evangelos Ververas, Polydefkis Gkagkos, Jiankang Deng, Michail Christos Doukas, Jia Guo, and Stefanos Zafeiriou. 3dgazenet: Generalizing 3d gaze estimation with weak-supervision from synthetic views. InEuropean Conference on Computer Vision, pages 387–404. Springer, 2024

work page 2024
[23]

Towards an accurate 3d deformable eye model for gaze estimation

Chenyi Kuang, Jeffery O Kephart, and Qiang Ji. Towards an accurate 3d deformable eye model for gaze estimation. InInternational Conference on Pattern Recognition, pages 109–123. Springer, 2022

work page 2022
[24]

Accurate real-time 3d gaze tracking using a lightweight eyeball calibration

Quan Wen, Derek Bradley, Thabo Beeler, Seonwook Park, Otmar Hilliges, Junhai Yong, and Feng Xu. Accurate real-time 3d gaze tracking using a lightweight eyeball calibration. InComputer Graphics Forum. Wiley Online Library, 2020

work page 2020
[25]

Learning gaze biases with head motion for head pose-free gaze estimation.Image and Vision Computing, 32(3):169–179, 2014

Feng Lu, Takahiro Okabe, Yusuke Sugano, and Yoichi Sato. Learning gaze biases with head motion for head pose-free gaze estimation.Image and Vision Computing, 32(3):169–179, 2014

work page 2014
[26]

Gaze estimation from eye appearance: A head pose-free method via eye image synthesis.IEEE Transactions on Image Processing, 24(11):3680–3693, 2015

Feng Lu, Yusuke Sugano, Takahiro Okabe, and Yoichi Sato. Gaze estimation from eye appearance: A head pose-free method via eye image synthesis.IEEE Transactions on Image Processing, 24(11):3680–3693, 2015

work page 2015
[27]

Inferring human gaze from appearance via adaptive linear regression

Feng Lu, Yusuke Sugano, Takahiro Okabe, and Yoichi Sato. Inferring human gaze from appearance via adaptive linear regression. In2011 International Conference on Computer Vision, pages 153–160. IEEE, 2011

work page 2011
[28]

Eth-xgaze: A large scale dataset for gaze estimation under extreme head pose and gaze variation

Xucong Zhang, Seonwook Park, Thabo Beeler, Derek Bradley, Siyu Tang, and Otmar Hilliges. Eth-xgaze: A large scale dataset for gaze estimation under extreme head pose and gaze variation. InEuropean conference on computer vision, pages 365–381. Springer, 2020

work page 2020
[29]

Mpiigaze: Real-world dataset and deep appearance-based gaze estimation.IEEE transactions on pattern analysis and machine intelligence, 41(1):162–175, 2017

Xucong Zhang, Yusuke Sugano, Mario Fritz, and Andreas Bulling. Mpiigaze: Real-world dataset and deep appearance-based gaze estimation.IEEE transactions on pattern analysis and machine intelligence, 41(1):162–175, 2017

work page 2017
[30]

Eye tracking for everyone

Kyle Krafka, Aditya Khosla, Petr Kellnhofer, Harini Kannan, Suchendra Bhandarkar, Wojciech Matusik, and Antonio Torralba. Eye tracking for everyone. InProceedings of the IEEE conference on computer vision and pattern recognition, pages 2176–2184, 2016. 12

work page 2016
[31]

Rendering of eyes for eye-shape registration and gaze estimation

Erroll Wood, Tadas Baltrusaitis, Xucong Zhang, Yusuke Sugano, Peter Robinson, and Andreas Bulling. Rendering of eyes for eye-shape registration and gaze estimation. InProceedings of the IEEE international conference on computer vision, pages 3756–3764, 2015

work page 2015
[32]

Learning an appearance-based gaze estimator from one million synthesised images

Erroll Wood, Tadas Baltrušaitis, Louis-Philippe Morency, Peter Robinson, and Andreas Bulling. Learning an appearance-based gaze estimator from one million synthesised images. InProceedings of the ninth biennial ACM symposium on eye tracking research & applications, pages 131–138, 2016

work page 2016
[33]

Gazegene: Large-scale synthetic gaze dataset with 3d eyeball annotations

Yiwei Bao, Zhiming Wang, and Feng Lu. Gazegene: Large-scale synthetic gaze dataset with 3d eyeball annotations. InProceedings of the Computer Vision and Pattern Recognition Conference, pages 18749–18759, 2025

work page 2025
[34]

Ga3ce: Unconstrained 3d gaze estimation with gaze-aware 3d context encoding

Yuki Kawana, Shintaro Shiba, Quan Kong, and Norimasa Kobori. Ga3ce: Unconstrained 3d gaze estimation with gaze-aware 3d context encoding. InProceedings of the Computer Vision and Pattern Recognition Conference, pages 3081–3090, 2025

work page 2025
[35]

Enhancing 3d gaze estimation in the wild using weak supervision with gaze following labels

Pierre Vuillecard and Jean-Marc Odobez. Enhancing 3d gaze estimation in the wild using weak supervision with gaze following labels. InProceedings of the Computer Vision and Pattern Recognition Conference, pages 13508–13518, 2025

work page 2025
[36]

Eye tracking in virtual reality.Journal of eye movement research, 12(1):10–16910, 2019

Viviane Clay, Peter König, and Sabine Koenig. Eye tracking in virtual reality.Journal of eye movement research, 12(1):10–16910, 2019

work page 2019
[37]

The eye in extended reality: A survey on gaze interaction and eye tracking in head-worn extended reality.ACM Computing Surveys (CSUR), 55(3):1–39, 2022

Alexander Plopski, Teresa Hirzle, Nahal Norouzi, Long Qian, Gerd Bruder, and Tobias Langlotz. The eye in extended reality: A survey on gaze interaction and eye tracking in head-worn extended reality.ACM Computing Surveys (CSUR), 55(3):1–39, 2022

work page 2022
[38]

Towards foveated rendering for gaze-tracked virtual reality.ACM Transactions On Graphics (TOG), 35 (6):1–12, 2016

Anjul Patney, Marco Salvi, Joohwan Kim, Anton Kaplanyan, Chris Wyman, Nir Benty, David Luebke, and Aaron Lefohn. Towards foveated rendering for gaze-tracked virtual reality.ACM Transactions On Graphics (TOG), 35 (6):1–12, 2016

work page 2016
[39]

Gaze+ pinch interaction in virtual reality

Ken Pfeuffer, Benedikt Mayer, Diako Mardanbegi, and Hans Gellersen. Gaze+ pinch interaction in virtual reality. InProceedings of the 5th symposium on spatial user interaction, pages 99–108, 2017

work page 2017
[40]

Pinch, click, or dwell: Comparing different selection techniques for eye-gaze-based pointing in virtual reality

Aunnoy K Mutasim, Anil Ufuk Batmaz, and Wolfgang Stuerzlinger. Pinch, click, or dwell: Comparing different selection techniques for eye-gaze-based pointing in virtual reality. InAcm symposium on eye tracking research and applications, pages 1–7, 2021

work page 2021
[41]

egoppg: Heart rate estimation from eye-tracking cameras in egocentric systems to benefit downstream vision tasks

Björn Braun, Rayan Armani, Manuel Meier, Max Moebus, and Christian Holz. egoppg: Heart rate estimation from eye-tracking cameras in egocentric systems to benefit downstream vision tasks. InProceedings of the IEEE/CVF International Conference on Computer Vision, 2023

work page 2023
[42]

Challenges of eye tracking systems for mobile xr glasses

Injoon Hong, Kyeongryeol Bong, and Hoi-Jun Yoo. Challenges of eye tracking systems for mobile xr glasses. In Applications of Digital Image Processing XLI, volume 10752, pages 391–397. SPIE, 2018

work page 2018
[43]

Privacy-preserving datasets of eye-tracking samples with applications in xr.IEEE Transactions on Visualization and Computer Graphics, 29(5):2774–2784, 2023

Brendan David-John, Kevin Butler, and Eakta Jain. Privacy-preserving datasets of eye-tracking samples with applications in xr.IEEE Transactions on Visualization and Computer Graphics, 29(5):2774–2784, 2023

work page 2023
[44]

E-track: Eye tracking with event camera for extended reality (xr) applications

Nealson Li, Ashwin Bhat, and Arijit Raychowdhury. E-track: Eye tracking with event camera for extended reality (xr) applications. In2023 IEEE 5th International Conference on Artificial Intelligence Circuits and Systems (AICAS), pages 1–5. IEEE, 2023

work page 2023
[45]

Polarization-resolved imaging improves eye tracking.arXiv preprint arXiv:2511.04652, 2025

Mantas Žurauskas, Tom Bu, Sanaz Alali, Beyza Kalkanli, Derek Shi, Fernando Alamos, Gauresh Pandit, Christopher Mei, Ali Behrooz, Ramin Mirjalili, et al. Polarization-resolved imaging improves eye tracking.arXiv preprint arXiv:2511.04652, 2025

work page arXiv 2025
[46]

Nerf: Representing scenes as neural radiance fields for view synthesis.Communications of the ACM, 65(1):99–106, 2021

Ben Mildenhall, Pratul P Srinivasan, Matthew Tancik, Jonathan T Barron, Ravi Ramamoorthi, and Ren Ng. Nerf: Representing scenes as neural radiance fields for view synthesis.Communications of the ACM, 65(1):99–106, 2021

work page 2021
[47]

Instant neural graphics primitives with a multiresolution hash encoding.ACM transactions on graphics (TOG), 41(4):1–15, 2022

Thomas Müller, Alex Evans, Christoph Schied, and Alexander Keller. Instant neural graphics primitives with a multiresolution hash encoding.ACM transactions on graphics (TOG), 41(4):1–15, 2022

work page 2022
[48]

3d gaussian splatting for real-time radiance field rendering.ACM Trans

Bernhard Kerbl, Georgios Kopanas, Thomas Leimkühler, and George Drettakis. 3d gaussian splatting for real-time radiance field rendering.ACM Trans. Graph., 42(4):139–1, 2023

work page 2023
[49]

Mip-nerf 360: Unbounded anti-aliased neural radiance fields

Jonathan T Barron, Ben Mildenhall, Dor Verbin, Pratul P Srinivasan, and Peter Hedman. Mip-nerf 360: Unbounded anti-aliased neural radiance fields. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 5470–5479, 2022. 13

work page 2022
[50]

Zip-nerf: Anti-aliased grid-based neural radiance fields

Jonathan T Barron, Ben Mildenhall, Dor Verbin, Pratul P Srinivasan, and Peter Hedman. Zip-nerf: Anti-aliased grid-based neural radiance fields. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 19697–19705, 2023

work page 2023
[51]

3d gaussian splatting as markov chain monte carlo.Advances in Neural Information Processing Systems, 37:80965–80986, 2024

Shakiba Kheradmand, Daniel Rebain, Gopal Sharma, Weiwei Sun, Yang-Che Tseng, Hossam Isack, Abhishek Kar, Andrea Tagliasacchi, and Kwang Moo Yi. 3d gaussian splatting as markov chain monte carlo.Advances in Neural Information Processing Systems, 37:80965–80986, 2024

work page 2024
[52]

3dgut: Enabling distorted cameras and secondary rays in gaussian splatting

Qi Wu, Janick Martinez Esturo, Ashkan Mirzaei, Nicolas Moenne-Loccoz, and Zan Gojcic. 3dgut: Enabling distorted cameras and secondary rays in gaussian splatting. InProceedings of the Computer Vision and Pattern Recognition Conference, pages 26036–26046, 2025

work page 2025
[53]

Scaffold-gs: Structured 3d gaussians for view-adaptive rendering

Tao Lu, Mulin Yu, Linning Xu, Yuanbo Xiangli, Limin Wang, Dahua Lin, and Bo Dai. Scaffold-gs: Structured 3d gaussians for view-adaptive rendering. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 20654–20664, 2024

work page 2024
[54]

Compressed 3d gaussian splatting for accelerated novel view synthesis

Simon Niedermayr, Josef Stumpfegger, and Rüdiger Westermann. Compressed 3d gaussian splatting for accelerated novel view synthesis. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 10349–10358, 2024

work page 2024
[55]

Temporally compressed 3d gaussian splatting for dynamic scenes.arXiv preprint arXiv:2412.05700, 4, 2024

Saqib Javed, Ahmad Jarrar Khan, Corentin Dumery, Chen Zhao, and Mathieu Salzmann. Temporally compressed 3d gaussian splatting for dynamic scenes.arXiv preprint arXiv:2412.05700, 4, 2024

work page arXiv 2024
[56]

A view-consistent sampling method for regularized training of neural radiance fields

Aoxiang Fan, Corentin Dumery, Nicolas Talabot, and Pascal Fua. A view-consistent sampling method for regularized training of neural radiance fields. InInternational Conference on Computer Vision (ICCV), 2025

work page 2025
[57]

Nerfacc: Efficient sampling accelerates nerfs

Ruilong Li, Hang Gao, Matthew Tancik, and Angjoo Kanazawa. Nerfacc: Efficient sampling accelerates nerfs. In Proceedings of the IEEE/CVF international conference on computer vision, pages 18537–18546, 2023

work page 2023
[58]

Enforcing View-Consistency in Class-Agnostic 3D Segmentation Fields.CVPRW, 2025

Corentin Dumery, Aoxiang Fan, Ren Li, Nicolas Talabot, and Pascal Fua. Enforcing View-Consistency in Class-Agnostic 3D Segmentation Fields.CVPRW, 2025

work page 2025
[59]

Lerf: Language embedded radiance fields

Justin Kerr, Chung Min Kim, Ken Goldberg, Angjoo Kanazawa, and Matthew Tancik. Lerf: Language embedded radiance fields. InProceedings of the IEEE/CVF international conference on computer vision, pages 19729–19739, 2023

work page 2023
[60]

Novel view synthesis with diffusion models

Daniel Watson, William Chan, Ricardo Martin Brualla, Jonathan Ho, Andrea Tagliasacchi, and Mohammad Norouzi. Novel view synthesis with diffusion models. InThe Eleventh International Conference on Learning Representations, 2022

work page 2022
[61]

Generative novel view synthesis with 3d-aware diffusion models

Eric R Chan, Koki Nagano, Matthew A Chan, Alexander W Bergman, Jeong Joon Park, Axel Levy, Miika Aittala, Shalini De Mello, Tero Karras, and Gordon Wetzstein. Generative novel view synthesis with 3d-aware diffusion models. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 4217–4229, 2023

work page 2023
[62]

Dreamfusion: Text-to-3d using 2d diffusion

Ben Poole, Ajay Jain, Jonathan T Barron, and Ben Mildenhall. Dreamfusion: Text-to-3d using 2d diffusion. In The Eleventh International Conference on Learning Representations

work page
[63]

Zero-1-to-3: Zero-shot one image to 3d object

Ruoshi Liu, Rundi Wu, Basile Van Hoorick, Pavel Tokmakov, Sergey Zakharov, and Carl Vondrick. Zero-1-to-3: Zero-shot one image to 3d object. InProceedings of the IEEE/CVF international conference on computer vision, pages 9298–9309, 2023

work page 2023
[64]

Self-learning transformations for improving gaze and head redirection.Advances in Neural Information Processing Systems, 33:13127–13138, 2020

Yufeng Zheng, Seonwook Park, Xucong Zhang, Shalini De Mello, and Otmar Hilliges. Self-learning transformations for improving gaze and head redirection.Advances in Neural Information Processing Systems, 33:13127–13138, 2020

work page 2020
[65]

Few-shot adaptive gaze estimation

Seonwook Park, Shalini De Mello, Pavlo Molchanov, Umar Iqbal, Otmar Hilliges, and Jan Kautz. Few-shot adaptive gaze estimation. InProceedings of the IEEE/CVF international conference on computer vision, pages 9368–9377, 2019

work page 2019
[66]

Photo-realistic monocular gaze redirection using generative adversarial networks

Zhe He, Adrian Spurr, Xucong Zhang, and Otmar Hilliges. Photo-realistic monocular gaze redirection using generative adversarial networks. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 6932–6941, 2019

work page 2019
[67]

A hierarchical generative model for eye image synthesis and eye gaze estimation

Kang Wang, Rui Zhao, and Qiang Ji. A hierarchical generative model for eye image synthesis and eye gaze estimation. InProceedings of the IEEE conference on computer vision and pattern recognition, pages 440–448, 2018. 14

work page 2018
[68]

Controllable continuous gaze redirection

Weihao Xia, Yujiu Yang, Jing-Hao Xue, and Wensen Feng. Controllable continuous gaze redirection. InProceedings of the 28th ACM International Conference on Multimedia, pages 1782–1790, 2020

work page 2020
[69]

Gaze manipulation for one-to-one teleconferencing

Criminisi, Shotton, Blake, and Torr. Gaze manipulation for one-to-one teleconferencing. InProceedings Ninth IEEE International Conference on Computer Vision, pages 191–198. IEEE, 2003

work page 2003
[70]

Gaze correction with a single webcam

Dominik Giger, Jean-Charles Bazin, Claudia Kuster, Tiberiu Popa, and Markus Gross. Gaze correction with a single webcam. In2014 IEEE International Conference on Multimedia and Expo (ICME), pages 1–6. IEEE, 2014

work page 2014
[71]

Gaze correction for home video conferencing.ACM Transactions on Graphics (TOG), 31(6):1–6, 2012

Claudia Kuster, Tiberiu Popa, Jean-Charles Bazin, Craig Gotsman, and Markus Gross. Gaze correction for home video conferencing.ACM Transactions on Graphics (TOG), 31(6):1–6, 2012

work page 2012
[72]

Learning to look up: Realtime monocular gaze correction using machine learning

Daniil Kononenko and Victor Lempitsky. Learning to look up: Realtime monocular gaze correction using machine learning. InProceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 4667–4675, 2015

work page 2015
[73]

Eyeopener: Editing eyes in the wild.ACM Transactions on Graphics (TOG), 36(1):1–13, 2016

Zhixin Shu, Eli Shechtman, Dimitris Samaras, and Sunil Hadap. Eyeopener: Editing eyes in the wild.ACM Transactions on Graphics (TOG), 36(1):1–13, 2016

work page 2016
[74]

Deepwarp: Photorealistic image resynthesis for gaze manipulation

Yaroslav Ganin, Daniil Kononenko, Diana Sungatullina, and Victor Lempitsky. Deepwarp: Photorealistic image resynthesis for gaze manipulation. InEuropean conference on computer vision, pages 311–326. Springer, 2016

work page 2016
[75]

Learning a model of facial shape and expression from 4d scans.ACM Trans

Tianye Li, Timo Bolkart, Michael J Black, Hao Li, and Javier Romero. Learning a model of facial shape and expression from 4d scans.ACM Trans. Graph., 36(6):194–1, 2017

work page 2017
[76]

A morphable model for the synthesis of 3d faces

Volker Blanz and Thomas Vetter. A morphable model for the synthesis of 3d faces. InSeminal Graphics Papers: Pushing the Boundaries, Volume 2, pages 157–164, 2023

work page 2023
[77]

Combining 3d morphable models: A large scale face-and-head model

Stylianos Ploumpis, Haoyang Wang, Nick Pears, William AP Smith, and Stefanos Zafeiriou. Combining 3d morphable models: A large scale face-and-head model. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 10934–10943, 2019

work page 2019
[78]

Gaussianhead: High-fidelity head avatars with learnable gaussian derivation.IEEE Transactions on Visualization and Computer Graphics, 2025

Jie Wang, Jiu-Cheng Xie, Xianyan Li, Feng Xu, Chi-Man Pun, and Hao Gao. Gaussianhead: High-fidelity head avatars with learnable gaussian derivation.IEEE Transactions on Visualization and Computer Graphics, 2025

work page 2025
[79]

Flashavatar: High-fidelity head avatar with efficient gaussian embedding

Jun Xiang, Xuan Gao, Yudong Guo, and Juyong Zhang. Flashavatar: High-fidelity head avatar with efficient gaussian embedding. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1802–1812, 2024

work page 2024
[80]

Hq3davatar: High-quality implicit 3d head avatar.ACM Transactions on Graphics, 43(3):1–24, 2024

Kartik Teotia, Mallikarjun B R, Xingang Pan, Hyeongwoo Kim, Pablo Garrido, Mohamed Elgharib, and Christian Theobalt. Hq3davatar: High-quality implicit 3d head avatar.ACM Transactions on Graphics, 43(3):1–24, 2024

work page 2024

Showing first 80 references.

[1] [1]

Vive pro eye

HTC. Vive pro eye. https://www.vive.com/sea/product/vive-pro-eye/overview/, 2019. VR headset with integrated eye tracking

work page 2019

[2] [2]

Quest pro.https://www.meta.com/quest/quest-pro/, 2022

Meta. Quest pro.https://www.meta.com/quest/quest-pro/, 2022. Eye-tracking VR headset

work page 2022

[3] [3]

Hololens 2

Microsoft. Hololens 2. https://www.microsoft.com/en-us/hololens/buy, 2019. Mixed reality headset with eye tracking

work page 2019

[4] [4]

Apple vision pro.https://www.apple.com/apple-vision-pro/, 2024

Apple Inc. Apple vision pro.https://www.apple.com/apple-vision-pro/, 2024. Mixed reality headset with eye and hand tracking

work page 2024

[5] [5]

Project Aria: A New Tool for Egocentric Multi-Modal AI Research

Jakob Engel, Kiran Somasundaram, Michael Goesele, Albert Sun, Alexander Gamino, Andrew Turner, Arjang Talattof, Arnie Yuan, Bilal Souti, Brighid Meredith, et al. Project aria: A new tool for egocentric multi-modal ai research.arXiv preprint arXiv:2308.13561, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023

[6] [6]

Preface: A data-driven volumetric prior for few-shot ultra high-resolution face synthesis

Marcel C Bühler, Kripasindhu Sarkar, Tanmay Shah, Gengyan Li, Daoye Wang, Leonhard Helminger, Sergio Orts-Escolano, Dmitry Lagun, Otmar Hilliges, Thabo Beeler, et al. Preface: A data-driven volumetric prior for few-shot ultra high-resolution face synthesis. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 3402–3413, 2023

work page 2023

[7] [7]

Cafca: High-quality novel view synthesis of expressive faces from casual few-shot captures

Marcel C Buehler, Gengyan Li, Erroll Wood, Leonhard Helminger, Xu Chen, Tanmay Shah, Daoye Wang, Stephan Garbin, Sergio Orts-Escolano, Otmar Hilliges, et al. Cafca: High-quality novel view synthesis of expressive faces from casual few-shot captures. InSIGGRAPH Asia 2024 Conference Papers, pages 1–12, 2024

work page 2024

[8] [8]

Nonrigid Structure from Motion in Trajectory Space

Jonas Kulhanek and Torsten Sattler. Nonrigid Structure from Motion in Trajectory Space. InAdvances in Neural Information Processing Systems, 2025

work page 2025

[9] [9]

Gazenerf: 3d-aware gaze redirection with neural radiance fields

Alessandro Ruzzi, Xiangwei Shi, Xi Wang, Gengyan Li, Shalini De Mello, Hyung Jin Chang, Xucong Zhang, and Otmar Hilliges. Gazenerf: 3d-aware gaze redirection with neural radiance fields. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 9676–9685, 2023

work page 2023

[10] [10]

What we need is explicit controllability: Training 3d gaze estimator using only facial images

Tingwei Li, Jun Bao, Zhenzhong Kuang, and Buyu Liu. What we need is explicit controllability: Training 3d gaze estimator using only facial images. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 11414–11424, 2025

work page 2025

[11] [11]

Dig- itally prototype your eye tracker: Simulating hardware performance using 3d synthetic data.arXiv preprint arXiv:2503.16742, 2025

Esther YH Lin, Yimin Ding, Jogendra Kundu, Yatong An, Mohamed T El-Haddad, and Alexander Fix. Dig- itally prototype your eye tracker: Simulating hardware performance using 3d synthetic data.arXiv preprint arXiv:2503.16742, 2025. 11

work page arXiv 2025

[12] [12]

Gazegaussian: High-fidelity gaze redirection with 3d gaussian splatting

Xiaobao Wei, Peng Chen, Guangyu Li, Ming Lu, Hui Chen, and Feng Tian. Gazegaussian: High-fidelity gaze redirection with 3d gaussian splatting. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 13293–13303, 2025

work page 2025

[13] [13]

Learning to find eye region landmarks for remote gaze estimation in unconstrained settings

Seonwook Park, Xucong Zhang, Andreas Bulling, and Otmar Hilliges. Learning to find eye region landmarks for remote gaze estimation in unconstrained settings. InProceedings of the 2018 ACM symposium on eye tracking research & applications, pages 1–10, 2018

work page 2018

[14] [14]

3d model-based gaze tracking via iris features with a single camera and a single light source.IEEE Transactions on Human-Machine Systems, 51(2):75–86, 2020

Jiahui Liu, Jiannan Chi, Wenxue Hu, and Zhiliang Wang. 3d model-based gaze tracking via iris features with a single camera and a single light source.IEEE Transactions on Human-Machine Systems, 51(2):75–86, 2020

work page 2020

[15] [15]

Neural 3d gaze: 3d pupil localization and gaze tracking based on anatomical eye model and neural refraction correction

Conny Lu, Praneeth Chakravarthula, Kaihao Liu, Xixiang Liu, Siyuan Li, and Henry Fuchs. Neural 3d gaze: 3d pupil localization and gaze tracking based on anatomical eye model and neural refraction correction. In2022 IEEE international symposium on mixed and augmented reality (ISMAR), pages 375–383. IEEE, 2022

work page 2022

[16] [16]

Appearance-based gaze estimation in the wild

Xucong Zhang, Yusuke Sugano, Mario Fritz, and Andreas Bulling. Appearance-based gaze estimation in the wild. InProceedings of the IEEE conference on computer vision and pattern recognition, pages 4511–4520, 2015

work page 2015

[17] [17]

Person independent 3d gaze estimation from remote rgb-d cameras

Kenneth Alberto Funes Mora and Jean-Marc Odobez. Person independent 3d gaze estimation from remote rgb-d cameras. In2013 IEEE International Conference on Image Processing, pages 2787–2791. IEEE, 2013

work page 2013

[18] [18]

Gaze estimation in the 3d space using rgb-d sensors: towards head-pose and user invariance.International Journal of Computer Vision, 118(2):194–216, 2016

Kenneth A Funes-Mora and Jean-Marc Odobez. Gaze estimation in the 3d space using rgb-d sensors: towards head-pose and user invariance.International Journal of Computer Vision, 118(2):194–216, 2016

work page 2016

[19] [19]

Eyediap: A database for the development and evaluation of gaze estimation algorithms from rgb and rgb-d cameras

Kenneth Alberto Funes Mora, Florent Monay, and Jean-Marc Odobez. Eyediap: A database for the development and evaluation of gaze estimation algorithms from rgb and rgb-d cameras. InProceedings of the symposium on eye tracking research and applications, pages 255–258, 2014

work page 2014

[20] [20]

A 3d morphable eye region model for gaze estimation

Erroll Wood, Tadas Baltrušaitis, Louis-Philippe Morency, Peter Robinson, and Andreas Bulling. A 3d morphable eye region model for gaze estimation. InEuropean conference on computer vision, pages 297–313. Springer, 2016

work page 2016

[21] [21]

Gazedirector: Fully articulated eye gaze redirection in video

Erroll Wood, Tadas Baltrušaitis, Louis-Philippe Morency, Peter Robinson, and Andreas Bulling. Gazedirector: Fully articulated eye gaze redirection in video. InComputer Graphics Forum. Wiley Online Library, 2018

work page 2018

[22] [22]

3dgazenet: Generalizing 3d gaze estimation with weak-supervision from synthetic views

Evangelos Ververas, Polydefkis Gkagkos, Jiankang Deng, Michail Christos Doukas, Jia Guo, and Stefanos Zafeiriou. 3dgazenet: Generalizing 3d gaze estimation with weak-supervision from synthetic views. InEuropean Conference on Computer Vision, pages 387–404. Springer, 2024

work page 2024

[23] [23]

Towards an accurate 3d deformable eye model for gaze estimation

Chenyi Kuang, Jeffery O Kephart, and Qiang Ji. Towards an accurate 3d deformable eye model for gaze estimation. InInternational Conference on Pattern Recognition, pages 109–123. Springer, 2022

work page 2022

[24] [24]

Accurate real-time 3d gaze tracking using a lightweight eyeball calibration

Quan Wen, Derek Bradley, Thabo Beeler, Seonwook Park, Otmar Hilliges, Junhai Yong, and Feng Xu. Accurate real-time 3d gaze tracking using a lightweight eyeball calibration. InComputer Graphics Forum. Wiley Online Library, 2020

work page 2020

[25] [25]

Learning gaze biases with head motion for head pose-free gaze estimation.Image and Vision Computing, 32(3):169–179, 2014

Feng Lu, Takahiro Okabe, Yusuke Sugano, and Yoichi Sato. Learning gaze biases with head motion for head pose-free gaze estimation.Image and Vision Computing, 32(3):169–179, 2014

work page 2014

[26] [26]

Gaze estimation from eye appearance: A head pose-free method via eye image synthesis.IEEE Transactions on Image Processing, 24(11):3680–3693, 2015

Feng Lu, Yusuke Sugano, Takahiro Okabe, and Yoichi Sato. Gaze estimation from eye appearance: A head pose-free method via eye image synthesis.IEEE Transactions on Image Processing, 24(11):3680–3693, 2015

work page 2015

[27] [27]

Inferring human gaze from appearance via adaptive linear regression

Feng Lu, Yusuke Sugano, Takahiro Okabe, and Yoichi Sato. Inferring human gaze from appearance via adaptive linear regression. In2011 International Conference on Computer Vision, pages 153–160. IEEE, 2011

work page 2011

[28] [28]

Eth-xgaze: A large scale dataset for gaze estimation under extreme head pose and gaze variation

Xucong Zhang, Seonwook Park, Thabo Beeler, Derek Bradley, Siyu Tang, and Otmar Hilliges. Eth-xgaze: A large scale dataset for gaze estimation under extreme head pose and gaze variation. InEuropean conference on computer vision, pages 365–381. Springer, 2020

work page 2020

[29] [29]

Mpiigaze: Real-world dataset and deep appearance-based gaze estimation.IEEE transactions on pattern analysis and machine intelligence, 41(1):162–175, 2017

Xucong Zhang, Yusuke Sugano, Mario Fritz, and Andreas Bulling. Mpiigaze: Real-world dataset and deep appearance-based gaze estimation.IEEE transactions on pattern analysis and machine intelligence, 41(1):162–175, 2017

work page 2017

[30] [30]

Eye tracking for everyone

Kyle Krafka, Aditya Khosla, Petr Kellnhofer, Harini Kannan, Suchendra Bhandarkar, Wojciech Matusik, and Antonio Torralba. Eye tracking for everyone. InProceedings of the IEEE conference on computer vision and pattern recognition, pages 2176–2184, 2016. 12

work page 2016

[31] [31]

Rendering of eyes for eye-shape registration and gaze estimation

Erroll Wood, Tadas Baltrusaitis, Xucong Zhang, Yusuke Sugano, Peter Robinson, and Andreas Bulling. Rendering of eyes for eye-shape registration and gaze estimation. InProceedings of the IEEE international conference on computer vision, pages 3756–3764, 2015

work page 2015

[32] [32]

Learning an appearance-based gaze estimator from one million synthesised images

Erroll Wood, Tadas Baltrušaitis, Louis-Philippe Morency, Peter Robinson, and Andreas Bulling. Learning an appearance-based gaze estimator from one million synthesised images. InProceedings of the ninth biennial ACM symposium on eye tracking research & applications, pages 131–138, 2016

work page 2016

[33] [33]

Gazegene: Large-scale synthetic gaze dataset with 3d eyeball annotations

Yiwei Bao, Zhiming Wang, and Feng Lu. Gazegene: Large-scale synthetic gaze dataset with 3d eyeball annotations. InProceedings of the Computer Vision and Pattern Recognition Conference, pages 18749–18759, 2025

work page 2025

[34] [34]

Ga3ce: Unconstrained 3d gaze estimation with gaze-aware 3d context encoding

Yuki Kawana, Shintaro Shiba, Quan Kong, and Norimasa Kobori. Ga3ce: Unconstrained 3d gaze estimation with gaze-aware 3d context encoding. InProceedings of the Computer Vision and Pattern Recognition Conference, pages 3081–3090, 2025

work page 2025

[35] [35]

Enhancing 3d gaze estimation in the wild using weak supervision with gaze following labels

Pierre Vuillecard and Jean-Marc Odobez. Enhancing 3d gaze estimation in the wild using weak supervision with gaze following labels. InProceedings of the Computer Vision and Pattern Recognition Conference, pages 13508–13518, 2025

work page 2025

[36] [36]

Eye tracking in virtual reality.Journal of eye movement research, 12(1):10–16910, 2019

Viviane Clay, Peter König, and Sabine Koenig. Eye tracking in virtual reality.Journal of eye movement research, 12(1):10–16910, 2019

work page 2019

[37] [37]

The eye in extended reality: A survey on gaze interaction and eye tracking in head-worn extended reality.ACM Computing Surveys (CSUR), 55(3):1–39, 2022

Alexander Plopski, Teresa Hirzle, Nahal Norouzi, Long Qian, Gerd Bruder, and Tobias Langlotz. The eye in extended reality: A survey on gaze interaction and eye tracking in head-worn extended reality.ACM Computing Surveys (CSUR), 55(3):1–39, 2022

work page 2022

[38] [38]

Towards foveated rendering for gaze-tracked virtual reality.ACM Transactions On Graphics (TOG), 35 (6):1–12, 2016

Anjul Patney, Marco Salvi, Joohwan Kim, Anton Kaplanyan, Chris Wyman, Nir Benty, David Luebke, and Aaron Lefohn. Towards foveated rendering for gaze-tracked virtual reality.ACM Transactions On Graphics (TOG), 35 (6):1–12, 2016

work page 2016

[39] [39]

Gaze+ pinch interaction in virtual reality

Ken Pfeuffer, Benedikt Mayer, Diako Mardanbegi, and Hans Gellersen. Gaze+ pinch interaction in virtual reality. InProceedings of the 5th symposium on spatial user interaction, pages 99–108, 2017

work page 2017

[40] [40]

Pinch, click, or dwell: Comparing different selection techniques for eye-gaze-based pointing in virtual reality

Aunnoy K Mutasim, Anil Ufuk Batmaz, and Wolfgang Stuerzlinger. Pinch, click, or dwell: Comparing different selection techniques for eye-gaze-based pointing in virtual reality. InAcm symposium on eye tracking research and applications, pages 1–7, 2021

work page 2021

[41] [41]

egoppg: Heart rate estimation from eye-tracking cameras in egocentric systems to benefit downstream vision tasks

Björn Braun, Rayan Armani, Manuel Meier, Max Moebus, and Christian Holz. egoppg: Heart rate estimation from eye-tracking cameras in egocentric systems to benefit downstream vision tasks. InProceedings of the IEEE/CVF International Conference on Computer Vision, 2023

work page 2023

[42] [42]

Challenges of eye tracking systems for mobile xr glasses

Injoon Hong, Kyeongryeol Bong, and Hoi-Jun Yoo. Challenges of eye tracking systems for mobile xr glasses. In Applications of Digital Image Processing XLI, volume 10752, pages 391–397. SPIE, 2018

work page 2018

[43] [43]

Privacy-preserving datasets of eye-tracking samples with applications in xr.IEEE Transactions on Visualization and Computer Graphics, 29(5):2774–2784, 2023

Brendan David-John, Kevin Butler, and Eakta Jain. Privacy-preserving datasets of eye-tracking samples with applications in xr.IEEE Transactions on Visualization and Computer Graphics, 29(5):2774–2784, 2023

work page 2023

[44] [44]

E-track: Eye tracking with event camera for extended reality (xr) applications

Nealson Li, Ashwin Bhat, and Arijit Raychowdhury. E-track: Eye tracking with event camera for extended reality (xr) applications. In2023 IEEE 5th International Conference on Artificial Intelligence Circuits and Systems (AICAS), pages 1–5. IEEE, 2023

work page 2023

[45] [45]

Polarization-resolved imaging improves eye tracking.arXiv preprint arXiv:2511.04652, 2025

Mantas Žurauskas, Tom Bu, Sanaz Alali, Beyza Kalkanli, Derek Shi, Fernando Alamos, Gauresh Pandit, Christopher Mei, Ali Behrooz, Ramin Mirjalili, et al. Polarization-resolved imaging improves eye tracking.arXiv preprint arXiv:2511.04652, 2025

work page arXiv 2025

[46] [46]

Nerf: Representing scenes as neural radiance fields for view synthesis.Communications of the ACM, 65(1):99–106, 2021

Ben Mildenhall, Pratul P Srinivasan, Matthew Tancik, Jonathan T Barron, Ravi Ramamoorthi, and Ren Ng. Nerf: Representing scenes as neural radiance fields for view synthesis.Communications of the ACM, 65(1):99–106, 2021

work page 2021

[47] [47]

Instant neural graphics primitives with a multiresolution hash encoding.ACM transactions on graphics (TOG), 41(4):1–15, 2022

Thomas Müller, Alex Evans, Christoph Schied, and Alexander Keller. Instant neural graphics primitives with a multiresolution hash encoding.ACM transactions on graphics (TOG), 41(4):1–15, 2022

work page 2022

[48] [48]

3d gaussian splatting for real-time radiance field rendering.ACM Trans

Bernhard Kerbl, Georgios Kopanas, Thomas Leimkühler, and George Drettakis. 3d gaussian splatting for real-time radiance field rendering.ACM Trans. Graph., 42(4):139–1, 2023

work page 2023

[49] [49]

Mip-nerf 360: Unbounded anti-aliased neural radiance fields

Jonathan T Barron, Ben Mildenhall, Dor Verbin, Pratul P Srinivasan, and Peter Hedman. Mip-nerf 360: Unbounded anti-aliased neural radiance fields. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 5470–5479, 2022. 13

work page 2022

[50] [50]

Zip-nerf: Anti-aliased grid-based neural radiance fields

Jonathan T Barron, Ben Mildenhall, Dor Verbin, Pratul P Srinivasan, and Peter Hedman. Zip-nerf: Anti-aliased grid-based neural radiance fields. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 19697–19705, 2023

work page 2023

[51] [51]

3d gaussian splatting as markov chain monte carlo.Advances in Neural Information Processing Systems, 37:80965–80986, 2024

Shakiba Kheradmand, Daniel Rebain, Gopal Sharma, Weiwei Sun, Yang-Che Tseng, Hossam Isack, Abhishek Kar, Andrea Tagliasacchi, and Kwang Moo Yi. 3d gaussian splatting as markov chain monte carlo.Advances in Neural Information Processing Systems, 37:80965–80986, 2024

work page 2024

[52] [52]

3dgut: Enabling distorted cameras and secondary rays in gaussian splatting

Qi Wu, Janick Martinez Esturo, Ashkan Mirzaei, Nicolas Moenne-Loccoz, and Zan Gojcic. 3dgut: Enabling distorted cameras and secondary rays in gaussian splatting. InProceedings of the Computer Vision and Pattern Recognition Conference, pages 26036–26046, 2025

work page 2025

[53] [53]

Scaffold-gs: Structured 3d gaussians for view-adaptive rendering

Tao Lu, Mulin Yu, Linning Xu, Yuanbo Xiangli, Limin Wang, Dahua Lin, and Bo Dai. Scaffold-gs: Structured 3d gaussians for view-adaptive rendering. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 20654–20664, 2024

work page 2024

[54] [54]

Compressed 3d gaussian splatting for accelerated novel view synthesis

Simon Niedermayr, Josef Stumpfegger, and Rüdiger Westermann. Compressed 3d gaussian splatting for accelerated novel view synthesis. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 10349–10358, 2024

work page 2024

[55] [55]

Temporally compressed 3d gaussian splatting for dynamic scenes.arXiv preprint arXiv:2412.05700, 4, 2024

Saqib Javed, Ahmad Jarrar Khan, Corentin Dumery, Chen Zhao, and Mathieu Salzmann. Temporally compressed 3d gaussian splatting for dynamic scenes.arXiv preprint arXiv:2412.05700, 4, 2024

work page arXiv 2024

[56] [56]

A view-consistent sampling method for regularized training of neural radiance fields

Aoxiang Fan, Corentin Dumery, Nicolas Talabot, and Pascal Fua. A view-consistent sampling method for regularized training of neural radiance fields. InInternational Conference on Computer Vision (ICCV), 2025

work page 2025

[57] [57]

Nerfacc: Efficient sampling accelerates nerfs

Ruilong Li, Hang Gao, Matthew Tancik, and Angjoo Kanazawa. Nerfacc: Efficient sampling accelerates nerfs. In Proceedings of the IEEE/CVF international conference on computer vision, pages 18537–18546, 2023

work page 2023

[58] [58]

Enforcing View-Consistency in Class-Agnostic 3D Segmentation Fields.CVPRW, 2025

Corentin Dumery, Aoxiang Fan, Ren Li, Nicolas Talabot, and Pascal Fua. Enforcing View-Consistency in Class-Agnostic 3D Segmentation Fields.CVPRW, 2025

work page 2025

[59] [59]

Lerf: Language embedded radiance fields

Justin Kerr, Chung Min Kim, Ken Goldberg, Angjoo Kanazawa, and Matthew Tancik. Lerf: Language embedded radiance fields. InProceedings of the IEEE/CVF international conference on computer vision, pages 19729–19739, 2023

work page 2023

[60] [60]

Novel view synthesis with diffusion models

Daniel Watson, William Chan, Ricardo Martin Brualla, Jonathan Ho, Andrea Tagliasacchi, and Mohammad Norouzi. Novel view synthesis with diffusion models. InThe Eleventh International Conference on Learning Representations, 2022

work page 2022

[61] [61]

Generative novel view synthesis with 3d-aware diffusion models

Eric R Chan, Koki Nagano, Matthew A Chan, Alexander W Bergman, Jeong Joon Park, Axel Levy, Miika Aittala, Shalini De Mello, Tero Karras, and Gordon Wetzstein. Generative novel view synthesis with 3d-aware diffusion models. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 4217–4229, 2023

work page 2023

[62] [62]

Dreamfusion: Text-to-3d using 2d diffusion

Ben Poole, Ajay Jain, Jonathan T Barron, and Ben Mildenhall. Dreamfusion: Text-to-3d using 2d diffusion. In The Eleventh International Conference on Learning Representations

work page

[63] [63]

Zero-1-to-3: Zero-shot one image to 3d object

Ruoshi Liu, Rundi Wu, Basile Van Hoorick, Pavel Tokmakov, Sergey Zakharov, and Carl Vondrick. Zero-1-to-3: Zero-shot one image to 3d object. InProceedings of the IEEE/CVF international conference on computer vision, pages 9298–9309, 2023

work page 2023

[64] [64]

Self-learning transformations for improving gaze and head redirection.Advances in Neural Information Processing Systems, 33:13127–13138, 2020

Yufeng Zheng, Seonwook Park, Xucong Zhang, Shalini De Mello, and Otmar Hilliges. Self-learning transformations for improving gaze and head redirection.Advances in Neural Information Processing Systems, 33:13127–13138, 2020

work page 2020

[65] [65]

Few-shot adaptive gaze estimation

Seonwook Park, Shalini De Mello, Pavlo Molchanov, Umar Iqbal, Otmar Hilliges, and Jan Kautz. Few-shot adaptive gaze estimation. InProceedings of the IEEE/CVF international conference on computer vision, pages 9368–9377, 2019

work page 2019

[66] [66]

Photo-realistic monocular gaze redirection using generative adversarial networks

Zhe He, Adrian Spurr, Xucong Zhang, and Otmar Hilliges. Photo-realistic monocular gaze redirection using generative adversarial networks. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 6932–6941, 2019

work page 2019

[67] [67]

A hierarchical generative model for eye image synthesis and eye gaze estimation

Kang Wang, Rui Zhao, and Qiang Ji. A hierarchical generative model for eye image synthesis and eye gaze estimation. InProceedings of the IEEE conference on computer vision and pattern recognition, pages 440–448, 2018. 14

work page 2018

[68] [68]

Controllable continuous gaze redirection

Weihao Xia, Yujiu Yang, Jing-Hao Xue, and Wensen Feng. Controllable continuous gaze redirection. InProceedings of the 28th ACM International Conference on Multimedia, pages 1782–1790, 2020

work page 2020

[69] [69]

Gaze manipulation for one-to-one teleconferencing

Criminisi, Shotton, Blake, and Torr. Gaze manipulation for one-to-one teleconferencing. InProceedings Ninth IEEE International Conference on Computer Vision, pages 191–198. IEEE, 2003

work page 2003

[70] [70]

Gaze correction with a single webcam

Dominik Giger, Jean-Charles Bazin, Claudia Kuster, Tiberiu Popa, and Markus Gross. Gaze correction with a single webcam. In2014 IEEE International Conference on Multimedia and Expo (ICME), pages 1–6. IEEE, 2014

work page 2014

[71] [71]

Gaze correction for home video conferencing.ACM Transactions on Graphics (TOG), 31(6):1–6, 2012

Claudia Kuster, Tiberiu Popa, Jean-Charles Bazin, Craig Gotsman, and Markus Gross. Gaze correction for home video conferencing.ACM Transactions on Graphics (TOG), 31(6):1–6, 2012

work page 2012

[72] [72]

Learning to look up: Realtime monocular gaze correction using machine learning

Daniil Kononenko and Victor Lempitsky. Learning to look up: Realtime monocular gaze correction using machine learning. InProceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 4667–4675, 2015

work page 2015

[73] [73]

Eyeopener: Editing eyes in the wild.ACM Transactions on Graphics (TOG), 36(1):1–13, 2016

Zhixin Shu, Eli Shechtman, Dimitris Samaras, and Sunil Hadap. Eyeopener: Editing eyes in the wild.ACM Transactions on Graphics (TOG), 36(1):1–13, 2016

work page 2016

[74] [74]

Deepwarp: Photorealistic image resynthesis for gaze manipulation

Yaroslav Ganin, Daniil Kononenko, Diana Sungatullina, and Victor Lempitsky. Deepwarp: Photorealistic image resynthesis for gaze manipulation. InEuropean conference on computer vision, pages 311–326. Springer, 2016

work page 2016

[75] [75]

Learning a model of facial shape and expression from 4d scans.ACM Trans

Tianye Li, Timo Bolkart, Michael J Black, Hao Li, and Javier Romero. Learning a model of facial shape and expression from 4d scans.ACM Trans. Graph., 36(6):194–1, 2017

work page 2017

[76] [76]

A morphable model for the synthesis of 3d faces

Volker Blanz and Thomas Vetter. A morphable model for the synthesis of 3d faces. InSeminal Graphics Papers: Pushing the Boundaries, Volume 2, pages 157–164, 2023

work page 2023

[77] [77]

Combining 3d morphable models: A large scale face-and-head model

Stylianos Ploumpis, Haoyang Wang, Nick Pears, William AP Smith, and Stefanos Zafeiriou. Combining 3d morphable models: A large scale face-and-head model. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 10934–10943, 2019

work page 2019

[78] [78]

Gaussianhead: High-fidelity head avatars with learnable gaussian derivation.IEEE Transactions on Visualization and Computer Graphics, 2025

Jie Wang, Jiu-Cheng Xie, Xianyan Li, Feng Xu, Chi-Man Pun, and Hao Gao. Gaussianhead: High-fidelity head avatars with learnable gaussian derivation.IEEE Transactions on Visualization and Computer Graphics, 2025

work page 2025

[79] [79]

Flashavatar: High-fidelity head avatar with efficient gaussian embedding

Jun Xiang, Xuan Gao, Yudong Guo, and Juyong Zhang. Flashavatar: High-fidelity head avatar with efficient gaussian embedding. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1802–1812, 2024

work page 2024

[80] [80]

Hq3davatar: High-quality implicit 3d head avatar.ACM Transactions on Graphics, 43(3):1–24, 2024

Kartik Teotia, Mallikarjun B R, Xingang Pan, Hyeongwoo Kim, Pablo Garrido, Mohamed Elgharib, and Christian Theobalt. Hq3davatar: High-quality implicit 3d head avatar.ACM Transactions on Graphics, 43(3):1–24, 2024

work page 2024