GazePrior: Zero-Shot AR/VR Eye Tracking via Learned 3D Gaze Reconstruction
Pith reviewed 2026-05-22 06:20 UTC · model grok-4.3
The pith
A learned 3D prior on eye appearance and gaze lets researchers synthesize realistic training data for eye trackers on any new AR/VR device without collecting real examples from it.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
GazePrior is a data-driven 3D prior that models the distribution of human eyes across diverse identities, gaze directions, and light settings. It enables sparse-input 3D reconstruction of annotated data collected with previous ET devices, which can then be rendered from the cameras of any target ET device, producing synthetic training data that combines the realism, diversity, and ground-truth accuracy of real collection without its prohibitive costs.
What carries the argument
GazePrior, a learned 3D prior on eye identity, gaze direction, and illumination that supports reconstruction from limited views followed by novel-view rendering for arbitrary camera geometries.
If this is right
- Eye-tracking models trained on the synthesized data outperform previous zero-shot methods in both accuracy and robustness.
- Ground-truth gaze labels remain available because they are carried through the 3D reconstruction and rendering steps.
- The same prior can be used to generate data for any target device simply by specifying its camera parameters, removing the need for new real recordings.
Where Pith is reading between the lines
- The reconstruction-plus-render pipeline may prove more robust to domain shift than purely image-based synthesis techniques when device optics change substantially.
- Extending the same prior to additional facial landmarks could support joint eye-and-face tracking without separate data collection campaigns.
Load-bearing premise
The 3D prior learned from data collected with previous eye-tracking devices will generalize across new device camera geometries, lighting conditions, and user populations without requiring any real data from the target device.
What would settle it
Train an eye-tracking model on GazePrior-synthesized images for a specific new device, then compare its gaze-estimation error on real test images from that same device against the error of a model trained on actual recordings from the device; a large gap in favor of the real-data model would falsify the central claim.
read the original abstract
Eye tracking (ET) is a foundational technology for advanced AR/VR applications. However, training ET models for every new ET device is challenging: real data collection is costly and time-consuming, while existing synthetic data generation methods lack realism. To remove the need for additional data collection while maintaining data quality, we introduce a data-driven 3D prior that models the distribution of human eyes across diverse identities, gaze directions, and light settings. This model, which we coin GazePrior, then enables sparse-input 3D reconstruction of annotated data collected with previous ET devices, which can in turn be rendered from the cameras of any target ET device. Our approach synthesizes data with the realism, diversity and ground-truth accuracy of real data collection without its prohibitive costs. Our experiments demonstrate that ET models trained with our synthesized data outperform previous zero-shot methods, achieving higher accuracy and robustness.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces GazePrior, a learned 3D prior over human eye appearance, shape, and gaze that is trained on data from existing eye-tracking devices. This prior is used to perform sparse-input 3D reconstruction of annotated eye images collected on prior hardware; the resulting 3D models are then re-rendered from the camera geometries, intrinsics, and illumination of a new target device to produce synthetic training data. Eye-tracking models trained on this synthesized data are reported to outperform previous zero-shot baselines in accuracy and robustness.
Significance. If the central claim holds, the work would provide a practical route to zero-shot deployment of eye tracking on new AR/VR hardware, removing the need for per-device real-data collection while retaining the photometric and geometric fidelity required for high-accuracy gaze estimation. The approach directly addresses a recurring deployment bottleneck in the field.
major comments (2)
- [§4 and Table 2] §4 (Experiments) and Table 2: the reported outperformance over prior zero-shot methods is stated without accompanying quantitative metrics, error distributions, dataset sizes, or ablation studies on the number of input views or lighting conditions; this makes it impossible to assess whether the gains are robust or sensitive to post-hoc hyper-parameter choices.
- [§3.2] §3.2 (3D Reconstruction and Rendering): the claim that the GazePrior captures sufficient variation to generalize across novel camera intrinsics, lens distortion, and IR illumination patterns is load-bearing for the zero-shot claim, yet the manuscript provides no explicit test (e.g., cross-device reconstruction error or photometric consistency metrics) that isolates the domain gap introduced by re-rendering under unseen spectral responses and distortion models.
minor comments (2)
- [Figure 3] Figure 3: the caption does not specify the number of input views or the exact camera parameters used for the qualitative re-rendering examples, making it difficult to reproduce the visualization.
- [§3.1] Notation in §3.1: the symbol for the learned prior distribution is introduced without an explicit statement of its dimensionality or conditioning variables (identity, gaze, lighting).
Simulated Author's Rebuttal
We thank the referee for their constructive comments on our work. We address each of the major comments in detail below and have updated the manuscript accordingly to improve clarity and provide additional supporting evidence.
read point-by-point responses
-
Referee: [§4 and Table 2] §4 (Experiments) and Table 2: the reported outperformance over prior zero-shot methods is stated without accompanying quantitative metrics, error distributions, dataset sizes, or ablation studies on the number of input views or lighting conditions; this makes it impossible to assess whether the gains are robust or sensitive to post-hoc hyper-parameter choices.
Authors: We agree that providing more detailed quantitative analysis would enhance the evaluation section. In the revised version, we have expanded Table 2 to include mean and standard deviation of angular errors, as well as dataset sizes used for training and testing. Additionally, we have included error distribution plots in Section 4 and performed ablations on the number of input views (1, 2, 3, and 5 views) and different lighting conditions. These ablations confirm that the performance improvements are consistent and not sensitive to specific hyper-parameter choices. The gains over baselines remain significant across all tested configurations. revision: yes
-
Referee: [§3.2] §3.2 (3D Reconstruction and Rendering): the claim that the GazePrior captures sufficient variation to generalize across novel camera intrinsics, lens distortion, and IR illumination patterns is load-bearing for the zero-shot claim, yet the manuscript provides no explicit test (e.g., cross-device reconstruction error or photometric consistency metrics) that isolates the domain gap introduced by re-rendering under unseen spectral responses and distortion models.
Authors: We recognize the importance of explicitly validating the generalization capability of GazePrior to unseen device parameters. To address this, we have added new experiments in the revised Section 3.2, including cross-device reconstruction error metrics and photometric consistency measures (such as PSNR between synthesized and real images from the target device). These results demonstrate that the prior effectively captures the necessary variations, with low reconstruction errors even when re-rendering under novel intrinsics, distortions, and illumination patterns. This supports the robustness of our zero-shot approach. revision: yes
Circularity Check
No significant circularity; derivation is self-contained
full rationale
The paper introduces a learned 3D prior (GazePrior) trained on data from prior eye-tracking devices, then uses sparse-input reconstruction and novel-view rendering to synthesize training data for unseen target devices. No equations, fitted parameters, or self-citations are presented that reduce the claimed outperformance to a quantity defined by construction from the same inputs. The central claims rest on empirical generalization of the prior across device geometries and the results of downstream experiments, which remain independently falsifiable and do not collapse into tautology or renaming of known patterns.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We introduce a data-driven 3D prior that models the distribution of human eyes across diverse identities, gaze directions, and light settings... conditional radiance field... latent-vector representation
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Gaze Latent Vector... sinusoidal encoding... Subject Latent Vector... Lighting Latent Vector
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
HTC. Vive pro eye. https://www.vive.com/sea/product/vive-pro-eye/overview/, 2019. VR headset with integrated eye tracking
work page 2019
-
[2]
Quest pro.https://www.meta.com/quest/quest-pro/, 2022
Meta. Quest pro.https://www.meta.com/quest/quest-pro/, 2022. Eye-tracking VR headset
work page 2022
-
[3]
Microsoft. Hololens 2. https://www.microsoft.com/en-us/hololens/buy, 2019. Mixed reality headset with eye tracking
work page 2019
-
[4]
Apple vision pro.https://www.apple.com/apple-vision-pro/, 2024
Apple Inc. Apple vision pro.https://www.apple.com/apple-vision-pro/, 2024. Mixed reality headset with eye and hand tracking
work page 2024
-
[5]
Project Aria: A New Tool for Egocentric Multi-Modal AI Research
Jakob Engel, Kiran Somasundaram, Michael Goesele, Albert Sun, Alexander Gamino, Andrew Turner, Arjang Talattof, Arnie Yuan, Bilal Souti, Brighid Meredith, et al. Project aria: A new tool for egocentric multi-modal ai research.arXiv preprint arXiv:2308.13561, 2023
work page internal anchor Pith review Pith/arXiv arXiv 2023
-
[6]
Preface: A data-driven volumetric prior for few-shot ultra high-resolution face synthesis
Marcel C Bühler, Kripasindhu Sarkar, Tanmay Shah, Gengyan Li, Daoye Wang, Leonhard Helminger, Sergio Orts-Escolano, Dmitry Lagun, Otmar Hilliges, Thabo Beeler, et al. Preface: A data-driven volumetric prior for few-shot ultra high-resolution face synthesis. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 3402–3413, 2023
work page 2023
-
[7]
Cafca: High-quality novel view synthesis of expressive faces from casual few-shot captures
Marcel C Buehler, Gengyan Li, Erroll Wood, Leonhard Helminger, Xu Chen, Tanmay Shah, Daoye Wang, Stephan Garbin, Sergio Orts-Escolano, Otmar Hilliges, et al. Cafca: High-quality novel view synthesis of expressive faces from casual few-shot captures. InSIGGRAPH Asia 2024 Conference Papers, pages 1–12, 2024
work page 2024
-
[8]
Nonrigid Structure from Motion in Trajectory Space
Jonas Kulhanek and Torsten Sattler. Nonrigid Structure from Motion in Trajectory Space. InAdvances in Neural Information Processing Systems, 2025
work page 2025
-
[9]
Gazenerf: 3d-aware gaze redirection with neural radiance fields
Alessandro Ruzzi, Xiangwei Shi, Xi Wang, Gengyan Li, Shalini De Mello, Hyung Jin Chang, Xucong Zhang, and Otmar Hilliges. Gazenerf: 3d-aware gaze redirection with neural radiance fields. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 9676–9685, 2023
work page 2023
-
[10]
What we need is explicit controllability: Training 3d gaze estimator using only facial images
Tingwei Li, Jun Bao, Zhenzhong Kuang, and Buyu Liu. What we need is explicit controllability: Training 3d gaze estimator using only facial images. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 11414–11424, 2025
work page 2025
-
[11]
Esther YH Lin, Yimin Ding, Jogendra Kundu, Yatong An, Mohamed T El-Haddad, and Alexander Fix. Dig- itally prototype your eye tracker: Simulating hardware performance using 3d synthetic data.arXiv preprint arXiv:2503.16742, 2025. 11
-
[12]
Gazegaussian: High-fidelity gaze redirection with 3d gaussian splatting
Xiaobao Wei, Peng Chen, Guangyu Li, Ming Lu, Hui Chen, and Feng Tian. Gazegaussian: High-fidelity gaze redirection with 3d gaussian splatting. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 13293–13303, 2025
work page 2025
-
[13]
Learning to find eye region landmarks for remote gaze estimation in unconstrained settings
Seonwook Park, Xucong Zhang, Andreas Bulling, and Otmar Hilliges. Learning to find eye region landmarks for remote gaze estimation in unconstrained settings. InProceedings of the 2018 ACM symposium on eye tracking research & applications, pages 1–10, 2018
work page 2018
-
[14]
Jiahui Liu, Jiannan Chi, Wenxue Hu, and Zhiliang Wang. 3d model-based gaze tracking via iris features with a single camera and a single light source.IEEE Transactions on Human-Machine Systems, 51(2):75–86, 2020
work page 2020
-
[15]
Conny Lu, Praneeth Chakravarthula, Kaihao Liu, Xixiang Liu, Siyuan Li, and Henry Fuchs. Neural 3d gaze: 3d pupil localization and gaze tracking based on anatomical eye model and neural refraction correction. In2022 IEEE international symposium on mixed and augmented reality (ISMAR), pages 375–383. IEEE, 2022
work page 2022
-
[16]
Appearance-based gaze estimation in the wild
Xucong Zhang, Yusuke Sugano, Mario Fritz, and Andreas Bulling. Appearance-based gaze estimation in the wild. InProceedings of the IEEE conference on computer vision and pattern recognition, pages 4511–4520, 2015
work page 2015
-
[17]
Person independent 3d gaze estimation from remote rgb-d cameras
Kenneth Alberto Funes Mora and Jean-Marc Odobez. Person independent 3d gaze estimation from remote rgb-d cameras. In2013 IEEE International Conference on Image Processing, pages 2787–2791. IEEE, 2013
work page 2013
-
[18]
Kenneth A Funes-Mora and Jean-Marc Odobez. Gaze estimation in the 3d space using rgb-d sensors: towards head-pose and user invariance.International Journal of Computer Vision, 118(2):194–216, 2016
work page 2016
-
[19]
Kenneth Alberto Funes Mora, Florent Monay, and Jean-Marc Odobez. Eyediap: A database for the development and evaluation of gaze estimation algorithms from rgb and rgb-d cameras. InProceedings of the symposium on eye tracking research and applications, pages 255–258, 2014
work page 2014
-
[20]
A 3d morphable eye region model for gaze estimation
Erroll Wood, Tadas Baltrušaitis, Louis-Philippe Morency, Peter Robinson, and Andreas Bulling. A 3d morphable eye region model for gaze estimation. InEuropean conference on computer vision, pages 297–313. Springer, 2016
work page 2016
-
[21]
Gazedirector: Fully articulated eye gaze redirection in video
Erroll Wood, Tadas Baltrušaitis, Louis-Philippe Morency, Peter Robinson, and Andreas Bulling. Gazedirector: Fully articulated eye gaze redirection in video. InComputer Graphics Forum. Wiley Online Library, 2018
work page 2018
-
[22]
3dgazenet: Generalizing 3d gaze estimation with weak-supervision from synthetic views
Evangelos Ververas, Polydefkis Gkagkos, Jiankang Deng, Michail Christos Doukas, Jia Guo, and Stefanos Zafeiriou. 3dgazenet: Generalizing 3d gaze estimation with weak-supervision from synthetic views. InEuropean Conference on Computer Vision, pages 387–404. Springer, 2024
work page 2024
-
[23]
Towards an accurate 3d deformable eye model for gaze estimation
Chenyi Kuang, Jeffery O Kephart, and Qiang Ji. Towards an accurate 3d deformable eye model for gaze estimation. InInternational Conference on Pattern Recognition, pages 109–123. Springer, 2022
work page 2022
-
[24]
Accurate real-time 3d gaze tracking using a lightweight eyeball calibration
Quan Wen, Derek Bradley, Thabo Beeler, Seonwook Park, Otmar Hilliges, Junhai Yong, and Feng Xu. Accurate real-time 3d gaze tracking using a lightweight eyeball calibration. InComputer Graphics Forum. Wiley Online Library, 2020
work page 2020
-
[25]
Feng Lu, Takahiro Okabe, Yusuke Sugano, and Yoichi Sato. Learning gaze biases with head motion for head pose-free gaze estimation.Image and Vision Computing, 32(3):169–179, 2014
work page 2014
-
[26]
Feng Lu, Yusuke Sugano, Takahiro Okabe, and Yoichi Sato. Gaze estimation from eye appearance: A head pose-free method via eye image synthesis.IEEE Transactions on Image Processing, 24(11):3680–3693, 2015
work page 2015
-
[27]
Inferring human gaze from appearance via adaptive linear regression
Feng Lu, Yusuke Sugano, Takahiro Okabe, and Yoichi Sato. Inferring human gaze from appearance via adaptive linear regression. In2011 International Conference on Computer Vision, pages 153–160. IEEE, 2011
work page 2011
-
[28]
Eth-xgaze: A large scale dataset for gaze estimation under extreme head pose and gaze variation
Xucong Zhang, Seonwook Park, Thabo Beeler, Derek Bradley, Siyu Tang, and Otmar Hilliges. Eth-xgaze: A large scale dataset for gaze estimation under extreme head pose and gaze variation. InEuropean conference on computer vision, pages 365–381. Springer, 2020
work page 2020
-
[29]
Xucong Zhang, Yusuke Sugano, Mario Fritz, and Andreas Bulling. Mpiigaze: Real-world dataset and deep appearance-based gaze estimation.IEEE transactions on pattern analysis and machine intelligence, 41(1):162–175, 2017
work page 2017
-
[30]
Kyle Krafka, Aditya Khosla, Petr Kellnhofer, Harini Kannan, Suchendra Bhandarkar, Wojciech Matusik, and Antonio Torralba. Eye tracking for everyone. InProceedings of the IEEE conference on computer vision and pattern recognition, pages 2176–2184, 2016. 12
work page 2016
-
[31]
Rendering of eyes for eye-shape registration and gaze estimation
Erroll Wood, Tadas Baltrusaitis, Xucong Zhang, Yusuke Sugano, Peter Robinson, and Andreas Bulling. Rendering of eyes for eye-shape registration and gaze estimation. InProceedings of the IEEE international conference on computer vision, pages 3756–3764, 2015
work page 2015
-
[32]
Learning an appearance-based gaze estimator from one million synthesised images
Erroll Wood, Tadas Baltrušaitis, Louis-Philippe Morency, Peter Robinson, and Andreas Bulling. Learning an appearance-based gaze estimator from one million synthesised images. InProceedings of the ninth biennial ACM symposium on eye tracking research & applications, pages 131–138, 2016
work page 2016
-
[33]
Gazegene: Large-scale synthetic gaze dataset with 3d eyeball annotations
Yiwei Bao, Zhiming Wang, and Feng Lu. Gazegene: Large-scale synthetic gaze dataset with 3d eyeball annotations. InProceedings of the Computer Vision and Pattern Recognition Conference, pages 18749–18759, 2025
work page 2025
-
[34]
Ga3ce: Unconstrained 3d gaze estimation with gaze-aware 3d context encoding
Yuki Kawana, Shintaro Shiba, Quan Kong, and Norimasa Kobori. Ga3ce: Unconstrained 3d gaze estimation with gaze-aware 3d context encoding. InProceedings of the Computer Vision and Pattern Recognition Conference, pages 3081–3090, 2025
work page 2025
-
[35]
Enhancing 3d gaze estimation in the wild using weak supervision with gaze following labels
Pierre Vuillecard and Jean-Marc Odobez. Enhancing 3d gaze estimation in the wild using weak supervision with gaze following labels. InProceedings of the Computer Vision and Pattern Recognition Conference, pages 13508–13518, 2025
work page 2025
-
[36]
Eye tracking in virtual reality.Journal of eye movement research, 12(1):10–16910, 2019
Viviane Clay, Peter König, and Sabine Koenig. Eye tracking in virtual reality.Journal of eye movement research, 12(1):10–16910, 2019
work page 2019
-
[37]
Alexander Plopski, Teresa Hirzle, Nahal Norouzi, Long Qian, Gerd Bruder, and Tobias Langlotz. The eye in extended reality: A survey on gaze interaction and eye tracking in head-worn extended reality.ACM Computing Surveys (CSUR), 55(3):1–39, 2022
work page 2022
-
[38]
Anjul Patney, Marco Salvi, Joohwan Kim, Anton Kaplanyan, Chris Wyman, Nir Benty, David Luebke, and Aaron Lefohn. Towards foveated rendering for gaze-tracked virtual reality.ACM Transactions On Graphics (TOG), 35 (6):1–12, 2016
work page 2016
-
[39]
Gaze+ pinch interaction in virtual reality
Ken Pfeuffer, Benedikt Mayer, Diako Mardanbegi, and Hans Gellersen. Gaze+ pinch interaction in virtual reality. InProceedings of the 5th symposium on spatial user interaction, pages 99–108, 2017
work page 2017
-
[40]
Aunnoy K Mutasim, Anil Ufuk Batmaz, and Wolfgang Stuerzlinger. Pinch, click, or dwell: Comparing different selection techniques for eye-gaze-based pointing in virtual reality. InAcm symposium on eye tracking research and applications, pages 1–7, 2021
work page 2021
-
[41]
Björn Braun, Rayan Armani, Manuel Meier, Max Moebus, and Christian Holz. egoppg: Heart rate estimation from eye-tracking cameras in egocentric systems to benefit downstream vision tasks. InProceedings of the IEEE/CVF International Conference on Computer Vision, 2023
work page 2023
-
[42]
Challenges of eye tracking systems for mobile xr glasses
Injoon Hong, Kyeongryeol Bong, and Hoi-Jun Yoo. Challenges of eye tracking systems for mobile xr glasses. In Applications of Digital Image Processing XLI, volume 10752, pages 391–397. SPIE, 2018
work page 2018
-
[43]
Brendan David-John, Kevin Butler, and Eakta Jain. Privacy-preserving datasets of eye-tracking samples with applications in xr.IEEE Transactions on Visualization and Computer Graphics, 29(5):2774–2784, 2023
work page 2023
-
[44]
E-track: Eye tracking with event camera for extended reality (xr) applications
Nealson Li, Ashwin Bhat, and Arijit Raychowdhury. E-track: Eye tracking with event camera for extended reality (xr) applications. In2023 IEEE 5th International Conference on Artificial Intelligence Circuits and Systems (AICAS), pages 1–5. IEEE, 2023
work page 2023
-
[45]
Polarization-resolved imaging improves eye tracking.arXiv preprint arXiv:2511.04652, 2025
Mantas Žurauskas, Tom Bu, Sanaz Alali, Beyza Kalkanli, Derek Shi, Fernando Alamos, Gauresh Pandit, Christopher Mei, Ali Behrooz, Ramin Mirjalili, et al. Polarization-resolved imaging improves eye tracking.arXiv preprint arXiv:2511.04652, 2025
-
[46]
Ben Mildenhall, Pratul P Srinivasan, Matthew Tancik, Jonathan T Barron, Ravi Ramamoorthi, and Ren Ng. Nerf: Representing scenes as neural radiance fields for view synthesis.Communications of the ACM, 65(1):99–106, 2021
work page 2021
-
[47]
Thomas Müller, Alex Evans, Christoph Schied, and Alexander Keller. Instant neural graphics primitives with a multiresolution hash encoding.ACM transactions on graphics (TOG), 41(4):1–15, 2022
work page 2022
-
[48]
3d gaussian splatting for real-time radiance field rendering.ACM Trans
Bernhard Kerbl, Georgios Kopanas, Thomas Leimkühler, and George Drettakis. 3d gaussian splatting for real-time radiance field rendering.ACM Trans. Graph., 42(4):139–1, 2023
work page 2023
-
[49]
Mip-nerf 360: Unbounded anti-aliased neural radiance fields
Jonathan T Barron, Ben Mildenhall, Dor Verbin, Pratul P Srinivasan, and Peter Hedman. Mip-nerf 360: Unbounded anti-aliased neural radiance fields. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 5470–5479, 2022. 13
work page 2022
-
[50]
Zip-nerf: Anti-aliased grid-based neural radiance fields
Jonathan T Barron, Ben Mildenhall, Dor Verbin, Pratul P Srinivasan, and Peter Hedman. Zip-nerf: Anti-aliased grid-based neural radiance fields. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 19697–19705, 2023
work page 2023
-
[51]
Shakiba Kheradmand, Daniel Rebain, Gopal Sharma, Weiwei Sun, Yang-Che Tseng, Hossam Isack, Abhishek Kar, Andrea Tagliasacchi, and Kwang Moo Yi. 3d gaussian splatting as markov chain monte carlo.Advances in Neural Information Processing Systems, 37:80965–80986, 2024
work page 2024
-
[52]
3dgut: Enabling distorted cameras and secondary rays in gaussian splatting
Qi Wu, Janick Martinez Esturo, Ashkan Mirzaei, Nicolas Moenne-Loccoz, and Zan Gojcic. 3dgut: Enabling distorted cameras and secondary rays in gaussian splatting. InProceedings of the Computer Vision and Pattern Recognition Conference, pages 26036–26046, 2025
work page 2025
-
[53]
Scaffold-gs: Structured 3d gaussians for view-adaptive rendering
Tao Lu, Mulin Yu, Linning Xu, Yuanbo Xiangli, Limin Wang, Dahua Lin, and Bo Dai. Scaffold-gs: Structured 3d gaussians for view-adaptive rendering. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 20654–20664, 2024
work page 2024
-
[54]
Compressed 3d gaussian splatting for accelerated novel view synthesis
Simon Niedermayr, Josef Stumpfegger, and Rüdiger Westermann. Compressed 3d gaussian splatting for accelerated novel view synthesis. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 10349–10358, 2024
work page 2024
-
[55]
Saqib Javed, Ahmad Jarrar Khan, Corentin Dumery, Chen Zhao, and Mathieu Salzmann. Temporally compressed 3d gaussian splatting for dynamic scenes.arXiv preprint arXiv:2412.05700, 4, 2024
-
[56]
A view-consistent sampling method for regularized training of neural radiance fields
Aoxiang Fan, Corentin Dumery, Nicolas Talabot, and Pascal Fua. A view-consistent sampling method for regularized training of neural radiance fields. InInternational Conference on Computer Vision (ICCV), 2025
work page 2025
-
[57]
Nerfacc: Efficient sampling accelerates nerfs
Ruilong Li, Hang Gao, Matthew Tancik, and Angjoo Kanazawa. Nerfacc: Efficient sampling accelerates nerfs. In Proceedings of the IEEE/CVF international conference on computer vision, pages 18537–18546, 2023
work page 2023
-
[58]
Enforcing View-Consistency in Class-Agnostic 3D Segmentation Fields.CVPRW, 2025
Corentin Dumery, Aoxiang Fan, Ren Li, Nicolas Talabot, and Pascal Fua. Enforcing View-Consistency in Class-Agnostic 3D Segmentation Fields.CVPRW, 2025
work page 2025
-
[59]
Lerf: Language embedded radiance fields
Justin Kerr, Chung Min Kim, Ken Goldberg, Angjoo Kanazawa, and Matthew Tancik. Lerf: Language embedded radiance fields. InProceedings of the IEEE/CVF international conference on computer vision, pages 19729–19739, 2023
work page 2023
-
[60]
Novel view synthesis with diffusion models
Daniel Watson, William Chan, Ricardo Martin Brualla, Jonathan Ho, Andrea Tagliasacchi, and Mohammad Norouzi. Novel view synthesis with diffusion models. InThe Eleventh International Conference on Learning Representations, 2022
work page 2022
-
[61]
Generative novel view synthesis with 3d-aware diffusion models
Eric R Chan, Koki Nagano, Matthew A Chan, Alexander W Bergman, Jeong Joon Park, Axel Levy, Miika Aittala, Shalini De Mello, Tero Karras, and Gordon Wetzstein. Generative novel view synthesis with 3d-aware diffusion models. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 4217–4229, 2023
work page 2023
-
[62]
Dreamfusion: Text-to-3d using 2d diffusion
Ben Poole, Ajay Jain, Jonathan T Barron, and Ben Mildenhall. Dreamfusion: Text-to-3d using 2d diffusion. In The Eleventh International Conference on Learning Representations
-
[63]
Zero-1-to-3: Zero-shot one image to 3d object
Ruoshi Liu, Rundi Wu, Basile Van Hoorick, Pavel Tokmakov, Sergey Zakharov, and Carl Vondrick. Zero-1-to-3: Zero-shot one image to 3d object. InProceedings of the IEEE/CVF international conference on computer vision, pages 9298–9309, 2023
work page 2023
-
[64]
Yufeng Zheng, Seonwook Park, Xucong Zhang, Shalini De Mello, and Otmar Hilliges. Self-learning transformations for improving gaze and head redirection.Advances in Neural Information Processing Systems, 33:13127–13138, 2020
work page 2020
-
[65]
Few-shot adaptive gaze estimation
Seonwook Park, Shalini De Mello, Pavlo Molchanov, Umar Iqbal, Otmar Hilliges, and Jan Kautz. Few-shot adaptive gaze estimation. InProceedings of the IEEE/CVF international conference on computer vision, pages 9368–9377, 2019
work page 2019
-
[66]
Photo-realistic monocular gaze redirection using generative adversarial networks
Zhe He, Adrian Spurr, Xucong Zhang, and Otmar Hilliges. Photo-realistic monocular gaze redirection using generative adversarial networks. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 6932–6941, 2019
work page 2019
-
[67]
A hierarchical generative model for eye image synthesis and eye gaze estimation
Kang Wang, Rui Zhao, and Qiang Ji. A hierarchical generative model for eye image synthesis and eye gaze estimation. InProceedings of the IEEE conference on computer vision and pattern recognition, pages 440–448, 2018. 14
work page 2018
-
[68]
Controllable continuous gaze redirection
Weihao Xia, Yujiu Yang, Jing-Hao Xue, and Wensen Feng. Controllable continuous gaze redirection. InProceedings of the 28th ACM International Conference on Multimedia, pages 1782–1790, 2020
work page 2020
-
[69]
Gaze manipulation for one-to-one teleconferencing
Criminisi, Shotton, Blake, and Torr. Gaze manipulation for one-to-one teleconferencing. InProceedings Ninth IEEE International Conference on Computer Vision, pages 191–198. IEEE, 2003
work page 2003
-
[70]
Gaze correction with a single webcam
Dominik Giger, Jean-Charles Bazin, Claudia Kuster, Tiberiu Popa, and Markus Gross. Gaze correction with a single webcam. In2014 IEEE International Conference on Multimedia and Expo (ICME), pages 1–6. IEEE, 2014
work page 2014
-
[71]
Gaze correction for home video conferencing.ACM Transactions on Graphics (TOG), 31(6):1–6, 2012
Claudia Kuster, Tiberiu Popa, Jean-Charles Bazin, Craig Gotsman, and Markus Gross. Gaze correction for home video conferencing.ACM Transactions on Graphics (TOG), 31(6):1–6, 2012
work page 2012
-
[72]
Learning to look up: Realtime monocular gaze correction using machine learning
Daniil Kononenko and Victor Lempitsky. Learning to look up: Realtime monocular gaze correction using machine learning. InProceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 4667–4675, 2015
work page 2015
-
[73]
Eyeopener: Editing eyes in the wild.ACM Transactions on Graphics (TOG), 36(1):1–13, 2016
Zhixin Shu, Eli Shechtman, Dimitris Samaras, and Sunil Hadap. Eyeopener: Editing eyes in the wild.ACM Transactions on Graphics (TOG), 36(1):1–13, 2016
work page 2016
-
[74]
Deepwarp: Photorealistic image resynthesis for gaze manipulation
Yaroslav Ganin, Daniil Kononenko, Diana Sungatullina, and Victor Lempitsky. Deepwarp: Photorealistic image resynthesis for gaze manipulation. InEuropean conference on computer vision, pages 311–326. Springer, 2016
work page 2016
-
[75]
Learning a model of facial shape and expression from 4d scans.ACM Trans
Tianye Li, Timo Bolkart, Michael J Black, Hao Li, and Javier Romero. Learning a model of facial shape and expression from 4d scans.ACM Trans. Graph., 36(6):194–1, 2017
work page 2017
-
[76]
A morphable model for the synthesis of 3d faces
Volker Blanz and Thomas Vetter. A morphable model for the synthesis of 3d faces. InSeminal Graphics Papers: Pushing the Boundaries, Volume 2, pages 157–164, 2023
work page 2023
-
[77]
Combining 3d morphable models: A large scale face-and-head model
Stylianos Ploumpis, Haoyang Wang, Nick Pears, William AP Smith, and Stefanos Zafeiriou. Combining 3d morphable models: A large scale face-and-head model. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 10934–10943, 2019
work page 2019
-
[78]
Jie Wang, Jiu-Cheng Xie, Xianyan Li, Feng Xu, Chi-Man Pun, and Hao Gao. Gaussianhead: High-fidelity head avatars with learnable gaussian derivation.IEEE Transactions on Visualization and Computer Graphics, 2025
work page 2025
-
[79]
Flashavatar: High-fidelity head avatar with efficient gaussian embedding
Jun Xiang, Xuan Gao, Yudong Guo, and Juyong Zhang. Flashavatar: High-fidelity head avatar with efficient gaussian embedding. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1802–1812, 2024
work page 2024
-
[80]
Hq3davatar: High-quality implicit 3d head avatar.ACM Transactions on Graphics, 43(3):1–24, 2024
Kartik Teotia, Mallikarjun B R, Xingang Pan, Hyeongwoo Kim, Pablo Garrido, Mohamed Elgharib, and Christian Theobalt. Hq3davatar: High-quality implicit 3d head avatar.ACM Transactions on Graphics, 43(3):1–24, 2024
work page 2024
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.