GazePrior learns a 3D prior over eyes to synthesize realistic ground-truth data for training eye trackers on new devices without new real data collection.
Learning a model of facial shape and expression from 4d scans.ACM Trans
4 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 4verdicts
UNVERDICTED 4representative citing papers
Constructs continuous sign conversation data from isolated signs using retrieval and diffusion models to train a direct sign-to-sign conversational AI.
FFAvatar is a generalizable feed-forward framework that reconstructs high-quality animatable 3D Gaussian head avatars from few-shot unposed portrait images in seconds via Multi-View Query-Former and end-to-end FLAME prediction.
AudioFace improves speech-driven facial animation by guiding blendshape prediction with linguistic and articulatory information extracted via multimodal language models.
citing papers explorer
-
GazePrior: Zero-Shot AR/VR Eye Tracking via Learned 3D Gaze Reconstruction
GazePrior learns a 3D prior over eyes to synthesize realistic ground-truth data for training eye trackers on new devices without new real data collection.
-
Towards Continuous Sign Language Conversation from Isolated Signs
Constructs continuous sign conversation data from isolated signs using retrieval and diffusion models to train a direct sign-to-sign conversational AI.
-
FFAvatar: Few-Shot, Feed-Forward, and Generalizable Avatar Reconstruction
FFAvatar is a generalizable feed-forward framework that reconstructs high-quality animatable 3D Gaussian head avatars from few-shot unposed portrait images in seconds via Multi-View Query-Former and end-to-end FLAME prediction.
-
AudioFace: Language-Assisted Speech-Driven Facial Animation with Multimodal Language Models
AudioFace improves speech-driven facial animation by guiding blendshape prediction with linguistic and articulatory information extracted via multimodal language models.