Real-time Facial Surface Geometry from Monocular Video on Mobile GPUs
pith:F6KVF4TH
Abstract
We present an end-to-end neural network-based model for inferring an approximate 3D mesh representation of a human face from single camera input for AR applications. The relatively dense mesh model of 468 vertices is well-suited for face-based AR effects. The proposed model demonstrates super-realtime inference speed on mobile GPUs (100-1000+ FPS, depending on the device and model variant) and a high prediction quality that is comparable to the variance in manual annotations of the same image.
Forward citations
Cited by 3 Pith papers
- Markerless Head Tracking for Accurate and Accessible Neuronavigation
  Markerless multi-camera head tracking achieves 2.32 mm and 2.01° median accuracy versus marker-based systems in 50 subjects, sufficient for transcranial magnetic stimulation.
- Artificial Intelligence can Recognize Whether a Job Applicant is Selling and/or Lying According to Facial Expressions and Head Movements Much More Correctly Than Human Interviewers
  Deep learning models analyzing temporal facial expressions and head movements in interview videos explained 91% and 84% of variance in self-reported honest and deceptive impression management, outperforming human inte...
- Emotion-Conditioned Short-Horizon Human Pose Forecasting with a Lightweight Predictive World Model
  Facial emotion embeddings improve short-term pose forecasting accuracy for emotion-driven motions when fused via normalized gating in a lightweight LSTM world model, but not with simple multimodal fusion.