pith. sign in

arxiv: 1804.03786 · v3 · pith:QNG5LC3Znew · submitted 2018-04-11 · 💻 cs.CV

Nonlinear 3D Face Morphable Model

classification 💻 cs.CV
keywords facemodelshapetexturenonlinearfacialimageimages
0
0 comments X
read the original abstract

As a classic statistical model of 3D facial shape and texture, 3D Morphable Model (3DMM) is widely used in facial analysis, e.g., model fitting, image synthesis. Conventional 3DMM is learned from a set of well-controlled 2D face images with associated 3D face scans, and represented by two sets of PCA basis functions. Due to the type and amount of training data, as well as the linear bases, the representation power of 3DMM can be limited. To address these problems, this paper proposes an innovative framework to learn a nonlinear 3DMM model from a large set of unconstrained face images, without collecting 3D face scans. Specifically, given a face image as input, a network encoder estimates the projection, shape and texture parameters. Two decoders serve as the nonlinear 3DMM to map from the shape and texture parameters to the 3D shape and texture, respectively. With the projection parameter, 3D shape, and texture, a novel analytically-differentiable rendering layer is designed to reconstruct the original input face. The entire network is end-to-end trainable with only weak supervision. We demonstrate the superior representation power of our nonlinear 3DMM over its linear counterpart, and its contribution to face alignment and 3D reconstruction.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. 3D Human Face Reconstruction with 3DMM face model from RGB image

    cs.CV 2026-05 unverdicted novelty 1.0

    The authors implement and document a standard 3DMM-based monocular face reconstruction pipeline that regresses shape, expression, and pose parameters from one RGB image.