pith. sign in

arxiv: 2409.17576 · v2 · pith:YV43LVMHnew · submitted 2024-09-26 · 💻 cs.CV

ID³: Identity-Preserving-yet-Diversified Diffusion Models for Synthetic Face Recognition

classification 💻 cs.CV
keywords facediversityidentitymodelsrecognitionid-preservingsynthetictext
0
0 comments X
read the original abstract

Synthetic face recognition (SFR) aims to generate synthetic face datasets that mimic the distribution of real face data, which allows for training face recognition models in a privacy-preserving manner. Despite the remarkable potential of diffusion models in image generation, current diffusion-based SFR models struggle with generalization to real-world faces. To address this limitation, we outline three key objectives for SFR: (1) promoting diversity across identities (inter-class diversity), (2) ensuring diversity within each identity by injecting various facial attributes (intra-class diversity), and (3) maintaining identity consistency within each identity group (intra-class identity preservation). Inspired by these goals, we introduce a diffusion-fueled SFR model termed $\text{ID}^3$. $\text{ID}^3$ employs an ID-preserving loss to generate diverse yet identity-consistent facial appearances. Theoretically, we show that minimizing this loss is equivalent to maximizing the lower bound of an adjusted conditional log-likelihood over ID-preserving data. This equivalence motivates an ID-preserving sampling algorithm, which operates over an adjusted gradient vector field, enabling the generation of fake face recognition datasets that approximate the distribution of real-world faces. Extensive experiments across five challenging benchmarks validate the advantages of $\text{ID}^3$.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. UniEmo: Unifying Emotional Understanding and Generation with Learnable Expert Queries

    cs.CV 2025-07 unverdicted novelty 6.0

    UniEmo unifies emotional understanding and generation by extracting multi-scale features via learnable expert queries, guiding diffusion-based image generation, and using dual feedback to improve both tasks.

  2. SteerFace: Debiasing Synthetic Face Generation via Adaptive Residue Perturbation

    cs.CV 2026-05 unverdicted novelty 5.0

    SteerFace perturbs identity embeddings toward random orthogonal directions on the hypersphere with an adaptive strategy to mitigate visual tendency in synthetic faces and improve downstream recognition performance.