Position: Age Estimation Models Do Not Process Biometric Data
Pith reviewed 2026-05-19 23:19 UTC · model grok-4.3
The pith
Age estimation models do not process biometric data because they cannot identify individuals.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Age estimation models cannot identify individuals. When 14 separate age estimators were evaluated on three standard face verification benchmarks, their performance remained orders of magnitude below identification thresholds, showing that identity-discriminative representations do not arise inside these networks during inference.
What carries the argument
Face verification benchmarks applied to age estimation networks to check whether identity-discriminative signals appear during inference.
If this is right
- Age estimation systems would not trigger consent obligations tied to biometric identification.
- Regulators could classify age estimators separately from template-based recognition tools.
- Developers could avoid storing identity templates while still estimating age.
- Transparency reports should include whether a model meets identification thresholds.
Where Pith is reading between the lines
- The same benchmark test could be applied to other attribute predictors such as gender or expression models.
- Future rules might focus on measurable output capability rather than input type alone.
- Repeating the evaluation on newer or larger models would provide a direct check on the result.
Load-bearing premise
That the chosen face verification benchmarks can detect any identity information the age model might process and that missing identification thresholds equals not processing biometric data under current legal definitions.
What would settle it
An age estimation model that reaches identification-level accuracy on one of the same face verification benchmarks used in the study.
Figures
read the original abstract
When a neural network estimates someone's age from a photograph, does it process biometric data? The answer depends on whether identity-discriminative representations arise within the network during inference, a question that may seem trivial to ML researchers but triggers consent requirements under GDPR, statutory damages under BIPA, or high-risk AI classification under the EU AI Act. Yet no regulatory guidance addresses it. This position paper provides empirical evidence: 14 models evaluated across 3 face verification benchmarks show age estimators fall orders of magnitude short of identification thresholds. Age estimation models cannot identify individuals. We call on researchers to provide transparency about what systems store and can do, and on regulators to distinguish transient processing from template storage.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper claims that age estimation neural networks do not process biometric data under frameworks such as GDPR, BIPA, and the EU AI Act. It supports this by reporting that 14 age-estimation models evaluated on three face-verification benchmarks perform orders of magnitude below identification thresholds, from which it concludes that no identity-discriminative representations arise during inference. The position paper calls for greater transparency about model capabilities and for regulators to distinguish transient processing from template storage.
Significance. If the empirical results and the sufficiency of the chosen benchmarks hold, the work supplies concrete, falsifiable measurements that could inform regulatory guidance on whether age-estimation systems trigger biometric consent or high-risk classification requirements. The provision of results across 14 models and three benchmarks is a strength that moves the discussion beyond purely definitional arguments.
major comments (2)
- [Empirical evaluation / Results] The central inference—that sub-threshold performance on the three face-verification benchmarks demonstrates the absence of any identity-discriminative representations—depends on the assumption that standard 1:1 verification protocols are sensitive enough to reveal such representations if they exist. The manuscript does not appear to report auxiliary probes (e.g., multi-layer embeddings, clustering, or training of auxiliary identity classifiers) that would test this assumption directly.
- [Discussion / Regulatory implications] The mapping from “falling orders of magnitude short of identification thresholds” to the regulatory conclusion that the models “do not process biometric data” requires explicit justification against the precise statutory language (GDPR Art. 4(14), BIPA §15, EU AI Act Annex III). The current text equates non-identification with non-processing without addressing whether transient, non-template representations could still qualify as biometric data under those definitions.
minor comments (2)
- [Abstract] The abstract states results from 14 models on 3 benchmarks but does not report the specific verification thresholds, statistical tests, or exclusion criteria used; these details should be summarized in the abstract for a position paper.
- [Throughout] Notation for the three benchmarks and the exact performance metric (e.g., EER, AUC, or rank-1 accuracy) should be defined at first use and kept consistent across tables and text.
Simulated Author's Rebuttal
We thank the referee for their constructive and detailed feedback on our position paper. We address each major comment below, providing clarifications on our empirical approach and regulatory analysis, and note revisions made to the manuscript.
read point-by-point responses
-
Referee: [Empirical evaluation / Results] The central inference—that sub-threshold performance on the three face-verification benchmarks demonstrates the absence of any identity-discriminative representations—depends on the assumption that standard 1:1 verification protocols are sensitive enough to reveal such representations if they exist. The manuscript does not appear to report auxiliary probes (e.g., multi-layer embeddings, clustering, or training of auxiliary identity classifiers) that would test this assumption directly.
Authors: We maintain that the standard 1:1 face verification protocols employed are the most direct and established method for assessing the presence of identity-discriminative representations, as these benchmarks are specifically designed and widely used to measure whether embeddings support individual distinction. The orders-of-magnitude shortfall observed consistently across 14 models and three benchmarks provides robust evidence that such representations are not arising. While auxiliary probes such as clustering or auxiliary identity classifiers could offer supplementary data, they are not required to support our inference given the scale of the performance gap; any identity information sufficient for identification would be expected to manifest on these primary benchmarks. We have added a short explanatory paragraph in the revised manuscript justifying the sufficiency of the chosen evaluation protocol. revision: partial
-
Referee: [Discussion / Regulatory implications] The mapping from “falling orders of magnitude short of identification thresholds” to the regulatory conclusion that the models “do not process biometric data” requires explicit justification against the precise statutory language (GDPR Art. 4(14), BIPA §15, EU AI Act Annex III). The current text equates non-identification with non-processing without addressing whether transient, non-template representations could still qualify as biometric data under those definitions.
Authors: We agree that greater precision strengthens the regulatory claims and have revised the discussion section accordingly. We now explicitly reference GDPR Article 4(14), which defines biometric data as personal data resulting from technical processing relating to physical characteristics 'which allow or confirm the unique identification of that natural person'. Our empirical results demonstrate that the models do not allow or confirm unique identification. Parallel mappings are provided for BIPA §15 and EU AI Act Annex III, with emphasis that the statutory focus is on identifiability rather than any facial feature processing. We further clarify that transient representations lacking identifiability—as shown by the benchmark results—fall outside these definitions, distinguishing them from template storage. revision: yes
Circularity Check
No circularity: central claim rests on direct empirical measurement against external benchmarks
full rationale
The paper advances its position through empirical evaluation of 14 age-estimation models on three independent face-verification benchmarks, reporting that performance falls orders of magnitude below identification thresholds. This constitutes external, falsifiable evidence rather than any derivation, fitted parameter, or self-citation that reduces to the paper's own inputs by construction. No load-bearing step equates a result to its premise via definition or renaming.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Identity-discriminative representations must arise during inference for an age estimation model to be considered processing biometric data under GDPR, BIPA, and EU AI Act.
Reference graph
Works this paper leans on
-
[1]
FedAffect: Few-shot federated learning for facial expression recognition,
An, X., Zhu, X., Gao, Y., Xiao, Y., Zhao, Y., Feng, Z., Wu, L., Qin, B., Zhang, M., Zhang, D., and Fu, Y. Partial FC : Training 10 million identities on a single machine. In 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), pp.\ 1445--1449, 2021. doi:10.1109/ICCVW54120.2021.00166
-
[2]
Opinion 3/2012 on developments in biometric technologies
Article 29 Working Party . Opinion 3/2012 on developments in biometric technologies. https://ec.europa.eu/justice/article-29/documentation/opinion-recommendation/files/2012/wp193_en.pdf, 2012. WP193
work page 2012
-
[3]
FaceLib : Detection, age gender estimation & recognition
Ayobi, S. FaceLib : Detection, age gender estimation & recognition. https://github.com/sajjjadayobi/FaceLib, 2020
work page 2020
-
[4]
H., Madotto, A., Wei, C., Ma, T., Zhi, J., Rajasegaran, J., Rasheed, H
Bolya, D., Huang, P.-Y., Sun, P., Cho, J. H., Madotto, A., Wei, C., Ma, T., Zhi, J., Rajasegaran, J., Rasheed, H. A., Wang, J., Monteiro, M., Xu, H., Dong, S., Ravi, N., Li, S.-W., Dollar, P., and Feichtenhofer, C. Perception encoder: The best visual embeddings are not at the output of the network. In The Thirty-ninth Annual Conference on Neural Informati...
work page 2026
-
[5]
CNIL . Cam\'eras augment\'ees pour estimer l'\^age dans les bureaux de tabac : la CNIL pr\'ecise sa position. https://www.cnil.fr/fr/cameras-augmentees-pour-estimer-lage-dans-les-bureaux-de-tabac-la-cnil-precise-sa-position, 2025. Position statement on AI age estimation cameras in tobacco shops
work page 2025
-
[6]
Arcface: Additive angular margin loss for deep face recognition
Deng, J., Guo, J., Xue, N., and Zafeiriou, S. ArcFace : Additive angular margin loss for deep face recognition. In 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.\ 4685--4694, 2019. doi:10.1109/CVPR.2019.00482
-
[7]
Guidelines 3/2019 on processing of personal data through video devices
EDPB . Guidelines 3/2019 on processing of personal data through video devices. https://www.edpb.europa.eu/our-work-tools/our-documents/guidelines/guidelines-32019-processing-personal-data-through-video_en, 2020. Version 2.0, adopted 29 January 2020
work page 2019
-
[8]
The French SA fines Clearview AI EUR 20 million
EDPB . The French SA fines Clearview AI EUR 20 million. https://www.edpb.europa.eu/news/national-news/2022/french-sa-fines-clearview-ai-eur-20-million_en, 2022. European Data Protection Board announcement of CNIL decision
work page 2022
-
[9]
Guidelines 05/2022 on the use of facial recognition technology in the area of law enforcement
EDPB . Guidelines 05/2022 on the use of facial recognition technology in the area of law enforcement. https://www.edpb.europa.eu/system/files/2023-05/edpb_guidelines_202304_frtlawenforcement_v2_en.pdf, 2023. Version 2.0, adopted 26 April 2023
work page 2022
-
[10]
Statement 1/2025 on age assurance
EDPB . Statement 1/2025 on age assurance. https://www.edpb.europa.eu/our-work-tools/our-documents/statements/statement-12025-age-assurance_en, 2025
work page 2025
-
[11]
European Commission . Commission implementing decision ( EU ) 2019/329 laying down the specifications for the quality, resolution and use of fingerprints and facial image for biometric verification and identification in the Entry/Exit System (EES) . https://eur-lex.europa.eu/eli/dec_impl/2019/329/oj, 2019
work page 2019
-
[12]
Digital omnibus regulation proposal
European Commission . Digital omnibus regulation proposal. https://digital-strategy.ec.europa.eu/en/library/digital-omnibus-regulation-proposal, 2025
work page 2025
-
[13]
European Parliament and Council . Regulation ( EU ) 2016/679 of the European Parliament and of the Council ( General Data Protection Regulation ). https://eur-lex.europa.eu/eli/reg/2016/679/oj, 2016
work page 2016
-
[14]
Regulation ( EU ) 2024/1689 laying down harmonised rules on artificial intelligence ( AI Act )
European Parliament and Council . Regulation ( EU ) 2024/1689 laying down harmonised rules on artificial intelligence ( AI Act ). https://eur-lex.europa.eu/eli/reg/2024/1689/oj, 2024
work page 2024
-
[15]
FIDO Alliance . FIDO biometrics requirements. https://fidoalliance.org/specs/biometric/requirements/, 2025. Version 4.1, January 2025
work page 2025
-
[16]
FTC . Policy statement of the Federal Trade Commission on biometric information and section 5 of the FTC act. https://www.ftc.gov/legal-library/browse/policy-statement-federal-trade-commission-biometric-information-section-5-federal-trade-commission, 2023
work page 2023
-
[17]
Deep residual learning for image recognition,
He, K., Zhang, X., Ren, S., and Sun, J. Deep residual learning for image recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.\ 770--778, 2016. doi:10.1109/CVPR.2016.90
-
[18]
B., Mattar, M., Berg, T., and Learned-Miller, E
Huang, G. B., Mattar, M., Berg, T., and Learned-Miller, E. Labeled faces in the wild: A database for studying face recognition in unconstrained environments. In Workshop on Faces in 'Real-Life' Images: Detection, Alignment, and Recognition, Marseille, France, 2008. Erik Learned-Miller and Andras Ferencz and Fr\'ed\'eric Jurie. URL https://inria.hal.scienc...
work page 2008
-
[19]
Regulatory sandbox final report: Yoti
ICO . Regulatory sandbox final report: Yoti . https://ico.org.uk/for-organisations/advice-and-services/regulatory-sandbox/previous-participants/2022/, 2022. Exit date: April 2022
work page 2022
-
[20]
Biometric data guidance: Biometric recognition
ICO . Biometric data guidance: Biometric recognition. https://ico.org.uk/for-organisations/uk-gdpr-guidance-and-resources/lawful-basis/biometric-data-guidance-biometric-recognition/, 2024
work page 2024
-
[21]
Biometric information privacy act, 740 ILCS 14
Illinois General Assembly . Biometric information privacy act, 740 ILCS 14. https://law.justia.com/codes/illinois/chapter-740/act-740-ilcs-14/, 2008
work page 2008
-
[22]
InsightFace : 2d and 3d face analysis project
InsightFace . InsightFace : 2d and 3d face analysis project. https://github.com/deepinsight/insightface, 2018
work page 2018
-
[23]
ISO/IEC . ISO/IEC 19795-1:2021 -- information technology -- biometric performance testing and reporting -- part 1: Principles and framework. https://www.iso.org/standard/73515.html, 2021
work page 2021
-
[24]
ISO/IEC . ISO/IEC 27566-1:2025 -- information security, cybersecurity and privacy protection -- age assurance systems -- part 1: Framework. https://www.iso.org/standard/88143.html, 2025
work page 2025
-
[25]
Facial recognition: Italian SA fines Clearview AI EUR 20 million
Italian Garante . Facial recognition: Italian SA fines Clearview AI EUR 20 million. https://www.gpdp.it/web/guest/home/docweb/-/docweb-display/docweb/9751362, 2022
-
[26]
Efficient attention: Attention with linear complexities
K\"arkk\"ainen, K. and Joo, J. FairFace : Face attribute dataset for balanced race, gender, and age for bias measurement and mitigation. In 2021 IEEE Winter Conference on Applications of Computer Vision (WACV), pp.\ 1547--1557, 2021. doi:10.1109/WACV48630.2021.00159
-
[27]
Minivision Technology . Silent-face-anti-spoofing. https://github.com/minivision-ai/Silent-Face-Anti-Spoofing, 2020
work page 2020
-
[28]
AgeDB : The first manually collected, in-the-wild age database
Moschoglou, S., Papaioannou, A., Sagonas, C., Deng, J., Kotsia, I., and Zafeiriou, S. AgeDB : The first manually collected, in-the-wild age database. In 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp.\ 1997--2005, 2017. doi:10.1109/CVPRW.2017.250
-
[29]
NIST special publication 800-63-4: Digital identity guidelines
NIST . NIST special publication 800-63-4: Digital identity guidelines. https://pages.nist.gov/800-63-4/, 2025. Supersedes SP 800-63-3
work page 2025
-
[30]
Ofcom fines porn company £1 million for not having robust age checks
Ofcom . Ofcom fines porn company £1 million for not having robust age checks. https://www.ofcom.org.uk/online-safety/protecting-children/ofcom-fines-porn-company-1million-for-not-having-robust-age-checks, 2025
work page 2025
-
[31]
Age-gender-prediction: Vision transformer for facial analysis
Sahoo, A. Age-gender-prediction: Vision transformer for facial analysis. https://huggingface.co/abhilash88/age-gender-prediction, 2025. ViT-based model trained on UTKFace dataset
work page 2025
-
[32]
M., Chellappa, R., and Jacobs, D
Sengupta, S., Chen, J.-C., Castillo, C., Patel, V. M., Chellappa, R., and Jacobs, D. W. Frontal to profile face verification in the wild. In 2016 IEEE Winter Conference on Applications of Computer Vision (WACV), pp.\ 1--9, 2016. doi:10.1109/WACV.2016.7477558
-
[33]
Serengil, S. I. DeepFace : A lightweight face recognition and facial attribute analysis framework. https://github.com/serengil/deepface, 2018
work page 2018
-
[34]
V., Seitzer, M., Baldassarre, F., Oquab, M., Jose, C., Khalidov, V., Szafraniec, M., Yi, S
Sim \'e oni, O., Vo, H. V., Seitzer, M., Baldassarre, F., Oquab, M., Jose, C., Khalidov, V., Szafraniec, M., Yi, S. E., Ramamonjisoa, M., Massa, F., Haziza, D., Wehrstedt, L., Wang, J., Darcet, T., Moutakanni, T., Sentana, L., Roberts, C., Vedaldi, A., Tolan, J., Brandt, J., Couprie, C., Mairal, J., Jegou, H., Labatut, P., and Bojanowski, P. DINOv3 . Tran...
work page 2026
- [35]
-
[36]
Sumsub identity verification platform
Sumsub . Sumsub identity verification platform. https://sumsub.com, 2026
work page 2026
-
[37]
Wightman, R. PyTorch Image Models . https://github.com/rwightman/pytorch-image-models
-
[38]
SSR-Net : A compact soft stagewise regression network for age estimation
Yang, T.-Y., Huang, Y.-H., Lin, Y.-Y., Hsiu, P.-C., and Chuang, Y.-Y. SSR-Net : A compact soft stagewise regression network for age estimation. In Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, IJCAI-18 , pp.\ 1078--1084. International Joint Conferences on Artificial Intelligence Organization, 2018. doi:10.249...
-
[39]
CoCa : Contrastive captioners are image-text foundation models
Yu, J., Wang, Z., Vasudevan, V., Yeung, L., Seyedhosseini, M., and Wu, Y. CoCa : Contrastive captioners are image-text foundation models. Transactions on Machine Learning Research, 2022. URL https://openreview.net/forum?id=Ee277P3AYC
work page 2022
-
[40]
yu4u. Age estimation pytorch. https://github.com/yu4u/age-estimation-pytorch, 2019. PyTorch implementation for age estimation using APPA-REAL dataset
work page 2019
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.