Image Quality Assessment of Identity Cards Using Measures from Open Face Image Quality

Christian Rathgeb; Gregor Grote; Juan E. Tapia

arxiv: 2606.11884 · v2 · pith:W7DP5V23new · submitted 2026-06-10 · 💻 cs.CV · cs.CR

Pith reviewed 2026-06-27 10:17 UTC · model grok-4.3

classification 💻 cs.CV cs.CR

keywords image quality assessment · identity cards · presentation attack detection · remote verification · Open Face Image Quality

The pith

Some Open Face Image Quality measures, after preprocessing, correlate with improved presentation attack detection on ID cards.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper applies a set of capture-related quality measures originally developed for face images to identity card photos in remote verification settings. A dedicated preprocessing step first locates card corners, corrects perspective distortion, and masks the foreground to isolate the relevant region before computing the measures. These quality scores are then checked for correlation against the accuracy of three different presentation attack detection algorithms on four separate ID card datasets that include both genuine images and printed fakes. The central finding is that selected measures from the Open Face Image Quality standard track PAD performance closely enough to suggest they can be used to strengthen attack detection.
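The perspective-correction step can be illustrated with a minimal sketch. It assumes the four card corners have already been located and solves for the 3x3 homography that maps them onto an upright rectangle. The function names, the example corner coordinates, and the 856 x 540 output size (chosen to match the ID-1 card aspect ratio) are illustrative assumptions, not details taken from the paper.

```python
def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):  # back substitution
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def homography(src, dst):
    """Return the 3x3 matrix mapping four src corners onto four dst corners."""
    a, b = [], []
    for (x, y), (u, v) in zip(src, dst):
        a.append([x, y, 1, 0, 0, 0, -u * x, -u * y])
        b.append(u)
        a.append([0, 0, 0, x, y, 1, -v * x, -v * y])
        b.append(v)
    h = solve_linear(a, b)
    return [h[0:3], h[3:6], [h[6], h[7], 1.0]]


def warp_point(hm, x, y):
    """Apply the homography to one point (projective division included)."""
    w = hm[2][0] * x + hm[2][1] * y + hm[2][2]
    return ((hm[0][0] * x + hm[0][1] * y + hm[0][2]) / w,
            (hm[1][0] * x + hm[1][1] * y + hm[1][2]) / w)


# Hypothetical detected corners (top-left, top-right, bottom-right, bottom-left).
corners = [(120, 80), (900, 140), (870, 640), (90, 560)]
# Assumed output rectangle with the ID-1 aspect ratio.
target = [(0, 0), (856, 0), (856, 540), (0, 540)]
hm = homography(corners, target)
```

Applying `warp_point(hm, x, y)` to every pixel coordinate (or, in practice, the inverse mapping with interpolation) yields the frontal view on which quality measures would then be computed.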

Core claim

The authors claim that quality assessment based on some Open Face Image Quality measures can significantly improve presentation attack detection performance when the measures are computed on ID card images that have first undergone corner detection, perspective normalization, and comprehensive foreground masking. The supporting evidence is correlation analysis rather than a direct integration experiment.

What carries the argument

The preprocessing pipeline (corner detection, perspective normalization, and foreground masking) that adapts Open Face Image Quality measures from faces to ID cards.
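To see why masking matters for unbiased scores, consider a minimal sketch in which a statistic is computed only over pixels the mask keeps. The brightness mean and standard deviation used here are simple stand-ins chosen for illustration; they are not the OFIQ measures, and `masked_stats` is a hypothetical helper.

```python
def masked_stats(gray, mask):
    """Mean and standard deviation of gray values where mask is True.

    gray: 2D list of pixel intensities; mask: 2D list of booleans, same shape.
    Returns None when the mask keeps no pixels.
    """
    vals = [p
            for row_p, row_m in zip(gray, mask)
            for p, keep in zip(row_p, row_m)
            if keep]
    if not vals:
        return None
    mean = sum(vals) / len(vals)
    var = sum((v - mean) ** 2 for v in vals) / len(vals)
    return mean, var ** 0.5


# Toy 2x2 image: the corner pixels are treated as regions to exclude.
gray = [[10, 200],
        [200, 250]]
mask = [[False, True],
        [True, False]]

stats_masked = masked_stats(gray, mask)
stats_all = masked_stats(gray, [[True, True], [True, True]])
```

In this toy case the excluded pixels shift the unmasked mean away from the value of the region of interest, which is the kind of bias the masking step is meant to prevent.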

If this is right

Selected OFIQ measures can be added as an input feature to existing PAD algorithms to raise their detection rates on both pristine and mock ID cards.
The same preprocessing and scoring pipeline works across multiple distinct ID card datasets without retraining the quality measures.
Quality filtering based on these measures can be inserted upstream of PAD to discard low-quality captures before attack detection is attempted.
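The upstream quality gate in the last point might look like the following sketch. It assumes each capture carries a single scalar quality score where higher is better; the `Capture` class, the score range, and the threshold value are hypothetical and would have to be calibrated on real data.

```python
from dataclasses import dataclass


@dataclass
class Capture:
    capture_id: str
    quality: float  # assumed scalar score, higher means better quality


def quality_gate(captures, threshold):
    """Split captures into those passed on to PAD and those sent back."""
    accepted = [c for c in captures if c.quality >= threshold]
    rejected = [c for c in captures if c.quality < threshold]
    return accepted, rejected


# Made-up scores for illustration only.
batch = [Capture("a", 82.0), Capture("b", 35.5), Capture("c", 60.0)]
accepted, rejected = quality_gate(batch, threshold=50.0)
```

Rejected captures could trigger a recapture request instead of being scored by the PAD algorithm.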

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same adaptation steps could be tested on other non-face identity documents such as passports or driver's licenses to check whether the correlation with attack detection generalizes.
Real-time computation of these quality scores during capture could trigger an automatic request for a better image before the verification process continues.

Load-bearing premise

The preprocessing pipeline ensures accurate and unbiased quality measure computation on ID card images.

What would settle it

A new set of ID card images where adding the selected OFIQ quality scores produces no measurable gain in PAD accuracy on any of the three tested detectors would falsify the claim.

Figures

Figures reproduced from arXiv: 2606.11884 by Christian Rathgeb, Gregor Grote, Juan E. Tapia.

**Figure 2.** Workflow of the image quality assessment (IQA) system.

**Figure 3.** Example of preprocessing steps. Adjacent body text (Section B, Quality Assessment): After preprocessing, the quality measures from Table I were computed on the preprocessed ID card images.

**Figure 4.** Aggregated r∆EDCs for each quality measure. Solid lines show the median r∆EDC, dashed lines show the mean r∆EDC, and the shaded area shows the 25th to 75th percentile r∆EDC. Adjacent body text (truncated in extraction): …cards, such as the degree to which the ID card is captured frontally or how strongly it is occluded. During preprocessing, many pixels are masked out to prevent biases in the quality measures, but this means that a lot of information ab…

Original abstract

This paper addresses the challenge of assessing image quality in ID cards in remote verification systems by applying capture-related quality measures from the Open Face Image Quality (OFIQ) standard to ID card images. Our preprocessing pipeline includes corner detection, perspective normalization, and comprehensive foreground masking to ensure accurate and unbiased quality measure computation. We evaluate the effectiveness of these measures by analyzing their correlation with the performance of three presentation attack detection (PAD) algorithms across four diverse ID card datasets, where two datasets contain bona fide, i.e. pristine, images and two contain printed mock ID cards. Our results suggest that quality assessment based on some OFIQ measures can significantly improve PAD performance.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note (private letter to a colleague)

The paper correlates some OFIQ measures with PAD performance on ID cards after preprocessing, but does not test whether using those measures inside a PAD pipeline actually improves detection rates.

The main takeaway is straightforward: they adapt Open Face Image Quality measures to ID card images using corner detection, perspective normalization, and foreground masking, then measure how those scores relate to the error rates of three existing PAD algorithms across four datasets. This is a practical domain extension rather than a new method.

What the paper does well is keep the evaluation grounded. It reuses an established standard and external PAD algorithms instead of building custom ones, and the preprocessing step directly tackles the mismatch between face-oriented quality measures and card-shaped images. That makes the setup reproducible for anyone working on remote ID verification.

The soft spot is the gap between the reported correlations and the claim that quality assessment can significantly improve PAD performance. Correlation with standalone PAD results does not show what happens when low-quality samples are filtered, reweighted, or used to condition the decision. No experiment is described that applies the quality scores inside the PAD pipeline and measures the resulting change in EER, APCER, or BPCER. The abstract also gives no numbers, dataset sizes, or statistical details, which leaves the strength of the correlations hard to judge.
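The missing experiment could be sketched as follows: compute APCER (attacks accepted as bona fide) and BPCER (bona fide presentations rejected as attacks) on all samples, discard the lowest-quality fraction, and recompute. The sample tuples below are made-up toy values, not results from the paper, and the helper names are hypothetical.

```python
def apcer_bpcer(samples):
    """samples: list of (quality, is_attack, pad_says_attack) tuples."""
    attacks = [s for s in samples if s[1]]
    bona_fide = [s for s in samples if not s[1]]
    apcer = (sum(1 for s in attacks if not s[2]) / len(attacks)
             if attacks else None)
    bpcer = (sum(1 for s in bona_fide if s[2]) / len(bona_fide)
             if bona_fide else None)
    return apcer, bpcer


def discard_lowest(samples, fraction):
    """Drop the given fraction of samples with the lowest quality scores."""
    ordered = sorted(samples, key=lambda s: s[0])
    n_drop = int(len(ordered) * fraction)
    return ordered[n_drop:]


# Toy data (invented): the two lowest-quality samples are the PAD errors.
samples = [
    (10, True, False),   # attack missed by PAD
    (20, False, True),   # bona fide wrongly rejected
    (60, True, True),
    (70, False, False),
    (80, True, True),
    (85, True, True),
    (90, False, False),
    (95, False, False),
]

before = apcer_bpcer(samples)
after = apcer_bpcer(discard_lowest(samples, 0.25))
```

A quality measure is useful for this purpose only if the error rates on real data fall as the discard fraction grows; a measure unrelated to PAD errors would leave them roughly unchanged.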

This paper is for engineers building remote verification systems who need quick ways to add quality gates to ID card scans. It is not aimed at researchers looking for new theoretical contributions. A serious referee could handle it if the authors add direct tests of whether the quality measures produce measurable PAD gains when used operationally; without that, the central suggestion rests on an assumption rather than evidence. I would send it to review with a request for those experiments rather than desk reject.

Referee Report

1 major / 1 minor

Summary. The manuscript applies capture-related quality measures from the Open Face Image Quality (OFIQ) standard to identity card images. A preprocessing pipeline (corner detection, perspective normalization, comprehensive foreground masking) is used to enable unbiased computation. Effectiveness is assessed via correlation analysis between these measures and the performance of three existing PAD algorithms across four ID card datasets (two bona fide, two with printed mocks). The authors conclude that some OFIQ measures can significantly improve PAD performance.

Significance. If the central claim holds, the work could aid remote ID verification by providing a standardized way to identify images where PAD is unreliable. The reuse of an existing standard (OFIQ) and evaluation on multiple external datasets and PAD algorithms are strengths. However, the absence of any reported numerical correlations, dataset sizes, or statistical tests in the abstract makes the practical impact difficult to gauge from the provided text.

major comments (1)

[Abstract] The claim that 'quality assessment based on some OFIQ measures can significantly improve PAD performance' is not supported by the described evaluation. The text states that effectiveness is evaluated by 'analyzing their correlation with the performance of three presentation attack detection (PAD) algorithms,' but no experiment is described that integrates an OFIQ quality measure into a PAD pipeline (e.g., via low-quality sample rejection, score weighting, or conditional decision) and reports the resulting change in EER, AUC, BPCER, or APCER.

minor comments (1)

[Abstract] No numerical results, correlation coefficients, p-values, dataset sizes, or error bars are supplied, which prevents verification of the 'significantly improve' assertion even at the level of the reported correlations.
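As an illustration of the kind of coefficient the referee asks for, a Spearman rank correlation between quality scores and a per-sample PAD outcome can be computed as below. This is a generic sketch, not the authors' evaluation method (Figure 4 indicates they report r∆EDC curves), and it omits the significance test that a full report would need.

```python
def ranks(values):
    """Ranks starting at 1, with ties given their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = avg
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))
```

For example, `spearman(quality_scores, pad_scores)` returns a value in [-1, 1], where both argument names are placeholders for per-sample lists of equal length.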

Simulated Author's Rebuttal

1 response · 0 unresolved

We thank the referee for the constructive feedback. We address the major comment on the abstract below and will revise accordingly to ensure the claims align precisely with the reported evaluation.

Referee: [Abstract] The claim that 'quality assessment based on some OFIQ measures can significantly improve PAD performance' is not supported by the described evaluation. The text states that effectiveness is evaluated by 'analyzing their correlation with the performance of three presentation attack detection (PAD) algorithms,' but no experiment is described that integrates an OFIQ quality measure into a PAD pipeline (e.g., via low-quality sample rejection, score weighting, or conditional decision) and reports the resulting change in EER, AUC, BPCER, or APCER.

Authors: We acknowledge the distinction: our evaluation consists of correlation analysis between the OFIQ measures (computed after preprocessing) and the performance metrics of three PAD algorithms on the four datasets, rather than an explicit integration experiment that applies quality-based filtering, weighting, or conditional decisions and quantifies the resulting change in EER/APCER/etc. The observed correlations support the suggestion that certain measures are associated with improved PAD reliability and could therefore be used to enhance performance in a deployed system, but the abstract wording does overstate the direct demonstration of improvement. We will revise the abstract (and relevant sections) to state that the measures exhibit significant correlations with PAD performance, indicating their potential utility for improving PAD in remote verification. If space permits in the revision, we will also add a brief integration experiment (e.g., rejecting low-quality samples) to strengthen the claim.

Revision: yes

Circularity Check

0 steps flagged

No significant circularity; empirical analysis uses external components

The paper performs an empirical evaluation: it applies existing OFIQ measures to preprocessed ID card images and computes correlations against the standalone error rates of three external PAD algorithms on four external datasets. No equations, fitted parameters, or self-referential definitions are described that would reduce the reported correlations or the suggestion of PAD improvement to quantities defined by the authors' own choices. The preprocessing steps are standard geometric operations with no mathematical derivation that loops back to the quality measures themselves. The central claim rests on observable statistical associations rather than any self-definitional or self-citation chain.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review supplies no identifiable free parameters, axioms, or invented entities; the central claim rests on the unverified effectiveness of the described preprocessing and the existence of the reported correlations.
