Receipt Replay OOD: A Small Benchmark for Screen Replay Detection Under Domain Shift
Pith reviewed 2026-06-29 18:20 UTC · model grok-4.3
The pith
Receipts form a privacy-safe out-of-domain test for screen replay detectors trained on identity documents.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Receipt Replay OOD demonstrates that document replay detection models suffer measurable performance loss when evaluated on receipts, which share planar geometry, curved corners, wear-and-tear artifacts, and text or logo patterns with identity documents yet avoid personally identifiable information constraints.
What carries the argument
The Receipt Replay OOD benchmark dataset, used as an out-of-domain test set to expose generalization failures under domain shift.
If this is right
- Existing replay detectors trained on public ID datasets will show reduced accuracy on receipt images.
- Domain shift between document types directly degrades presentation attack detection performance.
- Receipts can substitute for identity documents in robustness testing while sidestepping data-privacy rules.
- Benchmark results quantify the size of the generalization gap that future training methods must close.
Where Pith is reading between the lines
- Larger receipt collections with varied lighting and camera angles could strengthen the benchmark for future OOD studies.
- Training recipes that explicitly regularize for shape and texture invariance might reduce the observed domain gap.
- The same receipt proxy idea could apply to other document-security tasks that currently face PII restrictions.
Load-bearing premise
Receipts share enough geometric, textural, and artifact characteristics with identity documents to make performance on receipts a meaningful signal of out-of-domain robustness.
What would settle it
A controlled experiment in which the same models are tested on both Receipt Replay OOD and a held-out set of real identity documents under matched replay conditions; if accuracy patterns diverge sharply, the proxy claim fails.
read the original abstract
Public datasets such as DLC-2021, SynID, and KID34K have significantly contributed to research on presentation attack detection for identity documents, including screen replay attacks. However, evaluation of out-of-domain (OOD) robustness remains insufficiently explored, especially under realistic domain shifts. In this work, we introduce Receipt Replay OOD, a small out-of-domain benchmark for screen replay detection. Receipts share several characteristics with identity documents, including planar geometry, curved corners, wear-and-tear artifacts, and text or logo patterns, while avoiding personally identifiable information constraints commonly associated with identity documents. We evaluate document replay detection models under cross-domain conditions and demonstrate the impact of domain shift on generalization performance. The dataset is publicly available.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces Receipt Replay OOD, a small out-of-domain benchmark for screen replay detection. Receipts are proposed as a suitable proxy for identity documents because they share planar geometry, curved corners, wear-and-tear artifacts, and text or logo patterns, while avoiding PII constraints. The work evaluates document replay detection models under cross-domain conditions and demonstrates the impact of domain shift on generalization performance. The dataset is made publicly available.
Significance. If the evaluation supports the claims, this benchmark fills a gap in OOD robustness testing for presentation attack detection by providing a PII-free alternative to identity document datasets such as DLC-2021, SynID, and KID34K. The public availability of the dataset supports reproducibility and further research in the field. This is a modest but useful contribution for testing generalization of replay detectors.
minor comments (2)
- The abstract would be strengthened by including at least one quantitative result illustrating the domain shift impact.
- [Dataset] Provide more details on the number of samples, acquisition conditions, and any preprocessing steps applied to the receipt images in the dataset description section.
Simulated Author's Rebuttal
We thank the referee for the positive assessment of Receipt Replay OOD as a useful PII-free benchmark for OOD robustness in screen replay detection, and for recommending minor revision. No specific major comments were raised in the report.
Circularity Check
No significant circularity
full rationale
The paper introduces a benchmark dataset and reports cross-domain evaluation results. It contains no equations, derivations, parameter fitting, or predictive claims that could reduce to inputs by construction. The justification for receipts as a proxy for ID documents is an explicit design assumption rather than a derived result. No self-citations are load-bearing for any central claim.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Receipts share planar geometry, curved corners, wear-and-tear artifacts, and text or logo patterns with identity documents.
Reference graph
Works this paper leans on
-
[1]
Introduction and Related Works Multiple datasets have been introduced for presentation attack detection in the domain of identity documents. The DLC-2021 dataset (Polevoy et al., 2022), based on images from the MIDV family, contains 10 document types and multiple attack categories including color photocopies, grayscale copies, and screen replay attacks ca...
2021
-
[2]
At the same time, receipts avoid legal and privacy limitations associated with identity documents, making them suitable for public OOD benchmarking
Dataset description Receipts were selected as lightweight real-world planar objects sharing several visual characteristics with identity documents, including text regions, curved geometry, folds, and wear-and-tear artifacts. At the same time, receipts avoid legal and privacy limitations associated with identity documents, making them suitable for public O...
-
[3]
Experiments To support this research, we trained three models for the screen replay detection task using the original DLC-2021 (RE) train-test split. The evaluated architectures included a custom ResNet-inspired model trained from scratch, EfficientNet-B0V2 fine-tuned from ImageNet pretraining, and ViT-Small with frozen DINOv2 backbone and fine-tuned clas...
2021
-
[4]
Park, E.-J., Back, S.-Y., Kim, J., & Woo, S. S. (2023). KID34K: A dataset for online identity card fraud detection . In Proceedings of the 2023 ACM Workshop on Information Hiding and Multimedia Security (pp. 191–196). ACM. https://doi.org/10.1145/3583780.3615122
-
[5]
Polevoy, D. V., Sigareva, I. V., Ershova, D. M., Arlazarov, V. V., Nikolaev, D. P., Ming, Z., Luqman, M. M., & Burie, J.-C. (2022). Document liveness challenge dataset (DLC-2021) . Journal of Imaging, 8 (7), 181. https://doi.org/10.3390/jimaging8070181
- [6]
- [7]
-
[8]
E., Stockhardt, F., González-Soler, L
Tapia, J. E., Stockhardt, F., González-Soler, L. J., & Busch, C. (2025). SynID: Passport synthetic dataset for presentation attack detection . arXiv:2505.07540
- [9]
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.