Detecting Dental Landmarks from Intraoral 3D Scans: the 3DTeethLand challenge
Pith reviewed 2026-05-17 00:11 UTC · model grok-4.3
The pith
The 3DTeethLand challenge shows deep learning methods can detect 3D dental landmarks from intraoral scans with a top rank score of 0.91.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The 3DTeethLand challenge supplies a public dataset of 340 intraoral 3D scans to benchmark 3D dental landmark detection, resulting in 49 teams competing and the winner attaining a rank score of 0.91, mean Average Precision of 0.78, and mean Average Recall of 0.65 by applying a two-stage Stratified Transformer for segmentation followed by weighted DBSCAN.
What carries the argument
A two-stage pipeline consisting of a Stratified Transformer to segment individual teeth in the 3D scan followed by weighted DBSCAN clustering to pinpoint landmark locations.
Load-bearing premise
The 340 scans include enough examples of different tooth sizes, shapes, and positions to stand in for real-world patient variety, and that the rank score plus precision and recall numbers track with improvements in actual clinical outcomes.
What would settle it
Collecting a fresh set of 200 or more intraoral scans from a new group of patients and measuring whether the winning algorithm maintains a mean average precision above 0.75 on that set.
Figures
read the original abstract
Teeth landmark detection is a key task in modern orthodontics, supporting advanced diagnosis, personalized treatment planning, and effective monitoring of treatment progress. However, several significant challenges may arise due to the intricate geometry of individual teeth and the substantial variations observed across different individuals. To address these complexities, the development of advanced techniques, especially through the application of deep learning, is essential for the precise and reliable detection of 3D tooth landmarks. In this context, the 3DTeethLand challenge was held in conjunction with the International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI) in 2024, calling for algorithms focused on teeth landmark detection from intraoral 3D scans. This challenge introduced a publicly available dataset for 3D dental landmark detection from 340 intraoral scans, providing a standardized benchmark to evaluate state-of-the-art approaches and encouraging methodological advances toward addressing this clinically problem. A total of 49 teams participated, and 6 teams reached the final phase. The winning team achieved a rank score of 0.91, with a mean Average Precision of 0.78 and a mean Average Recall of 0.65, demonstrating a balance between precision and recall. Top teams achieved high precision with different strategies: the first-ranked team used a two-stage Stratified Transformer with segmentation and weighted DBSCAN, while the second-ranked team adopted a single-stage DGCNN with offset regression and class-specific non-maximum suppression.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript reports on the 3DTeethLand challenge organized at MICCAI 2024 for detecting dental landmarks from intraoral 3D scans. It introduces a publicly available dataset of 340 intraoral scans as a standardized benchmark, notes participation by 49 teams with 6 reaching the final phase, and presents the top results including the winning team's rank score of 0.91, mAP of 0.78, and mAR of 0.65 achieved via a two-stage Stratified Transformer with segmentation and weighted DBSCAN; the second-place entry used a single-stage DGCNN with offset regression.
Significance. If the benchmark holds, the work supplies a valuable public dataset and community benchmark for 3D dental landmark detection that can accelerate reproducible research in orthodontic applications. The reported participation numbers and diversity of top-performing strategies (transformer-based versus point-cloud CNN) provide a useful snapshot of current methodological capabilities. The explicit release of the dataset is a concrete strength that supports future method development.
major comments (3)
- [Introduction] Introduction: The positioning of the task as supporting 'advanced diagnosis, personalized treatment planning, and effective monitoring of treatment progress' is not backed by any reader study, downstream clinical-task evaluation, or analysis showing correlation between the reported geometric metrics (rank score, mAP, mAR) and actual clinical utility.
- [Dataset] Dataset section: The assertion that the 340 scans capture 'substantial variations observed across different individuals' is stated without accompanying demographic or geometric statistics (age range, ethnicity, malocclusion severity, missing teeth counts), leaving the representativeness claim unquantified.
- [Evaluation] Evaluation: Full details on the evaluation protocol, train/test splits, and inter-rater variability or agreement metrics for the landmark annotations are not provided, which is necessary to interpret the reliability of the headline numbers (mAP 0.78, mAR 0.65).
minor comments (2)
- [Abstract] Abstract: 'clinically problem' appears to be a typographical error for 'clinical problem'.
- [Methods] Methods descriptions of the finalist teams would benefit from explicit statements on training hyperparameters, data augmentation, and inference-time settings to improve reproducibility.
Simulated Author's Rebuttal
We thank the referee for the thoughtful and constructive comments on our manuscript describing the 3DTeethLand MICCAI 2024 challenge. We address each major comment below. Revisions have been made to improve clarity and completeness where feasible within the scope of a challenge report.
read point-by-point responses
-
Referee: [Introduction] The positioning of the task as supporting 'advanced diagnosis, personalized treatment planning, and effective monitoring of treatment progress' is not backed by any reader study, downstream clinical-task evaluation, or analysis showing correlation between the reported geometric metrics (rank score, mAP, mAR) and actual clinical utility.
Authors: We agree that the manuscript does not contain a dedicated reader study or direct correlation analysis linking the geometric metrics to clinical endpoints. The introductory phrasing draws on established orthodontic literature regarding the role of landmark detection. In the revised version we will temper the language to present these as potential clinical applications supported by prior work and will add relevant citations. A full reader study or downstream task evaluation lies outside the scope of this benchmark paper. revision: partial
-
Referee: [Dataset] The assertion that the 340 scans capture 'substantial variations observed across different individuals' is stated without accompanying demographic or geometric statistics (age range, ethnicity, malocclusion severity, missing teeth counts), leaving the representativeness claim unquantified.
Authors: We acknowledge that quantitative descriptors would strengthen the dataset description. The revised manuscript will include available statistics on age range, gender distribution, malocclusion severity categories, and counts of missing teeth. Ethnicity information was not collected during acquisition for privacy reasons and will be noted as such. revision: yes
-
Referee: [Evaluation] Full details on the evaluation protocol, train/test splits, and inter-rater variability or agreement metrics for the landmark annotations are not provided, which is necessary to interpret the reliability of the headline numbers (mAP 0.78, mAR 0.65).
Authors: We appreciate this observation. The challenge website and supplementary material contain the full protocol and split details, but the main text will be expanded to provide a self-contained description of the evaluation protocol, exact train/test split ratios, and any computed inter-annotator agreement metrics. Where inter-rater data are limited we will state this explicitly. revision: yes
Circularity Check
Empirical benchmark report with no derivation chain
full rationale
This is a challenge summary paper that releases a dataset of 340 scans and reports external team submissions evaluated with standard detection metrics (rank score 0.91, mAP 0.78, mAR 0.65). No equations, fitted parameters, or internal derivations appear in the provided text. All performance numbers originate from independent participant algorithms rather than any self-referential construction or self-citation load-bearing step. The paper is therefore self-contained against external benchmarks and exhibits no circularity.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Standard point-cloud deep learning architectures and metrics such as mean Average Precision and mean Average Recall are appropriate for 3D dental landmark detection.
Reference graph
Works this paper leans on
-
[1]
F. Angelone, A. M. Ponsiglione, C. Ricciardi, G. Cesarelli, M. Sansone, F. Amato, Di- agnostic Applications of Intraoral Scanners: A Systematic Review , Journal of Imaging 9 (7) (2023) 134
work page 2023
-
[2]
S. Logozzo, E. M. Zanetti, G. Franceschini, A. Kilpelä, A. Mäkynen, Recent advances in dental optics – Part I: 3D intraoral scanners for restorative dentistry, Optics and lasers in engineering 54 (2014) 203–221
work page 2014
-
[3]
F. Eggmann, M. Blatz, Recent Advances in Intraoral Scanners, Journal of Dental Re- search 103 (13) (2024) 1349–1357
work page 2024
-
[4]
B. Woodsend, E. Koufoudaki, P. A. Mossey, P. Lin, Automatic recognition of landmarks on digital dental models, Computers in Biology and Medicine 137 (2021) 104819. 5https://github.com/crns-smartvision/3DTeethLand/tree/master 27
work page 2021
- [5]
-
[6]
F. Bolelli, L. Lumetti, S. Vinayahalingam, M. Di Bartolomeo, A. Pellacani, K. March- esini, N. Van Nistelrooij, P. Van Lierop, T. Xi, Y. Liu, et al., Segmenting the Inferior Alveolar Canal in CBCTs Volumes: The ToothFairy Challenge, IEEE Transactions on Medical Imaging (2024)
work page 2024
-
[7]
Y. Wang, Y. Zhang, X. Chen, S. Wang, D. Qian, F. Ye, F. Xu, H. Zhang, R. Dan, Q.Zhang, etal., MICCAI2023STSChallenge: Aretrospectivestudyofsemi-supervised approaches for teeth segmentation, Pattern Recognition (2025) 112049
work page 2025
-
[8]
A. Ben-Hamadou, O. Smaoui, A. Rekik, S. Pujades, E. Boyer, H. Lim, M. Kim, M. Lee, M. Chung, Y.-G. Shin, et al., 3DTeethSeg’22: 3D Teeth Scan Segmentation and Label- ing Challenge, arXiv preprint arXiv:2305.18277 (2023)
-
[9]
A.Rekik, A.Ben-Hamadou, O.Smaoui, F.Bouzguenda, S.Pujades, E.Boyer, TSegLab: Multi-stage 3D dental scan segmentation and labeling, Computers in Biology and Medicine 185 (2025) 109535
work page 2025
-
[10]
S. Triarjo, R. Sarno, S. C. Hidayati, G. Sihaj, Automatic 3D Digital Dental Landmark Based on Point Transformation Weight , in: 2023 International Conference on Artificial Intelligence in Information and Communication (ICAIIC), IEEE, 2023, pp. 336–341
work page 2023
-
[11]
T.-H. Wu, C. Lian, S. Lee, M. Pastewait, C. Piers, J. Liu, F. Wang, L. Wang, C.-Y. Chiu, W. Wang, et al., Two-Stage Mesh Deep Learning for Automated Tooth Seg- mentation and Landmark Localization on 3D Intraoral Scans , IEEE transactions on medical imaging 41 (11) (2022) 3158–3166
work page 2022
-
[12]
G. Wei, Z. Cui, J. Zhu, L. Yang, Y. Zhou, P. Singh, M. Gu, W. Wang, Dense Rep- resentative Tooth Landmark/axis Detection Network on 3D Model , Computer Aided Geometric Design 94 (2022) 102077
work page 2022
-
[13]
Y. Lang, X. Chen, H. H. Deng, T. Kuang, J. C. Barber, J. Gateno, P.-T. Yap, J. J. Xia, DentalPointNet: landmark localization on high-resolution 3D digital dental models, in: International Conference on Medical Image Computing and Computer-Assisted Intervention, Springer, 2022, pp. 444–452
work page 2022
-
[14]
Y. Lang, H. H. Deng, D. Xiao, C. Lian, T. Kuang, J. Gateno, P.-T. Yap, J. J. Xia, DLLNet: An Attention-Based Deep Learning Method for Dental Landmark Localiza- tion on High-Resolution 3D Digital Dental Models, in: Medical Image Computing and Computer Assisted Intervention–MICCAI 2021: 24th International Conference, Stras- bourg, France, September 27–Octob...
work page 2021
- [15]
-
[16]
C. Lian, L. Wang, T.-H. Wu, F. Wang, P.-T. Yap, C.-C. Ko, D. Shen, Deep Multi- Scale Mesh Feature Learning for Automated Labeling of Raw Dental Surfaces From 3D Intraoral Scanners , IEEE transactions on medical imaging 39 (7) (2020) 2440–2450
work page 2020
-
[17]
C. Lian, L. Wang, T.-H. Wu, M. Liu, F. Durán, C.-C. Ko, D. Shen, MeshSNet: Deep Multi-scale Mesh Feature Learning for End-to-End Tooth Labeling on 3D Dental Sur- faces, in: Medical Image Computing and Computer Assisted Intervention – MICCAI 2019, Springer, 2019, pp. 837–845
work page 2019
-
[18]
Teeth3DS: A benchmark for teeth segmentation and labeling from intra-oral 3D scans,
A.Ben-Hamadou, N.Neifar, A.Rekik, O.Smaoui, F.Bouzguenda, S.Pujades, E.Boyer, E. Ladroit, Teeth3DS+: An Extended Benchmark for Intraoral 3D Scans Analysis , arXiv preprint arXiv:2210.06094 (2022)
-
[19]
M. Ester, H.-P. Kriegel, J. Sander, X. Xu, A density-based algorithm for discovering clusters in large spatial databases with noise, in: Proceedings of the Second Interna- tional Conference on Knowledge Discovery and Data Mining, AAAI Press, 1996, p. 226–231
work page 1996
-
[20]
M. Garland, P. S. Heckbert, Surface Simplification Using Quadric Error Metrics, in: Proceedings of the 24th annual conference on Computer graphics and interactive tech- niques, ACM Press/Addison-Wesley Publishing Co., 1997, pp. 209–216
work page 1997
- [21]
-
[22]
H. Jin, Y. Shen, J. Lou, K. Zhou, Y. Zheng, KeypointDETR: An End-to-End 3D Keypoint Detector, in: European Conference on Computer Vision, Springer, 2025, pp. 374–390
work page 2025
- [23]
-
[24]
T. Dascalu, B. Ibragimov, Assignment Theory-Augmented Neural Network for Den- tal Arch Labeling, in: International Conference on Medical Image Computing and Computer-Assisted Intervention, Springer, 2023, pp. 295–304
work page 2023
-
[25]
J. Li, J. Zhou, Y. Xiong, X. Chen, C. Chakrabarti, An Adjustable Farthest Point Sam- pling Method for Approximately-sorted Point Cloud Data, in: 2022 IEEE Workshop on Signal Processing Systems (SiPS), IEEE, 2022, pp. 1–6. 29
work page 2022
- [26]
-
[27]
T.-H. Tran, T.-D. Cao, T.-T.-H. Tran, HDBSCAN: Evaluating the Performance of Hierarchical Clustering for Big Data, in: Soft Computing: Biomedical and Related Applications, Springer, 2021, pp. 273–283
work page 2021
-
[28]
Y. You, et al., KeypointNet: A Large-scale 3D Keypoint Dataset Aggregated from Numerous Human Annotations , in: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, 2020, pp. 13644–13653
work page 2020
-
[29]
L. Maier-Hein, A. Reinke, P. Godau, M. D. Tizabi, F. Buettner, E. Christodoulou, B. Glocker, F. Isensee, J. Kleesiek, M. Kozubek, et al., Metrics reloaded: Recommen- dations for image analysis validation , Nature methods 21 (2) (2024) 195–212. 30
work page 2024
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.