AlphaDent: A dataset for automated tooth pathology detection
Pith reviewed 2026-05-19 02:59 UTC · model grok-4.3
The pith
A new dataset of over 1200 tooth photographs from 295 patients enables high-quality neural network predictions for instance segmentation of dental pathologies.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The AlphaDent dataset consists of over 1200 DSLR photographs of teeth from 295 patients that have been labeled into nine classes for instance segmentation, and neural networks trained on it deliver high quality predictions for automated tooth pathology detection.
What carries the argument
Instance segmentation labels across nine tooth pathology classes on the collected DSLR images, which provide the training signal for neural networks to locate and outline individual pathologies.
If this is right
- Researchers can train and evaluate instance segmentation models that achieve high quality results on dental photographs.
- The open dataset and released code allow direct reproduction of the reported prediction quality.
- Automated tools built from this data can identify tooth conditions directly from standard camera images.
- The nine-class labeling scheme supplies a concrete starting point for further model development in dental analysis.
Where Pith is reading between the lines
- The dataset could support training of models that assist initial screening from everyday photos before clinical visits.
- Combining the labels with other dental image types might improve overall detection across different capture conditions.
- Community extensions of the nine classes could address additional pathologies not covered in the initial release.
Load-bearing premise
The manual labeling of images into the nine classes is accurate, consistent, and representative of real-world tooth pathologies.
What would settle it
A neural network trained on the dataset produces low-accuracy segmentation masks that fail to match independent expert labels on a new collection of tooth photographs.
Figures
read the original abstract
In this article, we present a new unique dataset for dental research - AlphaDent. This dataset is based on the DSLR camera photographs of the teeth of 295 patients and contains over 1200 images. The dataset is labeled for solving the instance segmentation problem and is divided into 9 classes. The article provides a detailed description of the dataset and the labeling format. The article also provides the details of the experiment on neural network training for the Instance Segmentation problem using this dataset. The results obtained show high quality of predictions. The dataset is published under an open license; and the training/inference code and model weights are also available under open licenses.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces AlphaDent, a new dataset of over 1200 DSLR photographs of teeth from 295 patients, annotated for instance segmentation across 9 pathology classes. It details the data collection and labeling format, reports baseline neural network training experiments for instance segmentation that yield high-quality predictions, and releases the dataset, training code, and model weights under open licenses.
Significance. If the ground-truth annotations prove reliable, AlphaDent would provide a useful open resource for computer vision research on dental pathology detection, with the baseline experiments and code release lowering the barrier for follow-up work. The contribution is self-contained and centers on new data rather than internal parameter fitting.
major comments (2)
- Labeling section: the manual annotation process into the 9 classes is described but includes no quantitative validation such as inter-annotator agreement, overlap metrics, or expert review of a held-out subset. This is load-bearing for the central claim that the NN training results demonstrate dataset utility, because without evidence that the labels are accurate and consistent the reported prediction quality could simply reflect annotator-specific patterns rather than clinically meaningful boundaries.
- Experimental results section: the claim of 'high quality of predictions' is stated without specific metrics (e.g., mAP, mask IoU, or per-class scores), details on validation splits, or error analysis. This gap prevents full verification that the experiment supports the dataset's value for instance segmentation.
minor comments (1)
- Abstract: the statement that results show 'high quality of predictions' would be strengthened by including one or two concrete performance numbers.
Simulated Author's Rebuttal
We thank the referee for the detailed and constructive review of our manuscript on the AlphaDent dataset. We address each major comment below and have revised the manuscript to strengthen the presentation of annotation quality and experimental results where possible.
read point-by-point responses
-
Referee: Labeling section: the manual annotation process into the 9 classes is described but includes no quantitative validation such as inter-annotator agreement, overlap metrics, or expert review of a held-out subset. This is load-bearing for the central claim that the NN training results demonstrate dataset utility, because without evidence that the labels are accurate and consistent the reported prediction quality could simply reflect annotator-specific patterns rather than clinically meaningful boundaries.
Authors: We agree that quantitative validation of the annotations would provide stronger evidence of label reliability. The annotations were performed by a single experienced dental clinician following a detailed protocol based on standard clinical diagnostic criteria for each of the nine pathology classes. In the revised manuscript we have expanded the labeling section with additional details on the annotation guidelines, quality-control steps, and the clinician's qualifications. We did not collect multiple independent annotations for the full dataset due to resource constraints, so formal inter-annotator agreement statistics are not available; we have therefore noted this as a limitation and indicated that such metrics could be collected in future dataset extensions. revision: partial
-
Referee: Experimental results section: the claim of 'high quality of predictions' is stated without specific metrics (e.g., mAP, mask IoU, or per-class scores), details on validation splits, or error analysis. This gap prevents full verification that the experiment supports the dataset's value for instance segmentation.
Authors: We accept that the original phrasing was insufficiently precise. The revised manuscript now reports the concrete evaluation metrics obtained from the baseline instance-segmentation experiments, including overall mAP and mask IoU values together with per-class scores. We have also added explicit descriptions of the train/validation/test splits and a short error analysis that identifies the most frequent failure modes. These quantitative results and analyses were generated during our internal experiments and are now presented in full to allow readers to assess the dataset's utility directly. revision: yes
Circularity Check
No significant circularity: self-contained dataset contribution
full rationale
The paper introduces a new dataset AlphaDent of DSLR tooth images from 295 patients, labeled into 9 pathology classes for instance segmentation, and reports baseline neural network training results. No equations, derivations, fitted parameters, or uniqueness theorems appear. Claims rest on empirical training outcomes and open release of data/code/weights rather than any reduction to internal definitions, self-citations, or renamings. This is a standard self-contained dataset paper evaluated against external training benchmarks.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption The DSLR photographs from 295 patients and their 9-class labels are accurate and representative for the instance segmentation task.
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
The dataset is labeled for solving the instance segmentation problem and is divided into 9 classes... Training settings: 9 classes. Task type: Instance Segmentation... mAP@50 value
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Publicly Available Dental Image Datasets for Artificial Intelligence,
Uribe, S. E., Issa, J., Sohrabniya, F., Denny, A., Kim, N. N., Dayo, A. F., Chaurasia, A., Sofi -Mahmudi, A., Büttner, M., and Schwendicke, F., “Publicly Available Dental Image Datasets for Artificial Intelligence,” Journal of Dental Research 103, 1365–1374 (2024). http://doi.org/10.1177/00220345241272052
-
[2]
Hamamci, I. E., Er, S., Simsar, E., Yuksel, A. E., Gultekin, S., Ozdemir, S. D., Yang, K., Li, H. B., Pati, S., Stadlinger, B., Mehl, A., Gundogar, M., and Menze, B., “DENTEX: An Abnormal Tooth Detection with Dental Enumeration and Diagnosis Benchmark for Panoramic X-rays,” (2023). https://doi.org/10.48550/arXiv.2305.19112
-
[3]
Diffusion-Based Hierarchical Multi -label Object Detection to Analyze Panoramic Dental X -Rays,
Hamamci, I. E., Er, S., Simsar, E., Sekuboyina, A., Gundogar, M., Stadlinger, B., Mehl, A., and Menze, B., “Diffusion-Based Hierarchical Multi -label Object Detection to Analyze Panoramic Dental X -Rays,” in [ Medical Image Computing and Computer Assisted Intervention – MICCAI 2023], Greenspan, H., Mad-abhushi, A., Mousavi, P., Salcudean, S., Duncan, J., ...
-
[4]
Tufts Dental Database: A Multimodal Panoramic X-Ray Dataset for Benchmarking Diagnostic Systems,
Panetta, K., Rajendran, R., Ramesh, A., Rao, S., and Agaian, S., “Tufts Dental Database: A Multimodal Panoramic X-Ray Dataset for Benchmarking Diagnostic Systems,” IEEE Journal of Biomedical and Health Informatics 26, 1650–1659 (2022). https://doi.org/10.1109/JBHI.2021.3117575
-
[5]
A benchmark for comparison of dental radiography analysis algorithms,
Wang, C.-W., Huang, C.-T., Lee, J.-H., Li, C.-H., Chang, S.-W., Siao, M.-J., Lai, T.-M., Ibragimov, B., Vrtovec, T., Ronneberger, O., Fischer, P., Cootes, T. F., and Lindner, C., “A benchmark for comparison of dental radiography analysis algorithms,” Medical Image Analysis 31, 63–76 (2016). https://doi.org/10.1016/j.media.2016.02.004
-
[6]
AI Dental Research Dataset Catalogue
Uribe, S. E., “AI Dental Research Dataset Catalogue.” https://github.com/sergiouribe/dental_datasets_itu/blob/ main/AI_Dental_Datasets_List.md. (Accessed: 28 July 2025)
work page 2025
-
[7]
A multi - modal dental dataset for semi -supervised deep learning image segmentation,
Wang, Y., Ye, F., Chen, Y., Wang, C., Wu, C., Xu, F., Ma, Z., Liu, Y., Zhang, Y., Cao, M., and Chen, X., “A multi - modal dental dataset for semi -supervised deep learning image segmentation,” Scientific Data 12, 117 (2025). https://doi.org/10.1038/S41597-024-04306-9
-
[8]
A multimodal dental dataset facilitating machine learning research and clinic services v1.1.0
Liu, W., Huang, Y., and Tang, S., “A multimodal dental dataset facilitating machine learning research and clinic services v1.1.0.” https://physionet.org/content/multimodal-dental-dataset/1.1.0/. (Accessed: 28 July 2025)
work page 2025
-
[9]
A multimodal dental dataset facilitating machine learning research and clinic services,
Huang, Y., Liu, W., Yao, C., Miao, X., Guan, X., Lu, X., Liang, X., Ma, L., Tang, S., Zhang, Z., and Zhan, J., “A multimodal dental dataset facilitating machine learning research and clinic services,” Scientific Data 11, 1 –11 (2024). https://doi.org/10.1038/S41597-024-04130-1
-
[10]
Teeth3DS: A benchmark for teeth segmentation and labeling from intra-oral 3D scans,
Ben-Hamadou, A., Neifar, N., Rekik, A., Smaoui, O., Bouzguenda, F., Pujades, S., Boyer, E., and Ladroit, E., “Teeth3DS+: An Extended Benchmark for Intraoral 3D Scans Analysis,” arXiv:2210.06094 (2022). https://doi.org/10.48550/ARXIV.2210.06094
-
[11]
Nguyen, K. D., Hoang, H. T., Doan, T. P. H., Dao, K. Q., Wang, D. H., and Hsu, M. L., “SegmentAnyTooth: An open-source deep learning framework for tooth enumeration and segmentation in intraoral photos,” Journal of Dental Sciences 20, 1110–1117 (2025). https://doi.org/10.1016/J.JDS.2025.01.003
-
[12]
Nguyen, K. D., “ GitHub: SegmentAnyTooth: An open -source deep learning framework for tooth enumeration and segmentation in intraoral photos.” https://github.com/thangngoc89/SegmentAnyTooth. (Accessed: 28 July 2025)
work page 2025
-
[13]
You, W., Hao, A., Li, S., Wang, Y., and Xia, B., “Deep learning -based dental plaque detection on primary teeth: a comparison with clinical assessments ,” BMC Oral Health 20, 1 –7 (2020). https://doi.org/10.1186/S12903-020- 01114-6/
-
[14]
Liu, Y., Cheng, Y., Song, Y., Cai, D., and Zhang, N., “Oral screening of dental calculus, gingivitis and dental caries through segmentation on intraoral photographic images using deep learning,” BMC Oral Health 24, 1–10 (2024). https://doi.org/10.1186/S12903-024-05072-1
-
[15]
Piyarathne, N. S., Liyanage, S. N., Rasnayaka, R. M. S. G. K., Hettiarachchi, P. V. K. S., Devindi, G. A. I., Francis, F. B. A. H., Dissanayake, D. M. D. R., Ranasinghe, R. A. N. S., Pavithya, M. B. D., Nawinne, I. B., Ragel, R. G., and Jayasinghe, R. D., “A comprehensive dataset of annotated oral cavity images for diagnosis of oral cancer and oral potent...
-
[16]
He, K., Gkioxari, G., Dollár, P., and Girshick, R., “Mask R-CNN,” (2018). http://arxiv.org/abs/1703.06870
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[17]
Leading Image Video Data Annotation Platform — CVAT
“Leading Image Video Data Annotation Platform — CVAT.” https://www.cvat.ai/. (Accessed: 28 July2025)
-
[18]
“GitHub - cvat-ai/cvat: Annotate better with CVAT, the industry -leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.” https://github.com/cvat -ai/cvat. (Accessed: 28 July 2025)
work page 2025
-
[19]
Sturdevant, C. M., Roberson, T. M., Heymann, H. O., and Swift, E. J., [ Sturdevant’s art and science of operative dentistry], Elsevier, 7th ed. (2018)
work page 2018
-
[20]
V., [ Operative dentistry: The pathology of the hard tissues of the teeth (Vol
Black, G. V., [ Operative dentistry: The pathology of the hard tissues of the teeth (Vol. 1) ], Medico -Dental Publishing Company (1908)
work page 1908
-
[21]
Kaggle: AlphaDent: Teeth marking
“Kaggle: AlphaDent: Teeth marking.” https://www.kaggle.com/competitions/alpha-dent/. (Accessed: 30 July 2025)
work page 2025
-
[22]
Terven, J., Córdova -Esparza, D. M., and Romero -González, J. A., “A Comprehensive Review of YOLO Architectures in Computer Vision: From YOLOv1 to YOLOv8 and YOLO -NAS,” Machine Learning and Knowledge Extraction 2023 5, 1680–1716 (2023). https://doi.org/10.3390/MAKE5040083
-
[23]
The pascal visual object classes (VOC) challenge,
Everingham, M., Van Gool, L., Williams, C. K., Winn, J., and Zisserman, A., “The pascal visual object classes (VOC) challenge,” International Journal of Computer Vision , 303 –338 (2010). https://doi.org/10.1007/S11263-009- 0275-4
-
[24]
“GitHub: AlphaDent.” https://github.com/ZFTurbo/AlphaDent. (Accessed: 30 July 2025)
work page 2025
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.