Methodology for Creating a Clinically Verified Dermoscopic Image Dataset

Kozachok Elena Sergeevna

arxiv: 2605.25168 · v1 · pith:6FOW2BB6new · submitted 2026-05-24 · 💻 cs.CV · cs.AI· cs.CV

Methodology for Creating a Clinically Verified Dermoscopic Image Dataset

Kozachok Elena Sergeevna This is my paper

classification 💻 cs.CV cs.AIcs.CV

keywords datasetclinicallydiagnosticimagesmethodologyverifieddermatoscopicimage

0 comments

read the original abstract

This study presents a methodology for constructing a clinically verified dataset of dermatoscopic images for medical informatics research. The relevance of the work is driven by the fact that the performance of automated diagnostic support systems depends not only on the volume of images, but also on the reproducibility of the image acquisition procedure, the completeness of structured metadata, and the reliability of diagnostic labels. International collections were primarily created under conditions that differ substantially from routine Russian outpatient practice and mobile dermatoscopy. The proposed methodology integrates three interconnected components: (1) a standard operating procedure (SOP) for acquiring images via mobile dermatoscopy, (2) an information model comprising 16 structured metadata fields organized into six clinically oriented blocks in ISIC-compatible notation, and (3) a multi-stage expert verification of diagnostic labels (initial clinical annotation, consensus review by three specialists, and histological confirmation of all malignant neoplasms). Using this methodology, a dataset of 1,026 unique dermatoscopic images from 443 patients was collected between June 2025 and May 2026. From 1,044 initial records, 18 duplicates were excluded. The dataset includes nine nosological categories; all 39 malignant lesions (18 melanomas, 15 basal cell carcinomas, and 6 squamous cell carcinomas) were histologically verified. Patient age ranged from 2 to 90 years (median 38), with 279 females (63%) and 164 males (37%). Each image is accompanied by expert-annotated dermatoscopic structures and an explicit verification_stage field indicating the level of diagnostic confirmation. The resulting dataset serves as a pilot clinically verified resource suitable for independent model evaluation, domain shift analysis, interpretability studies, and further expansion.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Cascade Classification of Dermoscopic Images of Skin Neoplasms with Controllable Sensitivity and External Clinical Validation
cs.CV 2026-06 unverdicted novelty 4.0

Cascade classification improves macro F1 over single-stage for some models by allowing sensitivity control but reveals a large generalization gap on external clinical data.
Clinical Validation of the Melanoscope AI Mobile Dermoscopy Clinical Decision Support System
cs.CV 2026-05 unverdicted novelty 3.0

Prospective single-center validation of a cascade deep learning dermoscopy CDSS found no false negatives for five malignant lesions and 88.3% specificity, with quantitative IoU assessment of attention maps.