PaveSync: A Unified and Comprehensive Dataset for Pavement Distress Analysis and Classification

Andrews Danyo; Anthony Dontoh; Armstrong Aboah; Blessing Agyei Kyem; Eugene Denteh; Joshua Kofi Asamoah

arxiv: 2512.20011 · v1 · pith:NOJNYL5Fnew · submitted 2025-12-23 · 💻 cs.CV

PaveSync: A Unified and Comprehensive Dataset for Pavement Distress Analysis and Classification

Blessing Agyei Kyem , Joshua Kofi Asamoah , Anthony Dontoh , Andrews Danyo , Eugene Denteh , Armstrong Aboah This is my paper

classification 💻 cs.CV

keywords datasetdetectiondistresspavementacrossannotationbenchmarkcomprehensive

0 comments

read the original abstract

Automated pavement defect detection often struggles to generalize across diverse real-world conditions due to the lack of standardized datasets. Existing datasets differ in annotation styles, distress type definitions, and formats, limiting their integration for unified training. To address this gap, we introduce a comprehensive benchmark dataset that consolidates multiple publicly available sources into a standardized collection of 52747 images from seven countries, with 135277 bounding box annotations covering 13 distinct distress types. The dataset captures broad real-world variation in image quality, resolution, viewing angles, and weather conditions, offering a unique resource for consistent training and evaluation. Its effectiveness was demonstrated through benchmarking with state-of-the-art object detection models including YOLOv8-YOLOv12, Faster R-CNN, and DETR, which achieved competitive performance across diverse scenarios. By standardizing class definitions and annotation formats, this dataset provides the first globally representative benchmark for pavement defect detection and enables fair comparison of models, including zero-shot transfer to new environments.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Hybrid Congestion Classification Framework Using Flow-Guided Attention and Empirical Mode Decomposition
cs.CV 2026-05 unverdicted novelty 3.0

FLO-EMD integrates flow-guided attention and EMD on aggregated motion traces to classify light, medium, and heavy congestion at 97.5% accuracy on 1,050 surveillance clips.