iWatchRoad: Scalable Detection and Geospatial Visualization of Potholes for Smart Cities
Pith reviewed 2026-05-18 22:44 UTC · model grok-4.3
The pith
iWatchRoad combines a fine-tuned YOLO detector with GPS and OCR to turn ordinary dashcam video into geotagged pothole maps on OpenStreetMap.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
iWatchRoad demonstrates that fine-tuning Ultralytics YOLO on a self-annotated dataset of over 7,000 frames captured across varied Indian road types, lighting conditions, and weather scenarios, together with OCR timestamp extraction and GPS synchronization, produces accurate real-time pothole detections that can be stored with metadata and visualized on OpenStreetMap to support government road assessment and maintenance planning.
What carries the argument
The iWatchRoad pipeline that links fine-tuned YOLO object detection, OCR-based timestamp reading from video frames, GPS log synchronization for geotagging, database storage of detections and frames, and OpenStreetMap web visualization.
If this is right
- Maintenance teams receive location-tagged images and timestamps that can be used directly for repair scheduling.
- The same hardware and software setup works for both urban streets and rural roads without extra sensors.
- New footage can be processed continuously to update the map as vehicles drive.
- The outputs match the format needed for official road condition reports in developing regions.
Where Pith is reading between the lines
- The pipeline could be retrained to spot additional road defects such as cracks or loose gravel.
- Multiple vehicles running the same software would create crowdsourced coverage of entire cities or highways.
- Historical detections over months or years might reveal which road sections deteriorate fastest.
- Deployment on roads outside India would require checking whether the current training data still suffices.
Load-bearing premise
The self-annotated dataset of over 7,000 frames from varied Indian road types, lighting, and weather is representative enough for the fine-tuned YOLO model to generalize reliably in real-world deployment.
What would settle it
A field test on dashcam footage from road conditions absent from the training set, such as heavy monsoon flooding or completely unpaved surfaces, that shows detection accuracy falling well below the levels reported in the paper.
Figures
read the original abstract
Potholes on the roads are a serious hazard and maintenance burden. This poses a significant threat to road safety and vehicle longevity, especially on the diverse and under-maintained roads of India. In this paper, we present a complete end-to-end system called iWatchRoad for automated pothole detection, Global Positioning System (GPS) tagging, and real time mapping using OpenStreetMap (OSM). We curated a large, self-annotated dataset of over 7,000 frames captured across various road types, lighting conditions, and weather scenarios unique to Indian environments, leveraging dashcam footage. This dataset is used to fine-tune, Ultralytics You Only Look Once (YOLO) model to perform real time pothole detection, while a custom Optical Character Recognition (OCR) module was employed to extract timestamps directly from video frames. The timestamps are synchronized with GPS logs to geotag each detected potholes accurately. The processed data includes the potholes' details and frames as metadata is stored in a database and visualized via a user friendly web interface using OSM. iWatchRoad not only improves detection accuracy under challenging conditions but also provides government compatible outputs for road assessment and maintenance planning through the metadata visible on the website. Our solution is cost effective, hardware efficient, and scalable, offering a practical tool for urban and rural road management in developing regions, making the system automated. iWatchRoad is available at https://smlab.niser.ac.in/project/iwatchroad
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents iWatchRoad, an end-to-end system for automated pothole detection on Indian roads. It curates a self-annotated dataset of over 7,000 dashcam frames across varied road types, lighting, and weather; fine-tunes an Ultralytics YOLO model for real-time detection; uses a custom OCR module to extract timestamps from video frames; synchronizes these with GPS logs for accurate geotagging; stores results with metadata in a database; and visualizes them on OpenStreetMap via a user-friendly web interface. The work claims improved detection accuracy under challenging conditions and provides government-compatible outputs for road assessment and maintenance planning, positioning the system as cost-effective, hardware-efficient, and scalable for smart cities in developing regions.
Significance. If the accuracy claims are substantiated, the integration of real-time detection with precise geotagging and public geospatial visualization offers a practical, deployable tool for road maintenance in under-resourced areas. The emphasis on metadata suitable for government use and the availability of the system at a project website are positive for real-world applicability and reproducibility. The approach assembles existing components (YOLO, OCR, GPS, OSM) in a domain-specific pipeline rather than introducing novel algorithms.
major comments (2)
- [Abstract] Abstract: The central claim that iWatchRoad 'improves detection accuracy under challenging conditions' is unsupported by any quantitative results. No mAP, precision, recall, F1-score, confusion matrix, train/test split details, baseline comparisons (e.g., to off-the-shelf YOLO or prior pothole detectors), or ablation studies on the 7,000-frame dataset are provided. This absence prevents evaluation of the detection step, which is load-bearing for all downstream claims about the pipeline and its utility.
- [Dataset and Model Fine-tuning] Dataset curation and model fine-tuning description: The self-annotated dataset of over 7,000 frames is presented as representative of Indian road conditions, but no information is given on annotation protocol, quality control, inter-annotator agreement, class distribution, or how 'challenging conditions' (lighting, weather, road types) were balanced or tested. This directly affects the weakest assumption regarding reliable generalization in real-world deployment.
minor comments (2)
- The project website link is provided but the manuscript does not indicate whether the dataset, code, or trained model weights are publicly released to support reproducibility.
- Figure captions and the web-interface description could more explicitly link displayed metadata fields to the government-compatible outputs claimed in the abstract.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback on our manuscript. We address each major comment below and will revise the manuscript to provide the requested quantitative details and methodological clarifications.
read point-by-point responses
-
Referee: [Abstract] Abstract: The central claim that iWatchRoad 'improves detection accuracy under challenging conditions' is unsupported by any quantitative results. No mAP, precision, recall, F1-score, confusion matrix, train/test split details, baseline comparisons (e.g., to off-the-shelf YOLO or prior pothole detectors), or ablation studies on the 7,000-frame dataset are provided. This absence prevents evaluation of the detection step, which is load-bearing for all downstream claims about the pipeline and its utility.
Authors: We agree that the abstract and current manuscript do not include quantitative performance metrics or comparisons to support the claim of improved detection accuracy. The manuscript emphasizes the end-to-end pipeline and deployment aspects but lacks a dedicated evaluation section. We will revise the abstract to incorporate key metrics and add a new results section reporting mAP, precision, recall, F1-score, confusion matrix, train/test split details (e.g., 80/20), baseline comparisons against the off-the-shelf Ultralytics YOLO and relevant prior pothole detectors, plus ablation studies on the effects of lighting, weather, and road types. revision: yes
-
Referee: [Dataset and Model Fine-tuning] Dataset curation and model fine-tuning description: The self-annotated dataset of over 7,000 frames is presented as representative of Indian road conditions, but no information is given on annotation protocol, quality control, inter-annotator agreement, class distribution, or how 'challenging conditions' (lighting, weather, road types) were balanced or tested. This directly affects the weakest assumption regarding reliable generalization in real-world deployment.
Authors: We acknowledge the absence of these dataset details in the manuscript. We will expand the dataset and model fine-tuning section to describe the annotation protocol (multi-annotator use of tools such as LabelImg), quality control steps, inter-annotator agreement statistics, class distribution across the 7,000 frames, and the sampling approach used to balance representation of diverse lighting, weather, and road-type conditions. revision: yes
Circularity Check
No significant circularity; engineering assembly of standard components with no derivations or self-referential claims
full rationale
The paper presents an applied system for pothole detection and mapping: it curates a self-annotated dataset of 7000+ frames, fine-tunes an off-the-shelf Ultralytics YOLO model, extracts timestamps via custom OCR, synchronizes with GPS logs, and visualizes results on OSM. No equations, first-principles derivations, or predictions appear in the abstract or described pipeline. The accuracy improvement claim is an unverified assertion rather than a derived result, but it does not reduce to any input by construction, self-citation load-bearing, or renamed known pattern. The work is self-contained as a practical integration of existing tools without any load-bearing step that collapses to its own fitted values or prior author results.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Jaided AI. 2020. EasyOCR: Ready-to-use OCR with 80+ languages supported. https://github.com/JaidedAI/EasyOCR
work page 2020
-
[2]
S. Kranthi Kumar Chowdary, Y. Harshith, and T. Preethiya. 2025. Smart Pothole Detection and Traffic Sign Identification for Indian Roads: A Machine Learning Approach Using Yolov11. In 2025 International Conference on Data Science and Business Systems (ICDSBS) . 1–6. doi:10.1109/ ICDSBS63635.2025.11031494
-
[3]
Brad Dwyer, Joseph Nelson, Tom Hansen, et al. 2025. Roboflow (Version 1.0). https://roboflow.com Computer vision platform
work page 2025
-
[4]
Rui Fan, Yanan Liu, Xingrui Yang, Mohammud Junaid Bocus, Naim Dahnoun, and Scott Tancock. 2018. Real-Time Stereo Vision for Road Surface 3-D Reconstruction. In 2018 IEEE International Conference on Imaging Systems and Techniques (IST) . 1–6. doi:10.1109/IST.2018.8577119
-
[5]
M. Hoseini, S. Puliti, S. Hoffmann, and R. Astrup. 2023. Pothole detection in the woods: a deep learning approach for forest road surface monitoring with dashcams. International Journal of Forest Engineering 35, 2 (2023), 303–312. doi:10.1080/14942119.2023.2290795
-
[6]
Dharneeshkar J, Soban Dhakshana V, Aniruthan S A, Karthika R, and Latha Parameswaran. 2020. Deep Learning based Detection of potholes in Indian roads using YOLO. In 2020 International Conference on Inventive Computation Technologies (ICICT) . 381–385. doi:10.1109/ICICT48043.2020.9112424
-
[7]
Glenn Jocher, Ayush Chaurasia, and Jing Qiu. 2023. Ultralytics YOLOv8. https://github.com/ultralytics/ultralytics
work page 2023
- [8]
-
[9]
Nachuan Ma, Jiahe Fan, Wenshuo Wang, Jin Wu, Yu Jiang, Lihua Xie, and Rui Fan. 2022. Computer vision for road imaging and pothole detection: a state-of-the-art review of systems and algorithms. Transportation Safety and Environment 4, 4 (Nov. 2022). doi:10.1093/tse/tdac026
-
[10]
Mohd Omar and Pradeep Kumar. 2024. PD-ITS: Pothole Detection Using YOLO Variants for Intelligent Transport System. SN Comput. Sci. 5, 5 (May 2024), 16 pages. doi:10.1007/s42979-024-02887-1
-
[11]
OpenStreetMap contributors. 2024. Planet dump retrieved from https://planet.openstreetmap.org. https://www.openstreetmap.org
work page 2024
-
[12]
Alfandino Rasyid, Mochammad Rifki Ulil Albaab, Muhammad Fajrul Falah, Yohanes Yohanie Fridelin Panduman, Alviansyah Arman Yusuf, Dwi Kurnia Basuki, Anang Tjahjono, Rizqi Putri Nourma Budiarti, Sritrusta Sukaridhoto, Firman Yudianto, and Hendro Wicaksono. 2019. Pothole Visual Detection using Machine Learning Method integrated with Internet of Thing Video S...
-
[13]
Joseph Redmon, Santosh Divvala, Ross Girshick, and Ali Farhadi. 2016. You only look once: Unified, real-time object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition . 779–788
work page 2016
-
[14]
Yeganeh Safyari, Masoud Mahdianpari, and Hamid Shiri. 2024. A Review of Vision-Based Pothole Detection Methods Using Computer Vision and Machine Learning. Sensors 24, 17 (2024), 5652. doi:10.3390/s24175652
-
[15]
Amxson Sminage, Delvin P B, Derick Davies, Vivek K J, and Jasmy Davies. 2025. SafeDrive: Intelligent Pothole Detection and Mapping System. In 2025 2nd International Conference on Trends in Engineering Systems and Technologies (ICTEST) , Vol. 1. 1–6. doi:10.1109/ICTEST64710.2025.11042541
-
[16]
Javier Yebes, David Montero, and Ignacio Arriola
J. Javier Yebes, David Montero, and Ignacio Arriola. 2021. Learning to Automatically Catch Potholes in Worldwide Road Scene Images. IEEE Intelligent Transportation Systems Magazine 13, 3 (2021), 192–205. doi:10.1109/mits.2019.2926370
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.