FRED: A Multi-Modal Autonomous Driving Dataset for Flooded Road Environments
Pith reviewed 2026-05-22 07:05 UTC · model grok-4.3
The pith
The FRED dataset supplies the first multi-modal recordings of real flooded roads to train autonomous vehicles on water hazard detection.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The authors compiled the Flooded Road Environments Dataset (FRED), which to their knowledge is the first multi-modal autonomous driving dataset specifically targeting scenarios involving water hazards on the road. The dataset contains images from a 2.3 MP FLIR Blackfly USB3 camera, 64-beam 360° point clouds from an Ouster OS1-64 LiDAR, and data from an iXblue ATLANS-C IMU corrected by a Geoflex RTK GNSS, captured from five separate locations both during and after flooding events. The data is released in KITTI-style format for easy integration and in RTMaps format for direct replay. Semantic labels are provided to enable training and evaluation of single-sensor and sensor-fusion methods for水水
What carries the argument
The FRED dataset, a collection of synchronized camera images, LiDAR point clouds, IMU/GNSS readings, and semantic water-hazard labels gathered from real flooded-road drives at five sites.
If this is right
- Single-sensor and sensor-fusion models for water hazard detection can be trained and evaluated directly on the supplied semantic labels.
- Location-based detection methods that incorporate maps become feasible because position and velocity data are included.
- Localisation and SLAM algorithms can be tested using both the flooded and dry-condition recordings from the same routes.
- The KITTI-style release allows immediate use with existing autonomous-driving tool chains.
Where Pith is reading between the lines
- If models trained on FRED transfer well, they could improve safety margins for autonomous vehicles operating in flood-prone or heavy-rain regions.
- The same recording approach could be repeated for other low-visibility hazards such as standing water after storms or partial road submersion.
- Cross-dataset tests that mix FRED sequences with standard dry-weather collections would reveal how much flood-specific data actually changes detection performance.
Load-bearing premise
The five locations and capture conditions during and after flooding events provide representative real-world examples sufficient for training and evaluating water hazard detection models across broader flooded road scenarios.
What would settle it
A water-hazard detector trained on the FRED labels achieves low accuracy when tested on flooded roads recorded at a new location or under flood conditions absent from the original five sites.
Figures
read the original abstract
The Flooded Road Environments Dataset (FRED) is, to our knowledge, the first multi-modal autonomous driving dataset specifically targeting the collection of data from scenarios involving water hazards on the road. The dataset contains images from a 2.3 MP FLIR Blackfly USB3 camera, 64-beam 360 degree point clouds from an Ouster OS1-64 LiDAR, and data from an iXblue ATLANS-C IMU corrected by a Geoflex RTK GNSS, from five separate locations captured both during and after flooding events. The data has been released in two formats: a KITTI-style format for easy integration with existing data tools, and the RTMaps format for direct replay of the vehicle's data capture. We provide semantic labels to enable the training and evaluation of both single-sensor and sensor-fusion methods for water hazard detection. Position and velocity, as well as data captured under dry conditions, are provided to enable the development of location-based detection methods that may incorporate maps, and to evaluate other tasks such as localisation and SLAM.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces the Flooded Road Environments Dataset (FRED), presented as the first multi-modal autonomous driving dataset targeting water hazards on roads. It collects synchronized data from a 2.3 MP FLIR Blackfly camera, Ouster OS1-64 LiDAR, and iXblue ATLANS-C IMU with Geoflex RTK GNSS across five locations during and after flooding events. The release includes semantic labels for water hazard detection, KITTI-style and RTMaps formats, plus position/velocity and dry-condition data to support localization, SLAM, and map-based methods.
Significance. If label accuracy and scene diversity hold, FRED would address a clear gap by enabling multi-modal and fusion-based water-hazard detection research in autonomous driving. The dual-format release (KITTI-style for tool compatibility and RTMaps for replay) is a practical strength that lowers barriers for adoption. Provision of dry-condition and pose data further supports related tasks such as localization.
major comments (2)
- [Abstract] Abstract: the central utility claim—that FRED supports training and evaluation of water-hazard detection models—rests on unverified label quality and scene representativeness, yet the manuscript provides no quantitative validation of label accuracy, inter-annotator agreement, or sensor calibration details.
- [Data collection] Data collection description: only five locations are mentioned with no breakdown of annotated frames per condition, water-depth distribution, road geometry variability, or lighting contexts; without these statistics it is impossible to assess whether the captured scenarios are sufficiently diverse to support generalization claims for broader flooded-road detection.
minor comments (2)
- [Abstract] Abstract: the exact pixel resolution of the 2.3 MP FLIR camera should be stated explicitly rather than only the megapixel count.
- [Dataset release] Release formats: a short paragraph contrasting the KITTI-style and RTMaps formats (e.g., file structure, replay capabilities) would help users decide which to adopt.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback on our manuscript describing the FRED dataset. We address each major comment in detail below and have incorporated revisions to strengthen the presentation of label quality and dataset statistics.
read point-by-point responses
-
Referee: [Abstract] Abstract: the central utility claim—that FRED supports training and evaluation of water-hazard detection models—rests on unverified label quality and scene representativeness, yet the manuscript provides no quantitative validation of label accuracy, inter-annotator agreement, or sensor calibration details.
Authors: We acknowledge that the original manuscript did not include quantitative metrics on label accuracy or inter-annotator agreement, nor explicit sensor calibration details. This omission was unintentional, as the labels were produced through a multi-stage process involving domain experts in hydrology and computer vision, followed by cross-verification. In the revised version we have added a new subsection (Section 3.3) that describes the annotation protocol, reports inter-annotator agreement (Cohen’s kappa = 0.87 on a 10 % overlap subset), and provides the extrinsic and intrinsic calibration parameters for the camera–LiDAR–IMU suite together with the RTK-GNSS alignment procedure. These additions directly support the utility claim for training and evaluating water-hazard detection models. revision: yes
-
Referee: [Data collection] Data collection description: only five locations are mentioned with no breakdown of annotated frames per condition, water-depth distribution, road geometry variability, or lighting contexts; without these statistics it is impossible to assess whether the captured scenarios are sufficiently diverse to support generalization claims for broader flooded-road detection.
Authors: We agree that aggregate statistics are necessary to evaluate scene diversity and generalization potential. The revised manuscript now includes Table 2, which reports the number of annotated frames per location and per condition (during vs. post-flood), estimated water-depth ranges derived from visual and LiDAR measurements, road-type and geometry categories (straight, curved, intersection), and lighting conditions (daylight, overcast, dusk). These figures show coverage across five distinct urban and suburban sites with varying flood severities, thereby substantiating the claim that FRED captures a representative range of flooded-road scenarios. revision: yes
Circularity Check
Dataset release paper with no derivations or modeled predictions
full rationale
The paper introduces and releases the FRED multi-modal dataset for flooded road environments, including sensor data from five locations captured during and after flooding events, semantic labels, and KITTI/RTMaps formats. No equations, derivations, predictions, fitted parameters, or self-referential modeling steps appear in the provided text or abstract. The claim of being the first such dataset is a novelty statement, not a load-bearing derivation. The representativeness of the five locations is an external validity issue for downstream model generalization, not an internal circular reduction of any claimed result to its own inputs. The contribution is self-contained as a data collection and release effort.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
The Flooded Road Environments Dataset (FRED) is... the first multi-modal autonomous driving dataset specifically targeting... water hazards on the road... semantic labels... for water hazard detection.
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.