Open Multimodal Datasets and Open-Source Software for Data-Driven Modeling of Multiphase Transport and Thermal Systems
Pith reviewed 2026-05-25 05:32 UTC · model grok-4.3
The pith
Open multimodal datasets and software packages support reproducible AI modeling of multiphase transport and thermal systems.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The paper presents an open ecosystem consisting of multimodal NED3 datasets classified by the S+TD framework (0+0D point values through 3+0D volumetric fields and multimodal combinations) together with open-source software packages including BubbleID, SeqReg, CFDTwin, IRISApp, decode-wfs, AELab, and FlowLab; these resources are offered to enable reproducible, AI-enabled modeling of multiphase transport, thermal systems, and related diagnostics.
What carries the argument
The S+TD (spatial-plus-temporal dimensionality) framework that classifies datasets according to the dimensionality of measured or simulated fields, such as 2+1D videos or 3+0D volumetric fields.
If this is right
- Standardized multimodal datasets become available for direct comparison of AI models on boiling, acoustic, and thermographic tasks.
- SeqReg enables nonintrusive estimation of heat flux from 0+1D, 1+1D, or 2+1D time-series or video data.
- CFD-generated fields and design files can be paired with experimental measurements inside surrogate models for digital twins.
- Acoustic-emission and waveform-decoding tools support diagnostics that combine audio and thermal signals.
- Future databanks can be built by linking the released metadata, decoders, and baselines into shared repositories.
Where Pith is reading between the lines
- The S+TD labels could be adopted by other laboratories working with time-resolved imaging or sensor arrays, creating a de-facto standard for dimensionality tagging.
- Integration of the software packages with existing physics-informed neural-network frameworks would allow direct testing of whether the open data improve model generalization.
- Industrial users might test the packages on proprietary thermal systems to check whether the NED3 baselines transfer beyond laboratory conditions.
- The emphasis on waveform decoding and acoustic analysis points toward possible cross-application in non-destructive testing or structural-health monitoring outside thermal fluids.
Load-bearing premise
The released datasets, the S+TD classification, and the listed software packages will prove sufficient to overcome fragmentation and enable community-wide benchmarking and reuse.
What would settle it
Subsequent published studies in multiphase transport or thermal-fluid AI that continue to rely on private or non-interoperable datasets rather than the NED3 releases, or that report no use of the S+TD labels or the listed software packages.
Figures
read the original abstract
Data-driven modeling is becoming central to multiphase transport, electronics cooling, acoustic diagnostics, and thermal-fluid digital twins, but progress is limited by fragmented datasets and raw instrument files that are difficult to decode, reuse, or benchmark. This paper presents an open ecosystem of multimodal datasets and open-source software packages developed by the Nano Energy and Data-Driven Discovery (NED3) Laboratory for reproducible AI-enabled thermal-fluid research. We introduce a spatial-plus-temporal dimensionality framework, denoted S+TD, to classify datasets by the dimensionality of measured or simulated fields, including 0+0D point values, 0+1D time series, 1+0D profiles, 2+0D images, 2+1D videos, 3+0D volumetric fields, and multimodal combinations. We organize public NED3 datasets spanning boiling images, acoustic and thermal measurements, high-speed videos, infrared thermography, thermal-resistance measurements, CFD-generated fields, design files, and acoustic-emission data. We also describe complementary software packages, including BubbleID, SeqReg, CFDTwin, IRISApp, decode-wfs, AELab, and FlowLab, which support computer vision, sequence regression, surrogate modeling, infrared analysis, waveform decoding, acoustic-emission analysis, and multimodal diagnostics. Particular emphasis is placed on SeqReg, a general sequence-regression library for 0+1D, 1+1D, and 2+1D data, with applications such as nonintrusive heat-flux estimation. Finally, we discuss future community efforts to build interoperable thermal-fluid databanks and curated AI/ML tool libraries that connect datasets, metadata, decoders, baselines, benchmarks, and physically interpretable models.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript announces the release of an open ecosystem of multimodal datasets and open-source software packages developed by the NED3 Laboratory to support reproducible, AI-enabled research in multiphase transport, thermal-fluid systems, and related areas. It introduces the S+TD classification framework (0+0D through 3+0D and multimodal) to organize datasets spanning boiling images, acoustic/thermal measurements, high-speed videos, infrared thermography, CFD fields, and acoustic-emission data. Complementary tools including BubbleID, SeqReg (with emphasis on sequence regression for heat-flux estimation), CFDTwin, IRISApp, decode-wfs, AELab, and FlowLab are described, along with a forward-looking discussion of community databanks and benchmarks.
Significance. If the resources are publicly accessible, documented, and maintained as stated, the work supplies concrete, reusable assets that directly address dataset fragmentation in thermal-fluid modeling. The S+TD taxonomy and SeqReg library offer immediate utility for computer-vision and time-series tasks; the overall release lowers barriers to benchmarking and could accelerate physically interpretable ML models in the domain.
minor comments (2)
- Abstract: the statement that the resources 'will be sufficient to overcome fragmentation' is forward-looking; a brief qualifier noting that community adoption remains to be demonstrated would align the abstract more closely with the descriptive nature of the contribution.
- Section describing SeqReg: the applications (e.g., nonintrusive heat-flux estimation) would benefit from one or two concrete performance metrics or baseline comparisons to illustrate the library's practical value beyond its general design.
Simulated Author's Rebuttal
We thank the referee for their positive assessment of the manuscript and for recommending acceptance. The review accurately captures the contributions of the S+TD framework, the multimodal datasets, and the open-source tools, particularly SeqReg.
Circularity Check
No circularity; purely descriptive resource announcement with no derivations or predictions
full rationale
The paper contains no derivation chain, equations, fitted parameters, or predictions. Its central claim is the release and organization of NED3 datasets, the S+TD classification framework (introduced as a descriptive taxonomy), and listed software packages. No step reduces to self-definition, fitted inputs renamed as predictions, or load-bearing self-citations. The forward-looking aspiration about overcoming fragmentation is not a load-bearing premise. This matches the default expectation of no circularity for non-derivational papers.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
NED3 Laboratory, ”Open-Source Software Packages,” Nano Energy and Data-Driven Discov- ery Laboratory, University of Arkansas.https://ned3.uark.edu/software/
-
[2]
NED3 Laboratory, ”Open-Source Datasets,” Nano Energy and Data-Driven Discovery Labo- ratory, University of Arkansas.https://ned3.uark.edu/datasets/ 22
-
[3]
Y. Suh, A. Chandramowlishwaran, and Y. Won, ”Recent progress of artificial intelligence for liquid-vapor phase change heat transfer,” npj Computational Materials, vol. 10, 65, 2024. https://doi.org/10.1038/s41524-024-01223-8
-
[4]
M. Ravichandran and M. Bucci, ”Online, quasi-real-time analysis of high-resolution, infrared, boiling heat transfer investigations using artificial neural networks,” Applied Thermal Engi- neering, vol. 163, 114357, 2019.https://doi.org/10.1016/j.applthermaleng.2019.114357
-
[5]
M. Ravichandran, G. Su, C. Wang, J. H. Seong, A. Kossolapov, B. Phillips, M. M. Rahman, and M. Bucci, ”Decrypting the boiling crisis through data-driven exploration of high-resolution infrared thermometry measurements,” Applied Physics Letters, vol. 118, 253903, 2021.https: //doi.org/10.1063/5.0048391
-
[6]
MIT News, ”Infrared cameras and artificial intelligence provide in- sight into boiling,” July 7, 2021.https://news.mit.edu/2021/ infrared-cameras-artificial-intelligence-provide-insight-into-boiling-0707
work page 2021
-
[7]
L. Zhang, C. Wang, G. Su, A. Kossolapov, G. M. Aguiar, J. H. Seong, F. Chavagnat, B. Phillips, M. M. Rahman, and M. Bucci, ”A unifying criterion of the boiling crisis,” Nature Communications, vol. 14, 2321, 2023.https://doi.org/10.1038/s41467-023-37899-7
-
[8]
M. Ravichandran, A. Kossolapov, G. M. Aguiar, B. Phillips, and M. Bucci, ”Autonomous and online detection of dry areas on a boiling surface using deep learning and infrared thermometry,” Experimental Thermal and Fluid Science, vol. 145, 110879, 2023.https: //doi.org/10.1016/j.expthermflusci.2023.110879
-
[9]
G. M. Hobold and A. K. da Silva, ”Machine learning classification of boiling regimes with low speed, direct and indirect visualization,” International Journal of Heat and Mass Transfer, vol. 125, pp. 1296-1309, 2018.https://doi.org/10.1016/j.ijheatmasstransfer.2018.04.156
-
[10]
G. M. Hobold and A. K. da Silva, ”Visualization-based nucleate boiling heat flux quantification using machine learning,” International Journal of Heat and Mass Transfer, vol. 134, pp. 511- 520, 2019.https://doi.org/10.1016/j.ijheatmasstransfer.2018.12.170
-
[11]
Y. Suh, R. Bostanabad, and Y. Won, ”Deep learning predicts boiling heat transfer,” Scientific Reports, vol. 11, 5622, 2021.https://doi.org/10.1038/s41598-021-85150-4
-
[12]
C.-N. Huang, S. Chang, Y. Suh, I. Mudawar, Y. Won, and C. R. Kharangate, ”Machine learning boiling prediction: From autonomous vision of flow visualization data to performance parameter theoretical modeling,” International Journal of Multiphase Flow, vol. 179, 104928, 2024.https://doi.org/10.1016/j.ijmultiphaseflow.2024.104928
-
[13]
D. Garg, A. Bard, Y. Qiu, C. R. Kharangate, and R. H. French, ”Machine learning algorithms to predict flow boiling pressure drop in mini/micro-channels based on universal consolidated data,” International Journal of Heat and Mass Transfer, vol. 178, 121607, 2021.https://doi. org/10.1016/j.ijheatmasstransfer.2021.121607
-
[14]
A. Bard, Y. Qiu, C. R. Kharangate, and R. H. French, ”Consolidated modeling and prediction of heat transfer coefficients for saturated flow boiling in mini/micro-channels using machine learning methods,” Applied Thermal Engineering, 118305, 2022.https://doi.org/10.1016/ j.applthermaleng.2022.118305 23
-
[15]
ywflow, ”BubMask: Mask R-CNN for Bubble mask extraction,” GitHub repository.https: //github.com/ywflow/BubMask
-
[16]
Y. Kim and H. Park, ”Deep learning-based automated and universal bubble detection and mask extraction in complex two-phase flows,” Scientific Reports, vol. 11, 8940, 2021.https: //doi.org/10.1038/s41598-021-88334-0
- [17]
-
[18]
Dunlap and contributors, ”BubbleID,” GitHub repository.https://github.com/ cldunlap73/BubbleID
C. Dunlap and contributors, ”BubbleID,” GitHub repository.https://github.com/ cldunlap73/BubbleID
- [19]
- [20]
-
[21]
Y. Suh, S. Chang, P. Simadiris, T. B. Inouye, M. J. Hoque, S. Khodakarami, C. R. Kharangate, N. Miljkovic, and Y. Won, ”VISION-iT: A Framework for Digitizing Bubbles and Droplets,” Energy and AI, vol. 15, 100309, 2024.https://doi.org/10.1016/j.egyai.2023.100309
- [22]
-
[23]
S. Chang, S. Arani, N. S. Nuthalapati, Y. Suh, N. Choi, S. Khodakarami, M. R. H. Roni, N. Miljkovic, A. Chandramowlishwaran, and Y. Won, ”EventFlow: Real-time neuromorphic event-driven classification of two-phase boiling flow regimes,” Droplet, 2026.https://doi. org/10.1002/dro2.70066
-
[24]
K. N. R. Sinha, V. Kumar, N. Kumar, A. Thakur, and R. Raj, ”Deep learning the sound of boiling for advance prediction of boiling crisis,” Cell Reports Physical Science, vol. 2, 100382, 2021.https://doi.org/10.1016/j.xcrp.2021.100382
-
[25]
R. Raj, ”Dataset: Deep learning the sound of boiling for advance prediction of boiling crisis,” Mendeley Data, Version 1, 2021.https://doi.org/10.17632/vnnzc7n97m.1
-
[26]
K. N. R. Sinha, V. Kumar, N. Kumar, A. Thakur, and R. Raj, ”Dataset for boiling acoustic emissions: A tool for data driven boiling regime prediction,” Data in Brief, vol. 52, 109793, 2024.https://doi.org/10.1016/j.dib.2023.109793
-
[27]
B. Suriyaprasaad, A. Upadhyay, A. Thakur, and R. Raj, ”A roadmap for decoding the sound of boiling,” npj Thermal Science and Engineering, vol. 1, 2, 2026.https://doi.org/10.1038/ s44435-025-00004-z
work page 2026
- [28]
-
[29]
C. Dunlap, C. Li, H. Pandey, and H. Hu, ”Data for: Hit2Flux: A machine learning framework for boiling heat flux prediction using hit-based acoustic emission sensing,” Dryad, 2025.https: //doi.org/10.5061/dryad.g79cnp628
-
[30]
S. M. S. Hassan, A. Feeney, A. Dhruv, J. Kim, Y. Suh, J. Ryu, Y. Won, and A. Chan- dramowlishwaran, ”BubbleML: A Multi-Physics Dataset and Benchmarks for Machine Learn- ing,” arXiv:2307.14623, 2023.https://doi.org/10.48550/arXiv.2307.14623
-
[31]
HPCForge, ”BubbleML: A multiphase multiphysics dataset and benchmarks for scientific ma- chine learning,” GitHub repository.https://github.com/HPCForge/BubbleML
-
[32]
H. Zhai, Q. Zhou, and G. Hu, ”Predicting micro-bubble dynamics with semi-physics-informed deep learning,” AIP Advances, vol. 12, 035153, 2022.https://doi.org/10.1063/5.0079602
-
[33]
comp-physics, ”bubble-dynamics-resnet: Integrate bubble dynamics faster,” GitHub reposi- tory.https://github.com/comp-physics/bubble-dynamics-resnet
-
[34]
https://github.com/BerryWei/Bubble-dynamic
BerryWei, ”Bubble-dynamic: Rayleigh Plesset & Keller Miksis equation,” GitHub repository. https://github.com/BerryWei/Bubble-dynamic
-
[35]
kozakaron, ”Bubble dynamics simulation,” GitHub repository.https://github.com/ kozakaron/Bubble_dynamics_simulation
-
[36]
J. E. Taylor, ”bubblemask: Python package for applying Gaussian Bubbles masks to image stimuli,” GitHub repository.https://github.com/JackEdTaylor/bubblemask
- [37]
-
[38]
H. Pandey and H. Hu, ”Pool Boiling Dataset: Synchronized Acoustic, Optical, and Thermal Measurements for Steady-State and Transient Heat Loads,” Harvard Dataverse, 2024.https: //doi.org/10.7910/DVN/6GLGC6
- [39]
-
[40]
H. Pandey, C. Li, C. Dunlap, and H. Hu, ”Data from: Unveiling hysteresis of transient boiling: A multimodal perspective,” Dryad, 2025.https://doi.org/10.5061/dryad.ksn02v7h2
-
[41]
C. Dunlap, H. Pandey, E. Weems, and H. Hu, ”Nonintrusive Heat Flux Quantification Using Acoustic Emissions During Pool Boiling,” Applied Thermal Engineering, 228, 120558, 2023. https://doi.org/10.1016/j.applthermaleng.2023.120558
-
[42]
C. Dunlap, C. Li, H. Pandey, Y. Sun, and H. Hu, ”A Temporal-Spatial Framework for Effi- cient Heat Flux Monitoring of Transient Boiling,” IEEE Transactions on Instrumentation and Measurement, 2024.https://doi.org/10.1109/TIM.2024.3460944
-
[43]
C. Dunlap, C. Li, H. Pandey, and H. Hu, ”Hit2Flux: A Machine Learning Framework for Boiling Heat Flux Prediction Using Hit-Based Acoustic Emission Sensing,” AI Thermal Fluids, 1, 100002, 2025.https://doi.org/10.1016/j.aitf.2025.100002 25
- [44]
-
[45]
D. Curl and H. Hu, ”Physically Interpretable Surrogate Modeling of Thermal Fields in Elec- tronics Cooling Using Combined Proper Orthogonal Decomposition and Neural Networks,” AI Thermal Fluids, 2026
work page 2026
-
[46]
H. Pandey, S. Pierson, C. Li, C. Dunlap, and H. Hu, ”Data from: Two-phase immersion cooler for medium-voltage silicon carbide MOSFETs,” Dryad pre-publication sharing record, 2025. https://doi.org/10.5061/dryad.k98sf7mk9
-
[47]
Dunlap and contributors, ”SeqReg,” GitHub repository.https://github.com/ cldunlap73/SeqReg
C. Dunlap and contributors, ”SeqReg,” GitHub repository.https://github.com/ cldunlap73/SeqReg
-
[48]
com/UARK-NED3/Data-Center-Cooling-Research-Tools
UARK-NED3, ”Data-Center-Cooling-Research-Tools,” GitHub repository.https://github. com/UARK-NED3/Data-Center-Cooling-Research-Tools
-
[49]
MultiphaseHUB, ”Datahub for Multiphase Transport.”https://www.multiphasehub.org/
-
[50]
com/UARK-NED3/Bubble-Dynamics-Research-Tools 26
UARK-NED3, ”Bubble-Dynamics-Research-Tools,” GitHub repository.https://github. com/UARK-NED3/Bubble-Dynamics-Research-Tools 26
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.