Enabling Predictive Maintenance in District Heating Substations: A Labelled Dataset and Fault Detection Evaluation Framework based on Service Data

Anna Cadenbach; Cyriana M.A. Roelofs; Edison Guevara Bastidas; Stefan Faulstich; Thomas Hugo

arxiv: 2511.14791 · v2 · submitted 2025-11-14 · 💻 cs.SE · cs.AI

Enabling Predictive Maintenance in District Heating Substations: A Labelled Dataset and Fault Detection Evaluation Framework based on Service Data

Cyriana M.A. Roelofs , Edison Guevara Bastidas , Thomas Hugo , Stefan Faulstich , Anna Cadenbach This is my paper

Pith reviewed 2026-05-17 22:30 UTC · model grok-4.3

classification 💻 cs.SE cs.AI

keywords district heatingfault detectionpredictive maintenancelabelled datasetanomaly detectionenergy systemssubstations

0 comments

The pith

Public dataset and framework enable early fault detection in district heating substations days before customer reports.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper introduces an open dataset of operational time series from 93 district heating substations, annotated using service reports for faults and normal events. It pairs this with an evaluation framework that assesses models on their ability to recognize normal behavior accurately, detect faults reliably with few false positives, and identify issues early. Baseline results using an open-source anomaly detection tool show strong performance, including the detection of over half the faults with several days of lead time. By making the data, code, and metrics public, the work sets up a benchmark for developing predictive maintenance systems that can improve efficiency by lowering return temperatures.

Core claim

The paper establishes a public labelled dataset from district heating substations across two manufacturers, including time series data annotated with fault disturbances, maintenance actions, normal-event examples, and detailed metadata. It defines an evaluation approach based on normal-behaviour accuracy, eventwise F-score, and earliness metrics, and provides baseline results from the EnergyFaultDetector tool that reach 0.98 accuracy and 0.83 F-score while detecting 60% of faults with an average lead time of 3 to 5 days prior to customer reports. The framework also incorporates root cause analysis via feature attribution to help interpret anomalies.

What carries the argument

The labelled dataset of operational time series validated against service reports, combined with the EnergyFaultDetector for anomaly detection and ARCANA for feature attribution in autoencoders.

If this is right

Consistent benchmarks become possible for comparing different fault detection methods in district heating.
Operators gain tools to interpret anomalies and identify root causes of faults.
Early detection supports actions that reduce return temperatures and improve system efficiency.
Reproducible development of predictive maintenance methods is facilitated for energy systems.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Similar labelled datasets could be created for other types of energy infrastructure to broaden predictive maintenance applications.
Combining this approach with additional sensor data or weather information might further increase the lead time for fault detection.
The framework could serve as a template for standardizing evaluations in related industrial anomaly detection tasks.

Load-bearing premise

Service reports provide accurate, complete, and timely ground-truth labels for faults and normal events that align with the recorded operational time series.

What would settle it

A study that cross-validates a sample of the service report labels against independent on-site inspections or sensor verifications would falsify the framework if many labels are found to be incorrect or incomplete.

Figures

Figures reproduced from arXiv: 2511.14791 by Anna Cadenbach, Cyriana M.A. Roelofs, Edison Guevara Bastidas, Stefan Faulstich, Thomas Hugo.

**Figure 2.** Figure 2: Fault label counts for incident reports of manufacturer 1, showing their cate [PITH_FULL_IMAGE:figures/full_fig_p011_2.png] view at source ↗

**Figure 3.** Figure 3: Fault label counts for incident reports of manufacturer 2, showing their cate [PITH_FULL_IMAGE:figures/full_fig_p012_3.png] view at source ↗

**Figure 4.** Figure 4: Schematic timeline, illustrating detection delay [PITH_FULL_IMAGE:figures/full_fig_p016_4.png] view at source ↗

**Figure 5.** Figure 5: Results for the three models for the M1 dataset [PITH_FULL_IMAGE:figures/full_fig_p019_5.png] view at source ↗

**Figure 6.** Figure 6: Results for the three models for the M2 dataset. of 310 days of training data is available for anomalous events, whereas for normal events an average of 704 days is available. For M1, the averages are 588 for anomalies and 576 days for normal events. Although accounting for day-of-year can help, short training periods mean that detections may reflect seasonal changes rather than faults. An overview of Accu… view at source ↗

**Figure 7.** Figure 7: Comparison of the Earliness score of the three models for both datasets between [PITH_FULL_IMAGE:figures/full_fig_p021_7.png] view at source ↗

**Figure 8.** Figure 8: Comparison of the Reliability score (eventwise [PITH_FULL_IMAGE:figures/full_fig_p021_8.png] view at source ↗

**Figure 9.** Figure 9: Criticality of the three model variants in the 7 days preceding an incident report [PITH_FULL_IMAGE:figures/full_fig_p023_9.png] view at source ↗

**Figure 10.** Figure 10: Top-3 deviating features for example use-case 1 according to the conditional [PITH_FULL_IMAGE:figures/full_fig_p024_10.png] view at source ↗

**Figure 11.** Figure 11: Behaviour of the secondary supply temperature changes after the training [PITH_FULL_IMAGE:figures/full_fig_p024_11.png] view at source ↗

**Figure 12.** Figure 12: Criticality of the three model variants in the 7 days preceding an incident report [PITH_FULL_IMAGE:figures/full_fig_p025_12.png] view at source ↗

**Figure 13.** Figure 13: Top-3 deviating features for example use-case 2 according to the conditional [PITH_FULL_IMAGE:figures/full_fig_p025_13.png] view at source ↗

**Figure 14.** Figure 14: Criticality of the three model variants in the 7 days preceding an incident report [PITH_FULL_IMAGE:figures/full_fig_p026_14.png] view at source ↗

**Figure 15.** Figure 15: Top-3 deviating features for example use-case 3 according to the conditional [PITH_FULL_IMAGE:figures/full_fig_p027_15.png] view at source ↗

read the original abstract

Early detection of faults in district heating substations is imperative to reduce return temperatures and enhance efficiency. However, progress in this domain has been hindered by the limited availability of public, labelled datasets. We present an open-source framework combining a service report validated public dataset, an evaluation method based on accuracy, reliability, and earliness, and baseline results implemented with EnergyFaultDetector, an open-source Python framework developed for automated anomaly detection in operational data from energy systems. The dataset contains time series of operational data from 93 substations across two manufacturers, annotated with a list of disturbances due to faults and maintenance actions, a set of normal-event examples and detailed fault metadata. We evaluate the EnergyFaultDetector using three metrics: accuracy for recognising normal behaviour, an eventwise F-score for reliable fault detection with few false alarms, and earliness for early detection. The framework also supports root cause analysis using ARCANA, a feature-attribution method for autoencoders. We demonstrate three use cases to assist operators in interpreting anomalies and identifying underlying faults. The models achieve high normal-behaviour accuracy (0.98) and eventwise F-score (beta = 0.5) of 0.83 and could detect 60% of the faults in the dataset before the customer reported a problem, with an average lead time of 3 to 5 days. Integrating an open dataset, metrics, open-source code, and baselines establishes a reproducible, fault-centric benchmark with operationally meaningful evaluation, enabling consistent comparison and development of early fault detection and diagnosis methods for district heating substations.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper's core contribution is a new public labelled dataset from 93 district heating substations drawn from service reports, with baseline fault detection results that look usable but rest on unverified label timing.

read the letter

The main takeaway is that this work releases operational time series from 93 substations across two manufacturers, annotated with faults, maintenance actions, normal events, and metadata, plus an open evaluation framework and some baseline numbers from their EnergyFaultDetector code. That directly tackles the data scarcity the abstract flags for predictive maintenance in district heating. The metrics they chose—normal-behaviour accuracy, eventwise F-score with beta 0.5, and earliness—make sense for an operational setting where false alarms cost money and early warning matters. Reporting 0.98 accuracy, 0.83 F-score, and 60% of faults caught 3–5 days before customer reports gives a concrete starting point others can compare against. Open-sourcing the code and including a feature-attribution method like ARCANA for root cause analysis adds practical value for people who want to inspect anomalies rather than just get a score. The dataset itself is the part that could see reuse. The soft spot is the ground truth. Everything hinges on service reports supplying accurate start times and complete coverage, yet the paper does not appear to describe independent checks against raw sensor patterns or timing offsets. If reports lag the first visible deviation in the traces or only capture customer-noticed issues, both the lead-time numbers and the eventwise F-score become harder to trust. Details on cleaning steps, train-test splits, and imbalance handling are also thin in the sections I checked, which leaves room for post-hoc choices to influence the reported figures. This is for researchers building or benchmarking anomaly detection methods in energy systems who need labelled industrial data rather than synthetic cases. A reader working on district heating efficiency or time-series fault detection would get immediate use from the released traces and labels. It deserves peer review because the dataset release is a real resource the field lacks, even if the evaluation needs tighter documentation on label validation to stand up to scrutiny.

Referee Report

2 major / 2 minor

Summary. The manuscript presents a publicly released labelled dataset of operational time series from 93 district heating substations (two manufacturers), annotated via service reports with faults, maintenance actions, and normal-event examples. It introduces an evaluation framework using three metrics—normal-behaviour accuracy, eventwise F-score (beta = 0.5), and earliness—and reports baseline results from the open-source EnergyFaultDetector: 0.98 accuracy, 0.83 F-score, and detection of 60 % of faults prior to customer reports with 3–5 days average lead time. The work also demonstrates root-cause analysis via the ARCANA feature-attribution method and three operator-oriented use cases.

Significance. Release of a real-world, service-report-validated dataset together with reproducible code and an operationally oriented evaluation protocol would fill a documented gap in public benchmarks for fault detection in district heating. If the label-validation and split procedures are documented, the concrete performance numbers and earliness results could serve as a reproducible reference point for subsequent methods.

major comments (2)

[Dataset annotation and evaluation metrics] The earliness metric and the claim that 60 % of faults are detected before customer reports (3–5 days lead time) rest on the unverified assumption that service-report timestamps accurately reflect fault onset in the sensor time series. No cross-checks against raw data patterns, independent verification, or quantification of reporting lag are described; this directly undermines the reliability of both the eventwise F-score (0.83) and the pre-report detection percentage.
[Experimental setup and baseline evaluation] The experimental section provides no information on data-cleaning steps, train–test split strategy, handling of class imbalance, or selection criteria for the normal-event examples. These choices are load-bearing for the reported 0.98 normal-behaviour accuracy and 0.83 F-score; without them the numerical results cannot be reproduced or interpreted.

minor comments (2)

[Results] Clarify whether the 93 substations are partitioned by manufacturer in the reported metrics or whether results are aggregated; this affects generalisability claims.
[Dataset description] The abstract states 'detailed fault metadata'; the manuscript should explicitly list the metadata fields and show how they are used by ARCANA for root-cause analysis.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments that highlight important aspects of reproducibility and metric reliability. We address each major comment below and will incorporate clarifications and additional details in the revised manuscript.

read point-by-point responses

Referee: [Dataset annotation and evaluation metrics] The earliness metric and the claim that 60 % of faults are detected before customer reports (3–5 days lead time) rest on the unverified assumption that service-report timestamps accurately reflect fault onset in the sensor time series. No cross-checks against raw data patterns, independent verification, or quantification of reporting lag are described; this directly undermines the reliability of both the eventwise F-score (0.83) and the pre-report detection percentage.

Authors: We agree that the earliness results depend on service-report timestamps as proxies for fault onset and that the manuscript does not describe explicit cross-checks against raw sensor patterns or quantify reporting lag. The annotation process is based on operator service reports that record the date and nature of reported issues, which we treat as the operational ground truth. In the revision we will add a new subsection on label provenance that discusses potential delays between fault occurrence and customer reporting, provides any available metadata on report timing, and includes illustrative examples of how report timestamps were aligned with the time-series data. We will also explicitly note this as a limitation of the current dataset. revision: yes
Referee: [Experimental setup and baseline evaluation] The experimental section provides no information on data-cleaning steps, train–test split strategy, handling of class imbalance, or selection criteria for the normal-event examples. These choices are load-bearing for the reported 0.98 normal-behaviour accuracy and 0.83 F-score; without them the numerical results cannot be reproduced or interpreted.

Authors: We concur that the current experimental description lacks the necessary implementation details for reproducibility. The manuscript presents the overall evaluation framework and baseline numbers but omits concrete choices regarding preprocessing, splitting, and example selection. In the revised version we will expand the relevant sections to document: (i) data-cleaning procedures (missing-value handling, outlier detection, and resampling), (ii) the train–test partitioning strategy (including whether splits are performed at the substation or temporal level), (iii) any techniques used to address class imbalance, and (iv) the explicit criteria applied when selecting normal-event examples. These additions will allow independent reproduction of the reported accuracy and eventwise F-score. revision: yes

Circularity Check

0 steps flagged

Empirical dataset release and benchmark evaluation with no circular derivation

full rationale

The paper releases a labelled dataset of operational time series from 93 substations annotated via service reports, then reports baseline performance of an open-source anomaly detection framework using standard metrics (normal-behaviour accuracy, eventwise F-score, and earliness). These metrics are computed by direct comparison of model outputs against the external service-report labels; no equations, fitted parameters, or self-citation chains reduce the reported numbers (0.98 accuracy, 0.83 F-score, 60 % pre-report detections) to quantities defined inside the paper itself. The contribution is therefore self-contained as an empirical benchmark rather than a closed derivation.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claims rest on the assumption that service reports constitute reliable ground truth for fault timing and type; no free parameters or invented entities are introduced in the abstract.

axioms (1)

domain assumption Service reports accurately reflect the occurrence, timing, and nature of faults and maintenance actions in the operational time series.
All labels in the dataset are derived from these reports, making this assumption load-bearing for the claimed performance numbers.

pith-pipeline@v0.9.0 · 5613 in / 1252 out tokens · 32110 ms · 2026-05-17T22:30:03.328718+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Foundation/RealityFromDistinction.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

The EnergyFaultDetector uses an AE as NBM... anomaly score... Mahalanobis distance... criticality counter C... eventwise F0.5... Earliness E = max(0, min(1, (treport - tdetect)/W))

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

26 extracted references · 26 canonical work pages

[1]

Lund, Renewable heating strategies and their consequences for stor- age and grid infrastructures comparing a smart grid to a smart energy systems approach, Energy 151 (2018) 94–102

H. Lund, Renewable heating strategies and their consequences for stor- age and grid infrastructures comparing a smart grid to a smart energy systems approach, Energy 151 (2018) 94–102. doi:https://doi.org/ 10.1016/j.energy.2018.03.010

work page doi:10.1016/j.energy.2018.03.010 2018
[2]

H. Lund, S. Werner, R. Wiltshire, S. Svendsen, J. E. Thorsen, F. Hvelplund, B. V. Mathiesen, 4th generation district heating (4gdh), Energy 68 (2014) 1–11. doi:10.1016/j.energy.2014.02.089

work page doi:10.1016/j.energy.2014.02.089 2014
[3]

H. Gadd, S. Werner, Achieving low return temperatures from district heating substations, Applied Energy 136 (2014) 59–67. URL:https: //linkinghub.elsevier.com/retrieve/pii/S0306261914009696. doi:10.1016/j.apenergy.2014.09.022

work page doi:10.1016/j.apenergy.2014.09.022 2014
[4]

Autor:innen: Dr

Agora Energiewende, Prognos, GEF, Wärmenetze – klimaneutral, wirtschaftlich und bezahlbar, Technical Report 335/07-S-2024/DE, Agora Energiewende; Prognos; GEF, 2024. Autor:innen: Dr. Noha Saad; Nils Thamling; Mohammad Alkasabreh (Prognos); Susanne Ochse (GEF). Letzte Überarbeitung: 30. Dezember 2024. Projekt: Wärmenetze: klimaneutral, wirtschaftlich und bezahlbar

work page 2024
[5]

Neumayer, D

M. Neumayer, D. Stecher, S. Grimm, A. Maier, D. Bücker, J. Schmidt, Fault and anomaly detection in district heating sub- stations: A survey on methodology and data sets, Energy 276 (2023) 127569. URL:https://linkinghub.elsevier.com/retrieve/ pii/S0360544223009635. doi:10.1016/j.energy.2023.127569

work page doi:10.1016/j.energy.2023.127569 2023
[6]

Månsson, I

S. Månsson, I. Lundholm Benzi, M. Thern, R. Salenbien, K. Sernhed, P.- O. Johansson Kallioniemi, A taxonomy for labeling deviations in district heating customer data, Smart Energy 2 (2021) 100020. doi:https:// doi.org/10.1016/j.segy.2021.100020. 27

work page doi:10.1016/j.segy.2021.100020 2021
[7]

Månsson, P.-O

S. Månsson, P.-O. Johansson Kallioniemi, M. Thern, T. Van Oevelen, K. Sernhed, Faults in district heating customer installations and ways to approach them: Experiences from swedish utilities, Energy 180 (2019) 163–174. doi:https://doi.org/10.1016/j.energy.2019.04.220

work page doi:10.1016/j.energy.2019.04.220 2019
[8]

H. Gadd, S. Werner, Fault detection in district heating sub- stations, Applied Energy 157 (2015) 51–59. URL:https: //linkinghub.elsevier.com/retrieve/pii/S0306261915009010. doi:10.1016/j.apenergy.2015.07.061

work page doi:10.1016/j.apenergy.2015.07.061 2015
[9]

doi:https://doi.org/10.1016/ j.energy.2022.123529

D.S.Østergaard, K.M.Smith, M.Tunzi, S.Svendsen, Low-temperature operation of heating systems to enable 4th generation district heating: A review, Energy 248 (2022) 123529. doi:https://doi.org/10.1016/ j.energy.2022.123529

work page arXiv 2022
[10]

Leoni, R

P. Leoni, R. Geyer, R.-R. Schmidt, Developing innovative busi- ness models for reducing return temperatures in district heat- ing systems: Approach and first results, Energy 195 (2020) 116963. URL:https://linkinghub.elsevier.com/retrieve/pii/ S0360544220300700. doi:10.1016/j.energy.2020.116963

work page doi:10.1016/j.energy.2020.116963 2020
[11]

Guevara Bastidas, S

E. Guevara Bastidas, S. Faulstich, H. Dittmer, M. Neumayer, G. S. Mohan, K. Sercan-Calismaz, F. Hosenfelder, T. Gle- newinkel, K. Fischer-Florschütz, A. Cadenbach, Prioritisa- tion of faults in district heating substations: Towards predic- tive maintenance and optimised operation, Energy 333 (2025) 137210. URL:https://linkinghub.elsevier.com/retrieve/pii/...

work page doi:10.1016/j.energy.2025.137210 2025
[12]

Van Dreven, V

J. Van Dreven, V. Boeva, S. Abghari, H. Grahn, J. Al Koussa, E. Mo- toasca, Intelligent Approaches to Fault Detection and Diagnosis in Dis- trict Heating: Current Trends, Challenges, and Opportunities, Elec- tronics 12 (2023) 1448. URL:https://www.mdpi.com/2079-9292/12/ 6/1448. doi:10.3390/electronics12061448

work page doi:10.3390/electronics12061448 2023
[13]

C. Gück, C. M. A. Roelofs, S. Faulstich, CARE to Compare: A Real- World Benchmark Dataset for Early Fault Detection in Wind Turbine Data, Data 9 (2024) 138. URL:https://www.mdpi.com/2306-5729/ 9/12/138. doi:10.3390/data9120138. 28

work page doi:10.3390/data9120138 2024
[14]

Månsson, K

S. Månsson, K. Davidsson, P. Lauenburg, M. Thern, Automated Sta- tistical Methods for Fault Detection in District Heating Customer In- stallations, Energies 12 (2018) 113. URL:https://www.mdpi.com/ 1996-1073/12/1/113. doi:10.3390/en12010113

work page doi:10.3390/en12010113 2018
[15]

Theusch, P

F. Theusch, P. Klein, R. Bergmann, W. Wilke, W. Bock, A. We- ber, Fault Detection and Condition Monitoring in District Heat- ing Using Smart Meter Data, PHM Society European Conference 6 (2021) 11. URL:https://papers.phmsociety.org/index.php/phme/ article/view/2786. doi:10.36001/phme.2021.v6i1.2786

work page doi:10.36001/phme.2021.v6i1.2786 2021
[16]

Leiria, K

D. Leiria, K. H. Andersen, S. P. Melgaard, H. Johra, A. Marszal- Pomianowska, M. S. Piscitelli, A. Capozzoli, M. Z. Pomianowski, To- wards automated fault detection and diagnosis in district heating cus- tomers: generation and analysis of a labeled dataset with ground truth, in: ProceedingsofBuildingSimulation2023: 18thConferenceofIBPSA, 2023, pp. 3615 – ...

work page doi:10.26868/25222708.2023 2023
[17]

Calikus, S

E. Calikus, S. Nowaczyk, A. Sant’Anna, H. Gadd, S. Werner, A data-driven approach for discovering heat load patterns in dis- trict heating, Applied Energy 252 (2019) 113409. URL:https: //linkinghub.elsevier.com/retrieve/pii/S0306261919310839. doi:10.1016/j.apenergy.2019.113409

work page doi:10.1016/j.apenergy.2019.113409 2019
[18]

Simulation-based testing to improve safety of autonomous robots,

S. Abghari, V. Boeva, J. Brage, C. Johansson, H. Grahn, N. Laves- son, Higher Order Mining for Monitoring District Heating Substa- tions, in: 2019 IEEE International Conference on Data Science and Advanced Analytics (DSAA), IEEE, Washington, DC, USA, 2019, pp. 382–391. URL:https://ieeexplore.ieee.org/document/8964173/. doi:10.1109/DSAA.2019.00053

work page doi:10.1109/dsaa.2019.00053 2019
[19]

Farouq, S

S. Farouq, S. Byttner, M.-R. Bouguelia, N. Nord, H. Gadd, Large-scale monitoring of operationally diverse district heating substations: A reference-group based approach, Engineering Ap- plications of Artificial Intelligence 90 (2020) 103492. URL:https: //linkinghub.elsevier.com/retrieve/pii/S0952197620300117. doi:10.1016/j.engappai.2020.103492. 29

work page doi:10.1016/j.engappai.2020.103492 2020
[20]

Zhang, H

F. Zhang, H. Fleyeh, Anomaly Detection of Heat Energy Usage in Dis- trict Heating Substations Using LSTM based Variational Autoencoder Combined with Physical Model, in: 2020 15th IEEE Conference on Industrial Electronics and Applications (ICIEA), IEEE, Kristiansand, Norway, 2020, pp. 153–158. URL:https://ieeexplore.ieee.org/ document/9248108/. doi:10.1109...

work page doi:10.1109/iciea48937.2020.9248108 2020
[21]

Y. Choi, S. Yoon, Autoencoder-driven fault detection and di- agnosis in building automation systems: Residual-based and la- tent space-based approaches, Building and Environment 203 (2021) 108066. URL:https://linkinghub.elsevier.com/retrieve/ pii/S0360132321004686. doi:10.1016/j.buildenv.2021.108066

work page doi:10.1016/j.buildenv.2021.108066 2021
[22]

Vallée, T

M. Vallée, T. Wissocq, Y. Gaoua, N. Lamaison, Genera- tion and evaluation of a synthetic dataset to improve fault de- tection in district heating and cooling systems, Energy 283 (2023) 128387. URL:https://linkinghub.elsevier.com/retrieve/ pii/S0360544223017814. doi:10.1016/j.energy.2023.128387

work page doi:10.1016/j.energy.2023.128387 2023
[23]

Vallée, T

M. Vallée, T. Wissocq, N. Lamaison, Kaggle Dataset: Fault Detection and Diagnosis in District Heating, 2024. URL:https://www.kaggle. com/datasets/mathieuvallee/ai-dhc/data

work page 2024
[24]

Van Dreven, V

J. Van Dreven, V. Boeva, S. Abghari, H. Grahn, J. Al Koussa, A systematic approach for data generation for intelligent fault de- tection and diagnosis in District Heating, Energy 307 (2024) 132711. URL:https://linkinghub.elsevier.com/retrieve/pii/ S036054422402485X. doi:10.1016/j.energy.2024.132711

work page doi:10.1016/j.energy.2024.132711 2024
[25]

Stecher, L

D. Stecher, L. Ziegltrum, P. Reiprich, C. Fuchs, A. Maier, J. Schmidt, Neural network synthetic dataset generation for fault detection in district heating substations, Smart Energy 20 (2025) 100206. URL:https://linkinghub.elsevier.com/retrieve/ pii/S2666955225000346. doi:10.1016/j.segy.2025.100206

work page doi:10.1016/j.segy.2025.100206 2025
[26]

C. M. Roelofs, M.-A. Lutz, S. Faulstich, S. Vogt, Autoencoder-based anomaly root cause analysis for wind turbines, Energy and AI 4 (2021) 100065. URL:https://linkinghub.elsevier.com/retrieve/ pii/S2666546821000197. doi:10.1016/j.egyai.2021.100065. 30

work page doi:10.1016/j.egyai.2021.100065 2021

[1] [1]

Lund, Renewable heating strategies and their consequences for stor- age and grid infrastructures comparing a smart grid to a smart energy systems approach, Energy 151 (2018) 94–102

H. Lund, Renewable heating strategies and their consequences for stor- age and grid infrastructures comparing a smart grid to a smart energy systems approach, Energy 151 (2018) 94–102. doi:https://doi.org/ 10.1016/j.energy.2018.03.010

work page doi:10.1016/j.energy.2018.03.010 2018

[2] [2]

H. Lund, S. Werner, R. Wiltshire, S. Svendsen, J. E. Thorsen, F. Hvelplund, B. V. Mathiesen, 4th generation district heating (4gdh), Energy 68 (2014) 1–11. doi:10.1016/j.energy.2014.02.089

work page doi:10.1016/j.energy.2014.02.089 2014

[3] [3]

H. Gadd, S. Werner, Achieving low return temperatures from district heating substations, Applied Energy 136 (2014) 59–67. URL:https: //linkinghub.elsevier.com/retrieve/pii/S0306261914009696. doi:10.1016/j.apenergy.2014.09.022

work page doi:10.1016/j.apenergy.2014.09.022 2014

[4] [4]

Autor:innen: Dr

Agora Energiewende, Prognos, GEF, Wärmenetze – klimaneutral, wirtschaftlich und bezahlbar, Technical Report 335/07-S-2024/DE, Agora Energiewende; Prognos; GEF, 2024. Autor:innen: Dr. Noha Saad; Nils Thamling; Mohammad Alkasabreh (Prognos); Susanne Ochse (GEF). Letzte Überarbeitung: 30. Dezember 2024. Projekt: Wärmenetze: klimaneutral, wirtschaftlich und bezahlbar

work page 2024

[5] [5]

Neumayer, D

M. Neumayer, D. Stecher, S. Grimm, A. Maier, D. Bücker, J. Schmidt, Fault and anomaly detection in district heating sub- stations: A survey on methodology and data sets, Energy 276 (2023) 127569. URL:https://linkinghub.elsevier.com/retrieve/ pii/S0360544223009635. doi:10.1016/j.energy.2023.127569

work page doi:10.1016/j.energy.2023.127569 2023

[6] [6]

Månsson, I

S. Månsson, I. Lundholm Benzi, M. Thern, R. Salenbien, K. Sernhed, P.- O. Johansson Kallioniemi, A taxonomy for labeling deviations in district heating customer data, Smart Energy 2 (2021) 100020. doi:https:// doi.org/10.1016/j.segy.2021.100020. 27

work page doi:10.1016/j.segy.2021.100020 2021

[7] [7]

Månsson, P.-O

S. Månsson, P.-O. Johansson Kallioniemi, M. Thern, T. Van Oevelen, K. Sernhed, Faults in district heating customer installations and ways to approach them: Experiences from swedish utilities, Energy 180 (2019) 163–174. doi:https://doi.org/10.1016/j.energy.2019.04.220

work page doi:10.1016/j.energy.2019.04.220 2019

[8] [8]

H. Gadd, S. Werner, Fault detection in district heating sub- stations, Applied Energy 157 (2015) 51–59. URL:https: //linkinghub.elsevier.com/retrieve/pii/S0306261915009010. doi:10.1016/j.apenergy.2015.07.061

work page doi:10.1016/j.apenergy.2015.07.061 2015

[9] [9]

doi:https://doi.org/10.1016/ j.energy.2022.123529

D.S.Østergaard, K.M.Smith, M.Tunzi, S.Svendsen, Low-temperature operation of heating systems to enable 4th generation district heating: A review, Energy 248 (2022) 123529. doi:https://doi.org/10.1016/ j.energy.2022.123529

work page arXiv 2022

[10] [10]

Leoni, R

P. Leoni, R. Geyer, R.-R. Schmidt, Developing innovative busi- ness models for reducing return temperatures in district heat- ing systems: Approach and first results, Energy 195 (2020) 116963. URL:https://linkinghub.elsevier.com/retrieve/pii/ S0360544220300700. doi:10.1016/j.energy.2020.116963

work page doi:10.1016/j.energy.2020.116963 2020

[11] [11]

Guevara Bastidas, S

E. Guevara Bastidas, S. Faulstich, H. Dittmer, M. Neumayer, G. S. Mohan, K. Sercan-Calismaz, F. Hosenfelder, T. Gle- newinkel, K. Fischer-Florschütz, A. Cadenbach, Prioritisa- tion of faults in district heating substations: Towards predic- tive maintenance and optimised operation, Energy 333 (2025) 137210. URL:https://linkinghub.elsevier.com/retrieve/pii/...

work page doi:10.1016/j.energy.2025.137210 2025

[12] [12]

Van Dreven, V

J. Van Dreven, V. Boeva, S. Abghari, H. Grahn, J. Al Koussa, E. Mo- toasca, Intelligent Approaches to Fault Detection and Diagnosis in Dis- trict Heating: Current Trends, Challenges, and Opportunities, Elec- tronics 12 (2023) 1448. URL:https://www.mdpi.com/2079-9292/12/ 6/1448. doi:10.3390/electronics12061448

work page doi:10.3390/electronics12061448 2023

[13] [13]

C. Gück, C. M. A. Roelofs, S. Faulstich, CARE to Compare: A Real- World Benchmark Dataset for Early Fault Detection in Wind Turbine Data, Data 9 (2024) 138. URL:https://www.mdpi.com/2306-5729/ 9/12/138. doi:10.3390/data9120138. 28

work page doi:10.3390/data9120138 2024

[14] [14]

Månsson, K

S. Månsson, K. Davidsson, P. Lauenburg, M. Thern, Automated Sta- tistical Methods for Fault Detection in District Heating Customer In- stallations, Energies 12 (2018) 113. URL:https://www.mdpi.com/ 1996-1073/12/1/113. doi:10.3390/en12010113

work page doi:10.3390/en12010113 2018

[15] [15]

Theusch, P

F. Theusch, P. Klein, R. Bergmann, W. Wilke, W. Bock, A. We- ber, Fault Detection and Condition Monitoring in District Heat- ing Using Smart Meter Data, PHM Society European Conference 6 (2021) 11. URL:https://papers.phmsociety.org/index.php/phme/ article/view/2786. doi:10.36001/phme.2021.v6i1.2786

work page doi:10.36001/phme.2021.v6i1.2786 2021

[16] [16]

Leiria, K

D. Leiria, K. H. Andersen, S. P. Melgaard, H. Johra, A. Marszal- Pomianowska, M. S. Piscitelli, A. Capozzoli, M. Z. Pomianowski, To- wards automated fault detection and diagnosis in district heating cus- tomers: generation and analysis of a labeled dataset with ground truth, in: ProceedingsofBuildingSimulation2023: 18thConferenceofIBPSA, 2023, pp. 3615 – ...

work page doi:10.26868/25222708.2023 2023

[17] [17]

Calikus, S

E. Calikus, S. Nowaczyk, A. Sant’Anna, H. Gadd, S. Werner, A data-driven approach for discovering heat load patterns in dis- trict heating, Applied Energy 252 (2019) 113409. URL:https: //linkinghub.elsevier.com/retrieve/pii/S0306261919310839. doi:10.1016/j.apenergy.2019.113409

work page doi:10.1016/j.apenergy.2019.113409 2019

[18] [18]

Simulation-based testing to improve safety of autonomous robots,

S. Abghari, V. Boeva, J. Brage, C. Johansson, H. Grahn, N. Laves- son, Higher Order Mining for Monitoring District Heating Substa- tions, in: 2019 IEEE International Conference on Data Science and Advanced Analytics (DSAA), IEEE, Washington, DC, USA, 2019, pp. 382–391. URL:https://ieeexplore.ieee.org/document/8964173/. doi:10.1109/DSAA.2019.00053

work page doi:10.1109/dsaa.2019.00053 2019

[19] [19]

Farouq, S

S. Farouq, S. Byttner, M.-R. Bouguelia, N. Nord, H. Gadd, Large-scale monitoring of operationally diverse district heating substations: A reference-group based approach, Engineering Ap- plications of Artificial Intelligence 90 (2020) 103492. URL:https: //linkinghub.elsevier.com/retrieve/pii/S0952197620300117. doi:10.1016/j.engappai.2020.103492. 29

work page doi:10.1016/j.engappai.2020.103492 2020

[20] [20]

Zhang, H

F. Zhang, H. Fleyeh, Anomaly Detection of Heat Energy Usage in Dis- trict Heating Substations Using LSTM based Variational Autoencoder Combined with Physical Model, in: 2020 15th IEEE Conference on Industrial Electronics and Applications (ICIEA), IEEE, Kristiansand, Norway, 2020, pp. 153–158. URL:https://ieeexplore.ieee.org/ document/9248108/. doi:10.1109...

work page doi:10.1109/iciea48937.2020.9248108 2020

[21] [21]

Y. Choi, S. Yoon, Autoencoder-driven fault detection and di- agnosis in building automation systems: Residual-based and la- tent space-based approaches, Building and Environment 203 (2021) 108066. URL:https://linkinghub.elsevier.com/retrieve/ pii/S0360132321004686. doi:10.1016/j.buildenv.2021.108066

work page doi:10.1016/j.buildenv.2021.108066 2021

[22] [22]

Vallée, T

M. Vallée, T. Wissocq, Y. Gaoua, N. Lamaison, Genera- tion and evaluation of a synthetic dataset to improve fault de- tection in district heating and cooling systems, Energy 283 (2023) 128387. URL:https://linkinghub.elsevier.com/retrieve/ pii/S0360544223017814. doi:10.1016/j.energy.2023.128387

work page doi:10.1016/j.energy.2023.128387 2023

[23] [23]

Vallée, T

M. Vallée, T. Wissocq, N. Lamaison, Kaggle Dataset: Fault Detection and Diagnosis in District Heating, 2024. URL:https://www.kaggle. com/datasets/mathieuvallee/ai-dhc/data

work page 2024

[24] [24]

Van Dreven, V

J. Van Dreven, V. Boeva, S. Abghari, H. Grahn, J. Al Koussa, A systematic approach for data generation for intelligent fault de- tection and diagnosis in District Heating, Energy 307 (2024) 132711. URL:https://linkinghub.elsevier.com/retrieve/pii/ S036054422402485X. doi:10.1016/j.energy.2024.132711

work page doi:10.1016/j.energy.2024.132711 2024

[25] [25]

Stecher, L

D. Stecher, L. Ziegltrum, P. Reiprich, C. Fuchs, A. Maier, J. Schmidt, Neural network synthetic dataset generation for fault detection in district heating substations, Smart Energy 20 (2025) 100206. URL:https://linkinghub.elsevier.com/retrieve/ pii/S2666955225000346. doi:10.1016/j.segy.2025.100206

work page doi:10.1016/j.segy.2025.100206 2025

[26] [26]

C. M. Roelofs, M.-A. Lutz, S. Faulstich, S. Vogt, Autoencoder-based anomaly root cause analysis for wind turbines, Energy and AI 4 (2021) 100065. URL:https://linkinghub.elsevier.com/retrieve/ pii/S2666546821000197. doi:10.1016/j.egyai.2021.100065. 30

work page doi:10.1016/j.egyai.2021.100065 2021