Non-technical Loss Detection with Statistical Profile Images Based on Semi-supervised Learning

Fei Wang; Jiangteng Li

arxiv: 1907.03925 · v1 · pith:L3WEVIH6new · submitted 2019-07-09 · 💻 cs.LG · stat.ML

Non-technical Loss Detection with Statistical Profile Images Based on Semi-supervised Learning

Jiangteng Li , Fei Wang This is my paper

Pith reviewed 2026-05-25 00:34 UTC · model grok-4.3

classification 💻 cs.LG stat.ML

keywords non-technical loss detectionsmart gridsemi-supervised learningtime-series to imageanomaly detectiondeep learningpower consumptionmeter data

0 comments

The pith

Converting electricity time series into statistical profile images lets a semi-supervised model detect non-technical losses more accurately with few labeled examples.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes a detection method for non-technical losses in smart grids by first transforming raw meter time-series records into image representations. These images are designed to encode longer-term consumption patterns across multiple aspects. The images then feed a deep learning architecture, adapted from computer vision, that operates in semi-supervised mode to exploit abundant unlabeled data alongside scarce verified abnormal cases. A sympathetic reader would care because non-technical losses create both physical security risks and large revenue shortfalls across hundreds of millions of meters, and current scale makes manual or fully supervised inspection impractical.

Core claim

The authors show that time-series electricity consumption data can be turned into statistical profile images that preserve long-term user behavior, and that a semi-supervised deep model taking these images as input produces joint features that improve anomaly detection when only limited on-field-verified abnormal labels are available.

What carries the argument

Statistical profile images formed by transforming time-series consumption records, paired with a semi-supervised deep learning model that extracts joint features from the images.

If this is right

The approach scales to the hundreds of millions of smart-meter records already collected by power grids.
It reduces dependence on large numbers of manually labeled abnormal samples.
Different causes of non-technical loss can be captured through the multiple aspects encoded in each image.
The model yields measurable gains when evaluated directly against on-field verification results.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same image-conversion step could be tested on other high-volume sensor streams that lack dense labels.
Replacing the semi-supervised component with fully unsupervised alternatives would show whether the image representation alone carries most of the signal.
Extending the images to include spatial neighborhood information from nearby meters could tighten detection further.

Load-bearing premise

The transformation from time-series meter readings to statistical profile images actually encodes the long-term consumption behaviors that distinguish normal from abnormal users, and the semi-supervised model can reliably learn anomalies from very few labeled abnormal cases.

What would settle it

Run the same pipeline on a fresh set of on-field-inspected meters where the image transformation step is replaced by raw time-series input or by random images; if detection accuracy drops to the level of prior methods, the central claim fails.

Figures

Figures reproduced from arXiv: 1907.03925 by Fei Wang, Jiangteng Li.

**Figure 1.** Figure 1: (a) Two time series from a normal (upper) and an abnormal (lower) customer respectively; (b) The corresponding scatter plots of the same piece of data with load rate being the x-axis. Here, we did not aim to get a precise mathematical representation of the feature distributions. Rather, we designed a data transformation inspired by kernel density estimation (KDE), a nonparameter distribution estimation me… view at source ↗

read the original abstract

In order to keep track of the operational state of power grid, the world's largest sensor systems, smart grid, was built by deploying hundreds of millions of smart meters. Such system makes it possible to discover and make quick response to any hidden threat to the entire power grid. Non-technical losses (NTLs) have always been a major concern for its consequent security risks as well as immeasurable revenue loss. However, various causes of NTL may have different characteristics reflected in the data. Accurately capturing these anomalies faced with such large scale of collected data records is rather tricky as a result. In this paper, we proposed a new methodology of detecting abnormal electricity consumptions. We did a transformation of the collected time-series data which turns it into an image representation that could well reflect users' relatively long term consumption behaviors. Inspired by the excellent neural network architecture used for objective detection in computer vision domain, we designed our deep learning model that takes the transformed images as input and yields joint featured inferred from the multiple aspects the input provides. Considering the limited labeled samples, especially the abnormal ones, we used our model in a semi-supervised fashion that is brought out in recent years. The model is tested on samples which are verified by on-field inspections and our method showed significant improvement.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The abstract claims significant gains from turning consumption time series into statistical profile images and running semi-supervised deep learning, but supplies no metrics, baselines, or method details to check the claim.

read the letter

The one thing to know is that this paper asserts its image-plus-semi-supervised pipeline beats prior approaches on field-verified NTL samples, yet the abstract contains zero numbers, no model architecture, no dataset sizes, and no description of how the images are actually constructed. That makes the central result impossible to evaluate from what is given. The work applies two existing techniques—time-series-to-image conversion and semi-supervised learning—to the NTL detection setting in smart grids. It correctly notes that labeled anomalies are scarce and that utilities need methods that scale to hundreds of millions of meters. Those observations are reasonable and the framing around revenue loss and grid security is on target. Beyond that, the paper does not introduce new mathematics or a novel framework; it is an application of known tools. The soft spots are substantial and directly affect the main claim. There is no quantitative evidence at all—no accuracy, precision, recall, or comparison against simple statistical rules or fully supervised baselines. The image transformation is described only at the level of “statistical profile images” that “could well reflect users’ relatively long term consumption behaviors,” with no specification of which statistics, binning, or aggregation windows are used and no ablation showing that this step preserves the signals that matter for NTL. The semi-supervised component is mentioned but not explained: how the unlabeled data are incorporated, what loss is used, or how the method avoids overfitting the few labeled anomalies. The stress-test concern holds up on the available text; without those details the improvement cannot be assessed and could easily be an artifact of the particular data split or an untested representation. This paper would be of interest to a small group of researchers working on applied anomaly detection in energy systems, but only if the full version supplies the missing implementation, results, and comparisons. In its current form it does not contain enough substance to justify referee time. I would not bring it to a reading group and would not cite it.

Referee Report

3 major / 0 minor

Summary. The paper proposes transforming electricity consumption time-series data into statistical profile images to capture long-term user behaviors, then applies a semi-supervised deep learning model (inspired by computer vision object detection architectures) to detect non-technical losses (NTLs) in smart-grid data. It reports that the approach yields significant improvement when tested on samples verified by on-field inspections, addressing the challenge of limited labeled abnormal samples.

Significance. If the central claims hold with proper validation, the work could contribute a practical image-based representation for anomaly detection in large-scale utility data where labels are scarce. However, the absence of any quantitative results, baselines, ablation studies, or implementation details in the manuscript prevents assessment of whether the image transformation or semi-supervised component actually delivers the claimed gains.

major comments (3)

[Abstract] Abstract: the claim that the method 'showed significant improvement' on verified samples is unsupported by any metrics, dataset sizes, baseline comparisons, or model details, rendering the central empirical claim unevaluable.
[Method] The manuscript provides no description or ablation of the statistical profile image construction (specific statistics, binning, or temporal aggregation), so there is no evidence that this transformation captures long-term behaviors better than raw time-series or alternative features.
[Experiments] No details are given on the semi-supervised training procedure, loss functions, or how the model mitigates label scarcity, preventing evaluation of whether the reported gains are due to the architecture or simply to overfitting the limited labeled abnormals.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the detailed and constructive review. We agree that the current manuscript lacks the quantitative results, methodological details, and experimental descriptions needed to fully evaluate the claims. We will revise the paper to address all points raised.

read point-by-point responses

Referee: [Abstract] Abstract: the claim that the method 'showed significant improvement' on verified samples is unsupported by any metrics, dataset sizes, baseline comparisons, or model details, rendering the central empirical claim unevaluable.

Authors: We acknowledge this limitation in the abstract. In the revised version we will replace the qualitative claim with specific metrics (precision, recall, F1-score, AUC), the size of the field-verified test set, the number of labeled and unlabeled samples used, and direct comparisons against at least two baselines (e.g., raw time-series classifiers and standard supervised CNNs). revision: yes
Referee: [Method] The manuscript provides no description or ablation of the statistical profile image construction (specific statistics, binning, or temporal aggregation), so there is no evidence that this transformation captures long-term behaviors better than raw time-series or alternative features.

Authors: We agree that the image-construction procedure is under-specified. The revision will add an explicit subsection detailing the statistics computed per time window (mean, variance, skewness, selected quantiles), the binning and normalization steps, and the temporal aggregation window sizes. We will also include an ablation table comparing detection performance when the model receives the statistical profile images versus raw time-series and versus simpler feature vectors. revision: yes
Referee: [Experiments] No details are given on the semi-supervised training procedure, loss functions, or how the model mitigates label scarcity, preventing evaluation of whether the reported gains are due to the architecture or simply to overfitting the limited labeled abnormals.

Authors: We accept that the semi-supervised training protocol is missing. The revised manuscript will describe the exact semi-supervised framework (including the consistency-regularization or pseudo-labeling loss), the combined supervised-plus-unsupervised objective, all hyperparameters, and the data-augmentation strategy used to mitigate label scarcity. We will add an ablation that isolates the contribution of the semi-supervised component versus a fully supervised counterpart on the same labeled set. revision: yes

Circularity Check

0 steps flagged

No circularity in derivation chain

full rationale

The provided abstract and description contain no equations, derivations, parameter fittings, or load-bearing steps that reduce by construction to inputs. The methodology is described at a high level as a data transformation followed by semi-supervised learning, with claims resting on empirical testing against on-field verified samples rather than any self-referential definitions, fitted quantities renamed as predictions, or self-citation chains. No uniqueness theorems, ansatzes, or renamings of known results are invoked in a way that creates circularity. The derivation chain is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract provides no information on free parameters, axioms, or invented entities.

pith-pipeline@v0.9.0 · 5754 in / 1100 out tokens · 29379 ms · 2026-05-25T00:34:36.405090+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

26 extracted references · 26 canonical work pages

[1]

Rashed Mohassel ; A

R. Rashed Mohassel ; A. Fung; F. Mohammadi; K. Raahemifar. A survey on A dvanced Metering Infrastructure. International Journal of Electrical Power & Energy Systems. 2014, 63, 473–484

work page 2014
[2]

V. Ford; A. Siraj; and W. Eberle. Smart Grid Energy Fraud Detection Using Artificial Neural Networks. IEEE Symposium on Computational Intelligence Applications in Smart Grid, CIASG. 2014, 1-6

work page 2014
[3]

C. Cody; V. Ford; A. Siraj. Decision tree learning for fraud detection in consumer energy consumption. IEEE International Conference on Machine Learning and Applications, ICMLA. 2015, 1175–1179

work page 2015
[4]

Malik; K

A. Malik; K. Barker; R. Alhajj. A comprehensive survey of numeric and symbolic outlier mining techniques. Intelligent Data Analysis. 2006, 10(6), 521-538

work page 2006
[5]

Similarity Measures for Categorical Data: A Comparative Evaluation

Boriah S; Chandola V; Kumar V. Similarity Measures for Categorical Data: A Comparative Evaluation. Siam International Conference on Data Mining. Atlanta, Georgia, USA, April 24-26, 2008; 243-254

work page 2008
[6]

J. Nagi; K. S. Yap; S. K. Tiong ; S. K. A hmed; M. Mohamad. Nontechnical loss detection for metered customers in power utility using support vector machines. IEEE Transactions on Power Delivery . 2010, 25, 1162–1171

work page 2010
[7]

Glauner; J

P. Glauner; J. A. Meira; L. Dolberg; R. State; F. Bettinger; Y. Rangoni. Neighborhood features help detecting non-technical losses in big data sets. Proceedings of the 3rd IEEE/ACM International Conference on Big Data Computing, Applications and Technologies, 2016, 253–261

work page 2016
[8]

B. C. Costa; B. L. a. Alberto; A. M. Portela; W. Madur o; O. Eler; B. Horizonte. Fraud Detection in Electric Power Distribution Networks Using an ANN-Based Knowledge-Discovery Process. International Journal of Artificial Intelligence & Applications, IJAIA. 2013, 4, 17–23,

work page 2013
[9]

Coma-Puig; J

B. Coma-Puig; J. Carmona; R. Gavalda; S. Alcoverro; V. Martin. Fraud Detection in Energy Consumption : A Supervised Approach. IEEE International Conference on Data Science and Advanced Analytics , DSAA. 2016, 120-129

work page 2016
[10]

Imagenet classification with deep convolutional neural networks

Alex Krizhevsky; Ilya Sutskever; Geoffrey E Hinton. Imagenet classification with deep convolutional neural networks. Neural Information Processing Systems. 2012, 25,1097–1105

work page 2012
[11]

Oates T

Wang Z. ; Oates T . Imaging Time -Series to Improve Classification and Imput ation. Twenty-Fourth International Joint Conference on Artificial Intelligence. 2015

work page 2015
[12]

Darrell T ., et al

Girshick R.; Donahue J. ; Darrell T ., et al. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. IEEE Conference on Computer Vision & Pattern Recognition. 2014

work page 2014
[13]

Fast R-CNN

Girshick R. Fast R-CNN. Proceedings of the IEEE international conference on computer vision, 2015, 1440- 1448

work page 2015
[14]

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Ren S.; He K.; Girshick R., et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2015, 39(6), 1137-1149

work page 2015
[15]

Generative adversarial nets

Goodfellow I J; Pouget -Abadie J; Mirza M, et al. Generative adversarial nets. International Conference on Neural Information Processing Systems, 2014

work page 2014
[16]

Learning with Pseudo -Ensembles

Bachman P.; Alsharif O.; Precup D.. Learning with Pseudo -Ensembles. Advances in Neural Information Processing Systems, 2014, 4, 3365-3373

work page 2014
[17]

Regularization With Stochastic Transformations and Perturbations for Deep Semi-Supervised Learning

Sajjadi M.; Javanmardi M.; Tasdizen T. Regularization With Stochastic Transformations and Perturbations for Deep Semi-Supervised Learning. Advances in Neural Information Processing Systems, 2016, 1163-1171

work page 2016
[18]

Laine S.; Aila T.; Temporal Ensembling for Semi-Supervised Learning. 2016

work page 2016
[19]

Mean teachers are better role models: Weight -averaged consistency targets improve semi-supervised deep learning results

Tarvainen A .; Valpola H .. Mean teachers are better role models: Weight -averaged consistency targets improve semi-supervised deep learning results. Advances in neural information processing systems, 2017, 1195-1204

work page 2017
[20]

Laurens V. D. M.; Hinton G . Visualizing Data using t-SNE. Journal of Machine Learning Research, 2008, 9(2605), 2579-2605

work page 2008
[21]

Bradley A. P.. The use of the area under the ROC curve in the evaluation of machine learning algorithms . Pattern recognition, 1997, 30(7), 1145-1159

work page 1997
[22]

FaceNet: A unified embedding for face recognition and clustering

Schroff F.; Kalenichenko D.; Philbin J.. FaceNet: A unified embedding for face recognition and clustering . IEEE Conference on Computer Vision and Pattern Recogn ition (CVPR) - Boston, MA, USA, 2015.6.7 - 2015.6.12. 2015:815-823

work page 2015
[23]

O.; Boechat A.; Dolberg L., et al

Glauner P. O.; Boechat A.; Dolberg L., et al. Large-Scale Detection of Non-Technical Losses in Imbalanced Data Sets. IEEE Power & Energy Society Innovative Smart Grid Technologies Conference(ISGT ). IEEE, 2016, 1-5

work page 2016
[24]

A.; Glauner P.; State R., et al

Meira J. A.; Glauner P.; State R., et al. Distilling provider -independent data for general dete ction of non - technical losses. Power & Energy Conference at Illinois. IEEE, 2017

work page 2017
[25]

M.; Tejedor-Aguilera J.; Cruz-Romero P., et al

Buzau M. M.; Tejedor-Aguilera J.; Cruz-Romero P., et al. Detection o f Non-Technical Losses Using Smart Meter Data and Supervised Learning. IEEE Transactions on Smart Grid, 2018, 99, 1-1

work page 2018
[26]

Emerging Markets Smart Grid: Outlook 2015, LLC Northeast Group, Washington, DC, USA, 2014

work page 2015

[1] [1]

Rashed Mohassel ; A

R. Rashed Mohassel ; A. Fung; F. Mohammadi; K. Raahemifar. A survey on A dvanced Metering Infrastructure. International Journal of Electrical Power & Energy Systems. 2014, 63, 473–484

work page 2014

[2] [2]

V. Ford; A. Siraj; and W. Eberle. Smart Grid Energy Fraud Detection Using Artificial Neural Networks. IEEE Symposium on Computational Intelligence Applications in Smart Grid, CIASG. 2014, 1-6

work page 2014

[3] [3]

C. Cody; V. Ford; A. Siraj. Decision tree learning for fraud detection in consumer energy consumption. IEEE International Conference on Machine Learning and Applications, ICMLA. 2015, 1175–1179

work page 2015

[4] [4]

Malik; K

A. Malik; K. Barker; R. Alhajj. A comprehensive survey of numeric and symbolic outlier mining techniques. Intelligent Data Analysis. 2006, 10(6), 521-538

work page 2006

[5] [5]

Similarity Measures for Categorical Data: A Comparative Evaluation

Boriah S; Chandola V; Kumar V. Similarity Measures for Categorical Data: A Comparative Evaluation. Siam International Conference on Data Mining. Atlanta, Georgia, USA, April 24-26, 2008; 243-254

work page 2008

[6] [6]

J. Nagi; K. S. Yap; S. K. Tiong ; S. K. A hmed; M. Mohamad. Nontechnical loss detection for metered customers in power utility using support vector machines. IEEE Transactions on Power Delivery . 2010, 25, 1162–1171

work page 2010

[7] [7]

Glauner; J

P. Glauner; J. A. Meira; L. Dolberg; R. State; F. Bettinger; Y. Rangoni. Neighborhood features help detecting non-technical losses in big data sets. Proceedings of the 3rd IEEE/ACM International Conference on Big Data Computing, Applications and Technologies, 2016, 253–261

work page 2016

[8] [8]

B. C. Costa; B. L. a. Alberto; A. M. Portela; W. Madur o; O. Eler; B. Horizonte. Fraud Detection in Electric Power Distribution Networks Using an ANN-Based Knowledge-Discovery Process. International Journal of Artificial Intelligence & Applications, IJAIA. 2013, 4, 17–23,

work page 2013

[9] [9]

Coma-Puig; J

B. Coma-Puig; J. Carmona; R. Gavalda; S. Alcoverro; V. Martin. Fraud Detection in Energy Consumption : A Supervised Approach. IEEE International Conference on Data Science and Advanced Analytics , DSAA. 2016, 120-129

work page 2016

[10] [10]

Imagenet classification with deep convolutional neural networks

Alex Krizhevsky; Ilya Sutskever; Geoffrey E Hinton. Imagenet classification with deep convolutional neural networks. Neural Information Processing Systems. 2012, 25,1097–1105

work page 2012

[11] [11]

Oates T

Wang Z. ; Oates T . Imaging Time -Series to Improve Classification and Imput ation. Twenty-Fourth International Joint Conference on Artificial Intelligence. 2015

work page 2015

[12] [12]

Darrell T ., et al

Girshick R.; Donahue J. ; Darrell T ., et al. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. IEEE Conference on Computer Vision & Pattern Recognition. 2014

work page 2014

[13] [13]

Fast R-CNN

Girshick R. Fast R-CNN. Proceedings of the IEEE international conference on computer vision, 2015, 1440- 1448

work page 2015

[14] [14]

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Ren S.; He K.; Girshick R., et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2015, 39(6), 1137-1149

work page 2015

[15] [15]

Generative adversarial nets

Goodfellow I J; Pouget -Abadie J; Mirza M, et al. Generative adversarial nets. International Conference on Neural Information Processing Systems, 2014

work page 2014

[16] [16]

Learning with Pseudo -Ensembles

Bachman P.; Alsharif O.; Precup D.. Learning with Pseudo -Ensembles. Advances in Neural Information Processing Systems, 2014, 4, 3365-3373

work page 2014

[17] [17]

Regularization With Stochastic Transformations and Perturbations for Deep Semi-Supervised Learning

Sajjadi M.; Javanmardi M.; Tasdizen T. Regularization With Stochastic Transformations and Perturbations for Deep Semi-Supervised Learning. Advances in Neural Information Processing Systems, 2016, 1163-1171

work page 2016

[18] [18]

Laine S.; Aila T.; Temporal Ensembling for Semi-Supervised Learning. 2016

work page 2016

[19] [19]

Mean teachers are better role models: Weight -averaged consistency targets improve semi-supervised deep learning results

Tarvainen A .; Valpola H .. Mean teachers are better role models: Weight -averaged consistency targets improve semi-supervised deep learning results. Advances in neural information processing systems, 2017, 1195-1204

work page 2017

[20] [20]

Laurens V. D. M.; Hinton G . Visualizing Data using t-SNE. Journal of Machine Learning Research, 2008, 9(2605), 2579-2605

work page 2008

[21] [21]

Bradley A. P.. The use of the area under the ROC curve in the evaluation of machine learning algorithms . Pattern recognition, 1997, 30(7), 1145-1159

work page 1997

[22] [22]

FaceNet: A unified embedding for face recognition and clustering

Schroff F.; Kalenichenko D.; Philbin J.. FaceNet: A unified embedding for face recognition and clustering . IEEE Conference on Computer Vision and Pattern Recogn ition (CVPR) - Boston, MA, USA, 2015.6.7 - 2015.6.12. 2015:815-823

work page 2015

[23] [23]

O.; Boechat A.; Dolberg L., et al

Glauner P. O.; Boechat A.; Dolberg L., et al. Large-Scale Detection of Non-Technical Losses in Imbalanced Data Sets. IEEE Power & Energy Society Innovative Smart Grid Technologies Conference(ISGT ). IEEE, 2016, 1-5

work page 2016

[24] [24]

A.; Glauner P.; State R., et al

Meira J. A.; Glauner P.; State R., et al. Distilling provider -independent data for general dete ction of non - technical losses. Power & Energy Conference at Illinois. IEEE, 2017

work page 2017

[25] [25]

M.; Tejedor-Aguilera J.; Cruz-Romero P., et al

Buzau M. M.; Tejedor-Aguilera J.; Cruz-Romero P., et al. Detection o f Non-Technical Losses Using Smart Meter Data and Supervised Learning. IEEE Transactions on Smart Grid, 2018, 99, 1-1

work page 2018

[26] [26]

Emerging Markets Smart Grid: Outlook 2015, LLC Northeast Group, Washington, DC, USA, 2014

work page 2015