Leveraging Unsupervised Learning for Cost-Effective Visual Anomaly Detection
Pith reviewed 2026-05-23 20:56 UTC · model grok-4.3
The pith
Unsupervised learning enables visual anomaly detection on Raspberry Pi hardware using only ten normal images.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The research develops a low-cost visual anomaly detection solution that uses minimal data for model training while maintaining generalizability and scalability. The system utilises unsupervised learning models from Anomalib and is deployed on affordable Raspberry Pi hardware through openVINO. The results show that this cost-effective system can complete anomaly detection training and inference on a Raspberry Pi in just 90 seconds using only 10 normal product images, achieving an F1 macro score exceeding 0.95. While the system is slightly sensitive to environmental changes like lighting, product positioning, or background, it remains a swift and economical method for factory automation.
What carries the argument
Unsupervised learning models from Anomalib, adapted and deployed via openVINO on Raspberry Pi hardware, which perform training and inference from only 10 normal images.
If this is right
- Small and medium-sized manufacturers gain access to automated inspection without large data collections or expert teams.
- The approach supports rapid setup for new production lines using only a handful of good samples.
- Deployment on affordable single-board computers reduces capital costs compared with traditional inspection rigs.
- The pipeline preserves scalability while cutting the usual requirements for repetitive model retraining.
Where Pith is reading between the lines
- The same minimal-data pattern could be tested on other low-power edge devices to widen hardware options.
- Extending the method to multi-product lines would require checking whether 10 images per product remain sufficient.
- The acknowledged sensitivity to environment suggests a natural next step of adding lightweight adaptation steps at inference time.
Load-bearing premise
The unsupervised models from Anomalib continue to deliver strong detection performance when run on Raspberry Pi hardware via openVINO even when supplied with only 10 normal images.
What would settle it
A direct test on the Raspberry Pi deployment that records substantially lower detection reliability when the same 10 training images are used but lighting, positioning, or background is altered.
Figures
read the original abstract
Traditional machine learning-based visual inspection systems require extensive data collection and repetitive model training to improve accuracy. These systems typically require expensive camera, computing equipment and significant machine learning expertise, which can substantially burden small and medium-sized enterprises. This study explores leveraging unsupervised learning methods with pre-trained models and low-cost hardware to create a cost-effective visual anomaly detection system. The research aims to develop a low-cost visual anomaly detection solution that uses minimal data for model training while maintaining generalizability and scalability. The system utilises unsupervised learning models from Anomalib and is deployed on affordable Raspberry Pi hardware through openVINO. The results show that this cost-effective system can complete anomaly defection training and inference on a Raspberry Pi in just 90 seconds using only 10 normal product images, achieving an F1 macro score exceeding 0.95. While the system is slightly sensitive to environmental changes like lighting, product positioning, or background, it remains a swift and economical method for factory automation inspection for small and medium-sized manufacturers. The code is available at https://github.com/Yunbo-max/Cost-Effective-Visual-Anomaly-Detection-using-Unsupervised-Learning.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript describes a cost-effective visual anomaly detection system that applies unsupervised models from the Anomalib library, optimized and deployed on Raspberry Pi hardware using OpenVINO. It reports that training and inference can be completed in 90 seconds using only 10 normal product images while attaining an F1 macro score above 0.95, positioning the approach as accessible for small and medium-sized manufacturers despite noted sensitivity to environmental factors such as lighting and positioning. The code is made available via GitHub.
Significance. If the headline performance numbers prove robust, the work demonstrates a practical route to lowering hardware and data barriers for industrial visual inspection by repurposing existing open-source tools (Anomalib + OpenVINO) on commodity edge devices. Public code release aids reproducibility and potential follow-on use by practitioners.
major comments (2)
- [Abstract] Abstract: the central empirical claims (90 s training+inference on Raspberry Pi, F1 macro >0.95 with 10 normal images) are stated without any description of the test dataset (number of anomalous samples, source, or split), the specific Anomalib models selected, or statistical validation (multiple runs, error bars). These omissions are load-bearing for assessing whether the unsupervised approach actually delivers the advertised performance.
- [Abstract] Abstract: the text acknowledges sensitivity to lighting, product positioning, and background yet supplies no quantitative ablation, perturbation tests, or robustness metrics under controlled variations of these factors. Because the target use case is factory-floor inspection where such variability is routine and only 10 normal samples are used, the absence of this analysis directly weakens support for the generalizability claim.
minor comments (2)
- [Abstract] Abstract contains a clear typo: 'anomaly defection training' should be 'anomaly detection training'.
- The manuscript would benefit from a table or figure summarizing hardware specifications, model variants tested, and exact timing breakdowns (training vs. inference) to make the 90-second figure reproducible.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback on the abstract and the need for stronger support of the performance claims. We address each major comment below and indicate where revisions will be made.
read point-by-point responses
-
Referee: [Abstract] Abstract: the central empirical claims (90 s training+inference on Raspberry Pi, F1 macro >0.95 with 10 normal images) are stated without any description of the test dataset (number of anomalous samples, source, or split), the specific Anomalib models selected, or statistical validation (multiple runs, error bars). These omissions are load-bearing for assessing whether the unsupervised approach actually delivers the advertised performance.
Authors: We agree the abstract should be more self-contained. The full manuscript details the custom product-image dataset (including the number of anomalous samples and the train/test split), the specific pre-trained Anomalib models employed, and the single-run experimental protocol. In revision we will expand the abstract to briefly state the dataset source and size, name the models, and clarify that the reported F1 macro >0.95 is the primary observed value under the described conditions. revision: yes
-
Referee: [Abstract] Abstract: the text acknowledges sensitivity to lighting, product positioning, and background yet supplies no quantitative ablation, perturbation tests, or robustness metrics under controlled variations of these factors. Because the target use case is factory-floor inspection where such variability is routine and only 10 normal samples are used, the absence of this analysis directly weakens support for the generalizability claim.
Authors: We acknowledge that the absence of quantitative robustness metrics limits the strength of the generalizability claim. Our experiments focused on demonstrating feasibility with 10 normal images and 90-second training on Raspberry Pi; the sensitivity statement is based on informal observations rather than controlled ablations. We will revise the abstract and discussion to more explicitly frame this as a limitation and identify controlled robustness testing as future work. revision: partial
- Quantitative ablation or perturbation results for lighting, positioning, and background variations, because no such controlled experiments were conducted.
Circularity Check
No circularity; performance claims are direct empirical measurements from standard models
full rationale
The paper reports measured runtime (90 s) and F1 macro (>0.95) obtained by running pre-existing Anomalib unsupervised models on Raspberry Pi hardware via openVINO with a fixed set of 10 normal images. No derivation chain, equations, fitted parameters, or self-referential predictions appear in the provided text. The results are presented as experimental outcomes rather than quantities derived from the setup itself. Acknowledged sensitivity to lighting/positioning is a robustness concern, not a circularity issue. The central claims therefore remain independent of any self-definition or self-citation reduction.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Unsupervised anomaly detection models can learn normal patterns sufficiently from a small number (10) of normal samples to achieve high detection accuracy
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
The system utilises unsupervised learning models from Anomalib and is deployed on affordable Raspberry Pi hardware through openVINO. The results show that this cost-effective system can complete anomaly detection training and inference on a Raspberry Pi in just 90 seconds using only 10 normal product images, achieving an F1 macro score exceeding 0.95.
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
While the system is slightly sensitive to environmental changes like lighting, product positioning, or background
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
- [1]
-
[2]
Apostolopoulos, I. D. and M. A. Tzani (2023). Industrial object and defect recognition utilizing multilevel feature extraction from industrial scenes with deep learning approach. Journal of Ambient Intelligence and Humanized Computing\/ 14\/ (8), 10263--10276
work page 2023
- [3]
-
[4]
Chew, Y. Z. (2022). Anomaly detection for vision-based inspection . Ph.\ D. thesis, UTAR
work page 2022
-
[5]
Christie, S., A. Prakash, E. Ashton, and V. Tanner (2023). Enhancing manual visual inspection process by using opencv ai and anomalib for defect detection in automotive assembly and manufacturing process. In International Electronic Packaging Technical Conference and Exhibition , Volume 87516, pp.\ V001T08A003. American Society of Mechanical Engineers
work page 2023
- [6]
-
[7]
Dewangan, D. K. and S. P. Sahu (2020). Deep learning-based speed bump detection model for intelligent vehicle system using raspberry pi. IEEE sensors journal\/ 21\/ (3), 3570--3578
work page 2020
-
[8]
D \"u rr, O., Y. Pauchard, D. Browarnik, R. Axthelm, and M. Loeser (2015). Deep learning on a raspberry pi for real time face recognition. In Eurographics (Posters) , pp.\ 11--12
work page 2015
-
[9]
Gorbachev, Y., M. Fedorov, I. Slavutin, A. Tugarev, M. Fatekhov, and Y. Tarkan (2019). Openvino deep learning workbench: Comprehensive analysis and tuning of neural networks inference. In Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops , pp.\ 0--0
work page 2019
-
[10]
Gudovskiy, D., S. Ishizaka, and K. Kozuka (2022). Cflow-ad: Real-time unsupervised anomaly detection with localization via conditional normalizing flows. In Proceedings of the IEEE/CVF winter conference on applications of computer vision , pp.\ 98--107
work page 2022
-
[11]
Hattori, K., T. Izumi, and L. Meng (2023). Defect detection of apples using patchcore. In 2023 International Conference on Advanced Mechatronic Systems (ICAMechS) , pp.\ 1--6. IEEE
work page 2023
-
[12]
Hussain, M. (2023). Yolo-v1 to yolo-v8, the rise of yolo and its complementary nature toward digital manufacturing and industrial defect detection. Machines\/ 11\/ (7), 677
work page 2023
-
[13]
Jha, S. B. and R. F. Babiceanu (2023). Deep cnn-based visual defect detection: Survey of current literature. Computers in Industry\/ 148 , 103911
work page 2023
- [14]
- [15]
-
[16]
Kim, D., S. Baik, and T. H. Kim (2023). Sanflow: Semantic-aware normalizing flow for anomaly detection. Advances in Neural Information Processing Systems\/ 36 , 75434--75454
work page 2023
-
[17]
Kim, I., Y. Jeon, J. W. Kang, and J. Gwak (2023). Rag-padim: residual attention guided padim for defects segmentation in railway tracks. Journal of Electrical Engineering & Technology\/ 18\/ (2), 1429--1438
work page 2023
-
[18]
Liu, J., G. Xie, J. Wang, S. Li, C. Wang, F. Zheng, and Y. Jin (2024). Deep industrial image anomaly detection: A survey. Machine Intelligence Research\/ 21\/ (1), 104--135
work page 2024
-
[19]
Mounir, T., S. Boumerdassi, A. Benhamada, A. Alla, and M. Kherarba (2020). Performance evaluation of basic image processing algorithms in cpu, gpu, raspberry pi and fpga. Int. J. Comput. Sci. Eng.(IJCSE)\/ 9\/ (4), 312--325
work page 2020
-
[20]
Qurishee, M. A. (2019). Low-cost deep learning uav and raspberry pi solution to real time pavement condition assessment
work page 2019
-
[21]
Rippel, O. and D. Merhof (2023). Anomaly detection for automated visual inspection: A review. Bildverarbeitung in der Automation: Ausgew \"a hlte Beitr \"a ge des Jahreskolloquiums BVAu 2022\/ , 1--13
work page 2023
- [22]
-
[23]
Russakovsky, O., J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. Bernstein, et al. (2015). Imagenet large scale visual recognition challenge. International journal of computer vision\/ 115 , 211--252
work page 2015
-
[24]
Santos, J., T. Tran, and O. Rippel (2023). Optimizing patchcore for few/many-shot anomaly detection. arXiv preprint arXiv:2307.10792\/
-
[25]
S \"u zen, A. A., B. Duman, and B. S en (2020). Benchmark analysis of jetson tx2, jetson nano and raspberry pi using deep-cnn. In 2020 International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA) , pp.\ 1--5. IEEE
work page 2020
-
[26]
Tao, X., X. Gong, X. Zhang, S. Yan, and C. Adak (2022). Deep learning for unsupervised anomaly localization in industrial images: A survey. IEEE Transactions on Instrumentation and Measurement\/ 71 , 1--21
work page 2022
-
[27]
Xie, G., J. Wang, J. Liu, J. Lyu, Y. Liu, C. Wang, F. Zheng, and Y. Jin (2024). Im-iad: Industrial image anomaly detection benchmark in manufacturing. IEEE Transactions on Cybernetics\/
work page 2024
-
[28]
Yong, B. X. and A. Brintrup (2022). Bayesian autoencoders with uncertainty quantification: Towards trustworthy anomaly detection. Expert Systems with Applications\/ 209 , 118196
work page 2022
- [30]
-
[31]
Ziegler, T. (2023). Applications of AI on Resource-ConstrainedHardware with a focus on Anomaly Detection . Ph.\ D. thesis, Massachusetts Institute of Technology
work page 2023
-
[32]
Zipfel, J., F. Verworner, M. Fischer, U. Wieland, M. Kraus, and P. Zschech (2023). Anomaly detection for industrial quality assurance: A comparative evaluation of unsupervised deep learning models. Computers & Industrial Engineering\/ 177 , 109045
work page 2023
-
[33]
\@ifxundefined[1] #1\@undefined \@firstoftwo \@secondoftwo \@ifnum[1] #1 \@firstoftwo \@secondoftwo \@ifx[1] #1 \@firstoftwo \@secondoftwo [2] @ #1 \@temptokena #2 #1 @ \@temptokena \@ifclassloaded agu2001 natbib The agu2001 class already includes natbib coding, so you should not add it explicitly Type <Return> for now, but then later remove the command n...
-
[34]
\@lbibitem[] @bibitem@first@sw\@secondoftwo \@lbibitem[#1]#2 \@extra@b@citeb \@ifundefined br@#2\@extra@b@citeb \@namedef br@#2 \@nameuse br@#2\@extra@b@citeb \@ifundefined b@#2\@extra@b@citeb @num @parse #2 @tmp #1 NAT@b@open@#2 NAT@b@shut@#2 \@ifnum @merge>\@ne @bibitem@first@sw \@firstoftwo \@ifundefined NAT@b*@#2 \@firstoftwo @num @NAT@ctr \@secondoft...
-
[35]
Fastflow: Unsupervised anomaly detection and localization via 2d normalizing flows,
@open @close @open @close and [1] URL: #1 \@ifundefined chapter * \@mkboth \@ifxundefined @sectionbib * \@mkboth * \@mkboth\@gobbletwo \@ifclassloaded amsart * \@ifclassloaded amsbook * \@ifxundefined @heading @heading NAT@ctr thebibliography [1] @ \@biblabel @NAT@ctr \@bibsetup #1 @NAT@ctr @ @openbib .11em \@plus.33em \@minus.07em 4000 4000 `\.\@m @bibit...
-
[36]
, " * write output.state after.block = add.period
ENTRY address archive author booktitle chapter collaboration edition editor eid howpublished institution journal key lastchecked month note number numpages organization pages publisher school series title type url urldate volume year label extra.label sort.label INTEGERS output.state before.all mid.sentence after.sentence after.block FUNCTION init.state.c...
-
[37]
" write newline "" before.all 'output.state := FUNCTION n.dashify 't := "" t empty not t #1 #1 substring "-" = t #1 #2 substring "--" = not "--" * t #2 global.max substring 't := t #1 #1 substring "-" = "-" * t #2 global.max substring 't := while if t #1 #1 substring * t #2 global.max substring 't := if while FUNCTION word.in bbl.in capitalize " " * FUNCT...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.