Towards transparent and data-driven fault detection in manufacturing: A case study on univariate, discrete time series
Pith reviewed 2026-05-19 07:32 UTC · model grok-4.3
The pith
A data-driven fault detection system for manufacturing achieves 95.9% accuracy while generating interpretable explanations for operators.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The integrated system of a supervised multi-class classifier, post-hoc SHAP explanations, and domain visualization achieves 95.9% fault detection accuracy on univariate discrete time series from the crimping process, with quantitative analysis showing selective explanations and qualitative expert assessment confirming their relevance and interpretability.
What carries the argument
Supervised machine learning model for multi-class fault classification combined with Shapley Additive Explanations (SHAP) and a domain-specific visualization that maps explanations to operator-interpretable features.
If this is right
- Quality control can shift from manual threshold setting to adaptive data-driven classification without losing operator trust.
- Explanations generated by the model allow targeted review of which parts of the time series signal indicate specific faults.
- Quantitative selectivity analysis combined with expert feedback offers a repeatable way to validate both detection performance and explanation quality.
- The human-centric design supports broader adoption of machine learning in safety-critical manufacturing environments.
Where Pith is reading between the lines
- If the visualization technique generalizes, similar transparency layers could be added to other time-series monitoring tasks in assembly or welding lines.
- Operators might use the highlighted time-series segments to adjust process parameters proactively rather than only reacting to detected faults.
- Periodic retraining on recent data while preserving the same explanation mapping could keep the system current without retraining operators on new black-box outputs.
Load-bearing premise
The univariate discrete time series dataset from the crimping process is representative of real-world manufacturing variability and the post-hoc SHAP explanations plus domain visualizations will remain meaningful when applied to new production runs or different fault types.
What would settle it
Apply the trained model and explanation pipeline to a fresh dataset from a different production run or fault category and measure whether accuracy drops substantially below 95.9 percent or whether domain experts judge the visualizations as unhelpful or misleading.
read the original abstract
Ensuring consistent product quality in modern manufacturing is crucial, particularly in safety-critical applications. Conventional quality control approaches, reliant on manually defined thresholds and features, lack adaptability to the complexity and variability inherent in production data and necessitate extensive domain expertise. Conversely, data-driven methods, such as machine learning, demonstrate high detection performance but typically function as black-box models, thereby limiting their acceptance in industrial environments where interpretability is paramount. This paper introduces a methodology for industrial fault detection, which is both data-driven and transparent. The approach integrates a supervised machine learning model for multi-class fault classification, Shapley Additive Explanations for post-hoc interpretability, and a do-main-specific visualisation technique that maps model explanations to operator-interpretable features. Furthermore, the study proposes an evaluation methodology that assesses model explanations through quantitative perturbation analysis and evaluates visualisations by qualitative expert assessment. The approach was applied to the crimping process, a safety-critical joining technique, using a dataset of univariate, discrete time series. The system achieves a fault detection accuracy of 95.9 %, and both quantitative selectivity analysis and qualitative expert evaluations confirmed the relevance and inter-pretability of the generated explanations. This human-centric approach is designed to enhance trust and interpretability in data-driven fault detection, thereby contributing to applied system design in industrial quality control.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces a methodology for transparent, data-driven fault detection in manufacturing that integrates a supervised multi-class ML classifier on univariate discrete time series, post-hoc SHAP explanations, and a domain-specific visualization mapping explanations to operator-interpretable features. An evaluation framework combining quantitative perturbation analysis for explanation selectivity and qualitative expert assessment for visualizations is proposed and applied to a crimping-process dataset, yielding a reported fault-detection accuracy of 95.9 % with supporting confirmation of explanation relevance.
Significance. If the empirical results and evaluation hold, the work offers a practical human-centric pipeline that could improve acceptance of ML-based quality control in safety-critical industrial settings by addressing both detection performance and interpretability. The mixed quantitative-qualitative assessment of explanations is a constructive element. As a single-process case study, however, the contribution remains primarily demonstrative rather than establishing broad methodological advances.
major comments (3)
- [§4 and abstract] §4 (Experimental results) and abstract: The headline claim of 95.9 % accuracy is presented without any reported dataset size, class distribution, train-test split details, cross-validation strategy, or baseline comparisons (e.g., against threshold-based or other standard classifiers). These omissions are load-bearing for assessing whether the performance result is robust or merely an artifact of the particular split and data characteristics.
- [§1 and §6] §1 (Introduction) and §6 (Conclusion): The assertion that the approach contributes to enhancing trust and interpretability in industrial quality control rests on a single univariate discrete time-series dataset from one crimping process. No additional datasets, multivariate or continuous signals, or hold-out production runs are evaluated, leaving the transferability assumption untested and central to the broader applicability claim.
- [§3] §3 (Methodology, evaluation subsection): The quantitative perturbation analysis for assessing explanation selectivity is described at a high level but lacks concrete specification of perturbation mechanisms, the exact selectivity metric, and how post-hoc exclusions of samples or features were handled; this directly affects the credibility of the “confirmed relevance” statement.
minor comments (2)
- [abstract] Abstract: the word “inter-pretability” contains an extraneous hyphen; correct to “interpretability”.
- [§3] Notation for the visualization mapping and the precise definition of the perturbation operator could be formalized with a short equation or pseudocode to improve reproducibility.
Simulated Author's Rebuttal
We thank the referee for the constructive and detailed feedback on our manuscript. We have carefully addressed each major comment below with point-by-point responses. Revisions have been incorporated to improve transparency, completeness, and the framing of our contributions as a focused case study.
read point-by-point responses
-
Referee: [§4 and abstract] §4 (Experimental results) and abstract: The headline claim of 95.9 % accuracy is presented without any reported dataset size, class distribution, train-test split details, cross-validation strategy, or baseline comparisons (e.g., against threshold-based or other standard classifiers). These omissions are load-bearing for assessing whether the performance result is robust or merely an artifact of the particular split and data characteristics.
Authors: We agree that these details are essential for assessing robustness and reproducibility. The revised manuscript now explicitly reports the dataset size, class distribution, train-test split, and cross-validation strategy in an expanded Section 4 and updates the abstract accordingly. We have also added baseline comparisons against a threshold-based detector and other standard classifiers (e.g., SVM) to contextualize the 95.9% accuracy result and demonstrate its relative performance. revision: yes
-
Referee: [§1 and §6] §1 (Introduction) and §6 (Conclusion): The assertion that the approach contributes to enhancing trust and interpretability in industrial quality control rests on a single univariate discrete time-series dataset from one crimping process. No additional datasets, multivariate or continuous signals, or hold-out production runs are evaluated, leaving the transferability assumption untested and central to the broader applicability claim.
Authors: We acknowledge that the evaluation is confined to a single crimping-process case study, which is explicitly framed as such in the title, abstract, and introduction. The revised Sections 1 and 6 now more precisely scope the contribution to demonstrating the integrated transparent pipeline on univariate discrete time series, while adding an explicit limitations discussion and future-work directions for validation on additional processes, multivariate signals, and hold-out production data. revision: yes
-
Referee: [§3] §3 (Methodology, evaluation subsection): The quantitative perturbation analysis for assessing explanation selectivity is described at a high level but lacks concrete specification of perturbation mechanisms, the exact selectivity metric, and how post-hoc exclusions of samples or features were handled; this directly affects the credibility of the “confirmed relevance” statement.
Authors: We thank the referee for identifying this need for greater precision. The revised Section 3 now specifies the perturbation mechanism (replacement of high-SHAP time-series segments with values drawn from the empirical distribution), defines the selectivity metric (relative drop in predicted-class confidence), and states that the analysis was conducted on the full test set with no post-hoc exclusions. These clarifications directly support the relevance assessment. revision: yes
Circularity Check
No circularity in empirical ML case study
full rationale
The paper presents an applied case study applying supervised ML for multi-class fault classification on a univariate discrete time series from the crimping process, followed by post-hoc SHAP explanations and domain-specific visualization. The 95.9% accuracy is reported as an empirical result measured on held-out data, and explanation quality is assessed via quantitative perturbation analysis plus qualitative expert evaluation. No derivation chain, equations, or self-citations reduce any claimed result to its own inputs by construction; the work contains no mathematical predictions or uniqueness theorems that could exhibit self-definitional, fitted-input, or ansatz-smuggling circularity. The central claims remain independent empirical observations on the chosen dataset.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
International Electrotechnical Commission, Solderless connections - Part 2: Crimped connections - General requirements, test meth- ods and practical guidance, 2021
work page 2021
-
[2]
P. Bründl, M. Stoidner, J. Bredthauer, H.G. Nguyen, A. Baechler, J. Franke, 2024. Unlocking the potential of digitalization and auto- mation: a qualitative and quantitative study of the control cabinet manufacturing industry. Production & Manufacturing Research 12, 2306820. https://doi.org/10.1080/21693277.2024.2306820
-
[3]
M. Meiners, A. Mayr, M. Kuhn, B. Raab, J. Franke, Towards an Inline Quality Monitoring for Crimping Processes Utilizing Ma- chine Learning Techniques, in: 2020 10th International Electric Drives Production Conference (EDPC), Ludwigsburg, Germany, IEEE, 2020, pp. 1–6
work page 2020
- [4]
-
[5]
A. Mayr, D. Kißkalt, M. Meiners, B. Lutz, F. Schäfer, R. Seidel, A. Selmaier, J. Fuchs, M. Metzner, A. Blank, J. Franke, Machine Learning in Production – Potentials, Challenges and Exemplary Applications, Procedia CIRP 86 (2019) 49–54. https://doi.org/10.1016/j.procir.2020.01.035
-
[6]
M. Meiners, M. Kuhn, J. Franke, Manufacturing process curve monitoring with deep learning, Manufacturing Letters 30 (2021) 15–18. https://doi.org/10.1016/j.mfglet.2021.09.006
-
[7]
J. Song, P. Kumar, Y. Kim, H.S. Kim, A Fault Detection System for Wiring Harness Manufacturing Using Artificial Intelligence, Mathematics 12 (2024) 537. https://doi.org/10.3390/math12040537
-
[8]
B. Hofmann, A. Scheck, H.G. Nguyen, M. Meiners, J. Franke, En- hancing Crimp Curve Monitoring in Wiring Harness Production: A Machine Learning Approach with Emphasis on Diverse Data, in: L.C. Tang (Ed.), Proceedings of the 11th International Conference on Industrial Engineering and Applications, Springer Nature Sin- gapore, Singapore, 2025, pp. 3–13
work page 2025
-
[9]
R. Branco, C. Agostinho, S. Gusmeroli, E. Lavasa, Z. Dikopoulou, D. Monzo, F. Lampathaki, Explainable AI in Manufacturing: an Analysis of Transparency and Interpretability Methods for the XMANAI Platform, in: 2023 IEEE International Conference on Engineering, Technology and Innovation (ICE/ITMC), Edinburgh, United Kingdom, IEEE, 2023, pp. 1–8
work page 2023
-
[10]
McGOWAN, Kavanagh Ronan (Tyco Electronics UK Limited) EP 1 149 438 B1, 2000
B. McGOWAN, Kavanagh Ronan (Tyco Electronics UK Limited) EP 1 149 438 B1, 2000
work page 2000
-
[11]
Rob Boyd, The Impact of the LV 214-4 Standard: A new standard developed by German automotive OEMs provides insight into crimp force monitoring
-
[12]
Kratt, Crimp Force Monitoring - The Force curve
V. Kratt, Crimp Force Monitoring - The Force curve. https://crimppedia.com/index.php/en/technikbibliothek-en/measur- ing-and-testing/crimp-force-monitoring/1162-crimp-force-moni- toring-the-force-curve (accessed 10 January 2025)
work page 2025
-
[13]
ZVEI e. V. Electro and Digital Industry Association, Validation of Low Voltage Automotive Connectors: Technical Guideline - TLF 0214, 2024
work page 2024
-
[14]
https://www.engberts.de/en/produkt/crimpkraf- tueberwachung-bb07i4/ (accessed 29 January 2025)
Engberts Mess-, Steuer- und Regelsysteme GmbH, Crimp force monitor BB07i4. https://www.engberts.de/en/produkt/crimpkraf- tueberwachung-bb07i4/ (accessed 29 January 2025)
work page 2025
-
[15]
H.G. Nguyen, A. Scheck, B. Hofmann, M. Meiners, S. Neubauer, A. Schäfer, J. Franke, Ganzheitliche und auf maschinellen Lern- verfahren basierende Qualitätsüberwachung, Zeitschrift für wirt- schaftlichen Fabrikbetrieb 118 (2023) 198–203. https://doi.org/10.1515/zwf-2023-1045
-
[16]
V. Hassija, V. Chamola, A. Mahapatra, A. Singal, D. Goel, K. Huang, S. Scardapane, I. Spinelli, M. Mahmud, A. Hussain, Inter- preting Black-Box Models: A Review on Explainable Artificial In- telligence, Cogn Comput 16 (2024) 45–74. https://doi.org/10.1007/s12559-023-10179-8
-
[17]
A. Mohammed, C. Geppert, A. Hartmann, P. Kuritcyn, V. Bruns, U. Schmid, T. Wittenberg, M. Benz, B. Finzel, Explaining and Evaluating Deep Tissue Classification by Visualizing Activations of Most Relevant Intermediate Layers, Current Directions in Bio- medical Engineering 8 (2022) 229–232. https://doi.org/10.1515/cdbme-2022-1059
-
[18]
J. Zhou, F. Chen, Human and Machine Learning: Visible, Explain- able, Trustworthy and Transparent, Springer International Publish- ing, Cham, 2018
work page 2018
-
[19]
B. Abdollahi, O. Nasraoui, Transparency in Fair Machine Learn- ing: the Case of Explainable Recommender Systems, in: J. Zhou, F. Chen (Eds.), Human and Machine Learning, Springer Interna- tional Publishing, Cham, 2018, pp. 21–35
work page 2018
-
[20]
P. Dennis, Lean production simplified: A plain language guide to the world's most powerful production system, secondnd Ed., third[rd. print.], Productivity Press, New York, 2008
work page 2008
-
[21]
S. Chaudhary, P. Joshi, P. Bhattacharya, V.K. Prasad, R. Shah, S. Tanwar, Untangling Explainable AI in Applicative Domains: Tax- onomy, Tools, and Open Challenges, in: S. Tanwar, S.T. Wierzchon, P.K. Singh, M. Ganzha, G. Epiphaniou (Eds.), Pro- ceedings of Fourth International Conference on Computing, Com- munications, and Cyber-Security, Springer Nature ...
work page 2023
-
[22]
R. Hoffmann, C. Reich, A Systematic Literature Review on Artifi- cial Intelligence and Explainable Artificial Intelligence for Visual Quality Assurance in Manufacturing, Electronics 12 (2023) 4572. https://doi.org/10.3390/electronics12224572
-
[23]
IEEE Access 10, 100700–100724 (2022)
A. Theissler, F. Spinnato, U. Schlegel, R. Guidotti, Explainable AI for Time Series Classification: A Review, Taxonomy and Research Directions, IEEE Access 10 (2022) 100700–100724. https://doi.org/10.1109/ACCESS.2022.3207765
-
[24]
A Unified Approach to Interpreting Model Predictions
S. Lundberg, S.-I. Lee, A Unified Approach to Interpreting Model Predictions (2017). https://doi.org/10.48550/arXiv.1705.07874
work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.1705.07874 2017
-
[25]
Hofmann, Crimp Force Curve Dataset
B. Hofmann, Crimp Force Curve Dataset. V2 (2025). https://doi.org/10.7910/DVN/WBDKN6
-
[26]
https://schaefer.biz/en/portfolio/crimping-ma- chine-eps-2001/ (accessed 11 January 2025)
Schäfer Werkzeug- und Sondermaschinenbau GmbH, Crimping Machine EPS 2001. https://schaefer.biz/en/portfolio/crimping-ma- chine-eps-2001/ (accessed 11 January 2025)
work page 2001
-
[27]
https://schaefer.biz/en/portfolio/crimp-quality- sensor-sds100/ (accessed 11 January 2025)
Schäfer Werkzeug- und Sondermaschinenbau GmbH, Crimp Qual- ity Sensor SDS100. https://schaefer.biz/en/portfolio/crimp-quality- sensor-sds100/ (accessed 11 January 2025)
work page 2025
-
[28]
M. Bilal, G. Ali, M.W. Iqbal, M. Anwar, M.S.A. Malik, R.A. Ka- dir, Auto-Prep: Efficient and Automated Data Preprocessing Pipe- line, IEEE Access 10 (2022) 107764–107784. https://doi.org/10.1109/ACCESS.2022.3198662
-
[29]
A. Suad, B. Wesam, Review of Data Preprocessing Techniques in Data Mining, Journal of Engineering and Applied Science 12(16), 2017, pp. 4102–4107
work page 2017
-
[30]
M. Russell, J. Kershaw, Y. Xia, T. Lv, Y. Li, H. Ghassemi-Ar- maki, B.E. Carlson, P. Wang, Comparison and explanation of data- driven modeling for weld quality prediction in resistance spot welding, J Intell Manuf 35 (2024) 1305–1319. https://doi.org/10.1007/s10845-023-02108-1
-
[31]
API design for machine learning software: experiences from the scikit-learn project
L. Buitinck, G. Louppe, M. Blondel, F. Pedregosa, A. Mueller, O. Grisel, V. Niculae, P. Prettenhofer, A. Gramfort, J. Grobler, R. Layton, J. Vanderplas, A. Joly, B. Holt, G. Varoquaux, API design for machine learning software: experiences from the scikit-learn project (2013). https://doi.org/10.48550/arXiv.1309.0238
work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.1309.0238 2013
-
[32]
scikit-learn developers, MinMaxScaler. https://scikit-learn.org/sta- ble/modules/generated/sklearn.preprocessing.MinMaxScaler.html (accessed 11 January 2025)
work page 2025
-
[33]
scikit-learn developers, RandomForestClassifier. https://scikit- learn.org/stable/modules/generated/sklearn.ensemble.Random- ForestClassifier.html (accessed 11 January 2025)
work page 2025
-
[34]
scikit-learn developers, GridSearchCV. https://scikit-learn.org/sta- ble/modules/generated/sklearn.model_selec- tion.GridSearchCV.html (accessed 11 January 2025)
work page 2025
-
[35]
https://shap.readthedocs.io/en/lat- est/api.html (accessed 11 January 2025)
Scott Lundberg, API Reference. https://shap.readthedocs.io/en/lat- est/api.html (accessed 11 January 2025)
work page 2025
-
[36]
S.M. Lundberg, G. Erion, H. Chen, A. DeGrave, J.M. Prutkin, B. Nair, R. Katz, J. Himmelfarb, N. Bansal, S.-I. Lee, From Local Ex- planations to Global Understanding with Explainable AI for Trees, Nat. Mach. Intell. 2 (2020) 56–67. https://doi.org/10.1038/s42256- 019-0138-9
-
[37]
F. Mujkanovic, V. Doskoč, M. Schirneck, P. Schäfer, T. Friedrich, timeXplain -- A Framework for Explaining the Predictions of Time Series Classifiers. https://doi.org/10.48550/arXiv.2007.07606
-
[38]
U. Schlegel, D.A. Keim, Time Series Model Attribution Visualiza- tions as Explanations. https://doi.org/10.48550/arXiv.2109.12935
-
[39]
D. Solís-Martín, J. Galán-Páez, J. Borrego-Díaz, On the Soundness of XAI in Prognostics and Health Management (PHM), Infor- mation 14 (2023) 256. https://doi.org/10.3390/info14050256
-
[40]
U. Schlegel, H. Arnout, M. El-Assady, D. Oelke, D.A. Keim, To- wards a Rigorous Evaluation of XAI Methods on Time Series (2019). https://doi.org/10.48550/arXiv.1909.07082
- [41]
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.