AquaSight: Automatic Water Impurity Detection Utilizing Convolutional Neural Networks
Pith reviewed 2026-05-24 20:14 UTC · model grok-4.3
The pith
A convolutional neural network trained on 105 water images detects impurities at 96 percent accuracy.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
After training a convolutional neural network on 105 images of water with different contamination magnitudes, the model classifies impurity levels by analyzing turbidity and transparency, reaching 96 percent accuracy and enabling a mobile application that supplies rapid water-quality estimates to individuals and authorities.
What carries the argument
Convolutional Neural Networks that classify water images according to visual cues of turbidity and transparency.
If this is right
- Individuals obtain a low-cost estimate of their local water quality from a phone photograph.
- Contamination alerts can be sent to local and national governments for follow-up action.
- The method supplies an alternative to laboratory-based water testing that requires no specialized equipment.
- Widespread use could contribute to reduced exposure to polluted water supplies.
Where Pith is reading between the lines
- The same image-classification approach might be adapted to monitor other visible environmental indicators such as air quality or soil condition.
- Pairing the app output with geographic data could produce crowd-sourced maps of water quality over time.
- Expanding the training set beyond 105 images would be a direct next step to test whether accuracy holds for more diverse water bodies.
Load-bearing premise
The collection of 105 images adequately represents the range of real-world water contamination and the trained model will generalize to new samples from varied sources.
What would settle it
Running the published model on several hundred new water photographs collected from multiple independent real-world locations and obtaining accuracy well below 96 percent would falsify the reported performance.
read the original abstract
According to the United Nations World Water Assessment Programme, every day, 2 million tons of sewage and industrial and agricultural waste are discharged into the worlds water. In order to address this pervasive issue of increasing water pollution, while ensuring that the global population has an efficient, accurate, and low cost method to assess whether the water they drink is contaminated, we propose AquaSight, a novel mobile application that utilizes deep learning methods, specifically Convolutional Neural Networks, for automated water impurity detection. After comprehensive training with a dataset of 105 images representing varying magnitudes of contamination, the deep learning algorithm achieved a 96 percent accuracy and loss of 0.108. Furthermore, the machine learning model uses efficient analysis of the turbidity and transparency levels of water to estimate a particular sample of waters level of contamination. When deployed, the AquaSight system will provide an efficient way for individuals to secure an estimation of water quality, alerting local and national government to take action and potentially saving millions of lives worldwide.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes AquaSight, a mobile application that employs convolutional neural networks to detect water impurities from images of water samples. It reports that a CNN trained on a dataset of 105 images achieved 96% accuracy with a loss of 0.108, and claims the system can provide low-cost water quality estimates to alert governments and save lives.
Significance. A validated, accessible mobile tool for turbidity-based water impurity detection could have practical value in low-resource settings. However, the reported result provides no evidence of generalization beyond the training set, limiting any assessment of real-world significance.
major comments (2)
- [Abstract] Abstract: The headline result of 96% accuracy (loss 0.108) on 105 images is presented with no description of train/validation/test partitioning, cross-validation procedure, data augmentation, or external test sets from different sources/lighting conditions. This omission makes it impossible to determine whether the metric reflects training performance or true generalization.
- [Abstract] Abstract: Standard CNN image classifiers require thousands of examples (or transfer learning plus augmentation) to learn robust features; with N=105 and no mention of these techniques, the reported accuracy is likely to be an overfit training-set figure that will not support the downstream claim of reliable real-world deployment for government alerts.
Simulated Author's Rebuttal
We thank the referee for their constructive comments. We respond point-by-point to the major comments below.
read point-by-point responses
-
Referee: [Abstract] Abstract: The headline result of 96% accuracy (loss 0.108) on 105 images is presented with no description of train/validation/test partitioning, cross-validation procedure, data augmentation, or external test sets from different sources/lighting conditions. This omission makes it impossible to determine whether the metric reflects training performance or true generalization.
Authors: We agree that the abstract omits these details. The submitted manuscript describes only a dataset of 105 images with no further specification of partitioning, cross-validation, augmentation, or external sets. We will revise the abstract to state the data split used and explicitly note the absence of cross-validation and external test sets. revision: partial
-
Referee: [Abstract] Abstract: Standard CNN image classifiers require thousands of examples (or transfer learning plus augmentation) to learn robust features; with N=105 and no mention of these techniques, the reported accuracy is likely to be an overfit training-set figure that will not support the downstream claim of reliable real-world deployment for government alerts.
Authors: We acknowledge the validity of this concern. The abstract provides no information on transfer learning or augmentation, and the small dataset size raises a legitimate risk that the reported figure reflects training rather than generalization performance. We will revise the manuscript to discuss these limitations explicitly and moderate the claims regarding real-world deployment and life-saving potential to reflect the preliminary character of the study. revision: yes
- No external test sets from different sources or lighting conditions exist in the study, so evidence of generalization beyond the collected images cannot be supplied.
Circularity Check
No circularity; empirical performance report with no derivation chain
full rationale
The paper reports training a CNN on 105 images and states an achieved accuracy of 96% with loss 0.108. No equations, derivations, first-principles claims, or load-bearing self-citations appear in the provided text. The result is presented as an empirical outcome of training rather than a derived prediction that reduces to its inputs by construction. No steps match the enumerated circularity patterns.
Axiom & Free-Parameter Ledger
free parameters (1)
- CNN hyperparameters
axioms (1)
- domain assumption The 105 images sufficiently represent contamination variations
Reference graph
Works this paper leans on
-
[1]
AI-Driven Test System Detects Bacteria in Water. (2018, March 22). Retrieved from https://software.intel.com/en- us/articles/ai-driven-test-system-detects-bacteria-in-water
work page 2018
-
[2]
C., Meier, U., Masci, J., Gambardella, L
Ciresan, D. C., Meier, U., Masci, J., Gambardella, L. M., Schmidhuber, J. (2011, June). Flexible, high performance convolutional neural networks for image classification. In Twenty-Second International Joint Conference on Artificial Intelligence
work page 2011
-
[3]
Krizhevsky, A., Sutskever, I., Hinton, G. E. (2012). Im- agenet classification with deep convolutional neural networks. In Advances in neural information processing systems (pp. 1097-1105)
work page 2012
-
[4]
Landrigan, P. J., Fuller, R., Acosta, N. J., Adeyi, O., Arnold, R., Bald, A. B., ... Chiles, T. (2018). The Lancet Commission on pollution and health. The Lancet, 391(10119), 462-512
work page 2018
-
[5]
Mateo-Sagasta, J., Zadeh, S. M., Turral, H. (Eds.). (2018). More people, more food, worse water?: a global review of water pollution from agriculture. Rome, Italy: FAO Colombo, Sri Lanka: International Water Management Insti- tute (IWMI). CGIAR Research Program on Water, Land and Ecosystems (WLE)
work page 2018
-
[6]
World Population Prospects: The 2017 Revision, Key Findings and Advance Tables
United Nations, Department of Economic and Social Af- fairs, Population Division (2017). World Population Prospects: The 2017 Revision, Key Findings and Advance Tables. Work- ing Paper No. ESA/P/WP/248
work page 2017
-
[7]
Algae Detection Using Computer Vision and Deep Learning
Samantaray, A., Yang, B., Dietz, J. E., Min, B. C. (2018). Algae Detection Using Computer Vision and Deep Learning. arXiv preprint arXiv:1811.10847
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[8]
Schwartz, J., Levin, R. (1999). Drinking water turbidity and health. Epidemiology, 86-90
work page 1999
-
[9]
Toivanen, T., Koponen, S., Kotovirta, V ., Molinier, M., Chengyuan, P. (2013). Water quality analysis using an inex- pensive device and a mobile phone. Environmental Systems Research, 2(1), 9
work page 2013
-
[10]
WHO/UNICEF Joint Monitoring Programme. (2017, July 12). 2.1 billion people lack safe drinking water at home, more than twice as many lack safe sanitation [Press release]. Retrieved from https://www.who.int/news-room/detail/12-07- 2017-2-1-billion-people-lack-safe-drinking-water-at-home- more-than-twice-as-many-lack-safe-sanitation
work page 2017
-
[11]
Codevilla, F., Gaya, J. D. O., Duarte, N., Botelho, S. (2004). Achieving turbidity robustness on underwater images local feature detection. International journal of computer vision, 60(2), 91-110
work page 2004
-
[12]
Ge, Z., McCool, C., Sanderson, C., Corke, P. (2015, September). Modelling local deep convolutional neural net- work features to improve fine-grained image classification. In 2015 IEEE International Conference on Image Processing (ICIP) (pp. 4112-4116) . IEEE
work page 2015
-
[13]
Mahapatra, S. S., Nanda, S. K., Panigrahy, B. K. (2011). A Cascaded Fuzzy Inference System for Indian river water quality prediction. Advances in Engineering Software , 42(10), 787-796
work page 2011
-
[14]
Yuan, F., Huang, Y ., Chen, X., Cheng, E. (2018). A Biological Sensor System Using Computer Vision for Water Quality Monitoring. IEEE Access, 6, 61535-61546
work page 2018
-
[15]
Zhang, Y ., Pulliainen, J., Koponen, S., Hallikainen, M. (2002). Application of an empirical neural network to surface water quality estimation in the Gulf of Finland using combined optical data and microwave data. Remote sensing of environment, 81(2-3), 327-336
work page 2002
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.