SentinelSphere: Integrating AI-Powered Real-Time Threat Detection with Cybersecurity Awareness Training
Pith reviewed 2026-05-10 17:47 UTC · model grok-4.3
The pith
A single AI platform can combine real-time threat detection with LLM-driven user training to address both technical and human cybersecurity vulnerabilities.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
SentinelSphere shows that an Enhanced Deep Neural Network trained on CIC-IDS2017 and CIC-DDoS2019 datasets with added HTTP-layer features attains high detection accuracy and low false positives across DDoS, brute force, and web exploits, while a quantized Phi-4 model fine-tuned for cybersecurity enables conversational training that runs on 16 GB RAM hardware and proves intuitive in workshops with professionals and students.
What carries the argument
The SentinelSphere unified framework that couples an Enhanced Deep Neural Network using novel HTTP-layer features for threat detection with a quantized LLM assistant for adaptive security awareness training.
If this is right
- The enhanced DNN achieves high accuracy with substantially lower false positives than baseline models on critical attack categories.
- The traffic light visualization and conversational AI assistant make security concepts accessible and effective for users without technical backgrounds.
- The full system deploys on commodity hardware using only 16 GB RAM without needing a GPU.
- Validation with industry professionals and students confirms the platform's usability for both detection and education components.
- Coupling detection and training in one framework addresses technical vulnerabilities and human-factor weaknesses together.
Where Pith is reading between the lines
- Smaller organizations lacking dedicated security staff could run continuous detection and ongoing education through this single accessible system.
- The LLM assistant could be extended to pull real-time threat data from the DNN module and customize training examples to the specific attacks observed.
- Longer-term field trials would be required to confirm whether the training produces measurable reductions in phishing clicks or other human errors beyond the initial workshops.
- This design suggests treating user education as an active part of the defense layer rather than a separate annual exercise.
Load-bearing premise
The enhanced DNN will keep high accuracy and low false positives on unseen real-world traffic, and brief workshops will demonstrate that the LLM training produces lasting changes in user behavior.
What would settle it
Deploying the Enhanced DNN on live enterprise network traffic for several months and comparing its false positive rate and recall to ground-truth labels from security analysts, or re-testing workshop participants after three to six months to measure whether security incident rates or error behaviors have changed.
Figures
read the original abstract
The field of cybersecurity is confronted with two interrelated challenges: a worldwide deficit of qualified practitioners and ongoing human-factor weaknesses that account for the bulk of security incidents. To tackle these issues, we present SentinelSphere, a platform driven by artificial intelligence that unifies machine learning-based threat identification with security training powered by a Large Language Model (LLM). The detection module uses an Enhanced Deep Neural Network (DNN) trained on the CIC-IDS2017 and CIC-DDoS2019 benchmark datasets, enriched with novel HTTP-layer feature engineering that captures application level attack signatures. For the educational component, we deploy a quantised variant of Phi-4 model (Q4_K_M), fine-tuned for the cybersecurity domain, enabling deployment on commodity hardware requiring only 16 GB of RAM without dedicated GPU resources. Experimental results show that the Enhanced DNN attains high detection accuracy while substantially lowering false positives relative to baseline models, and maintains strong recall across critical attack categories such as DDoS, brute force, and web-based exploits. Validation workshops involving industry professionals and university students confirmed that the Traffic Light visualisation system and conversational AI assistant are both intuitive and effective for users without technical backgrounds. SentinelSphere illustrates that coupling intelligent threat detection with adaptive, LLM-driven security education can meaningfully address both technical and human-factor cybersecurity vulnerabilities within a single, cohesive framework.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents SentinelSphere, a unified AI platform that integrates an enhanced deep neural network (DNN) for real-time threat detection—trained on the CIC-IDS2017 and CIC-DDoS2019 benchmark datasets with novel HTTP-layer feature engineering—with an LLM-based cybersecurity awareness training module using a quantized Phi-4 model (Q4_K_M) that runs on commodity hardware with 16 GB RAM. The paper reports that the DNN achieves high detection accuracy, substantially lower false positives than baselines, and strong recall across attack categories including DDoS, brute force, and web exploits. The educational component is validated via workshops with industry professionals and university students, who found the Traffic Light visualization and conversational AI assistant intuitive and effective for non-technical users. The central claim is that coupling intelligent threat detection with adaptive LLM-driven education can meaningfully address both technical and human-factor cybersecurity vulnerabilities within a single cohesive framework.
Significance. If the empirical claims hold, this work would be significant by demonstrating a practical, integrated AI framework that simultaneously tackles machine-detectable threats and the human element responsible for most security incidents. The deployment of a quantized domain-specific LLM on standard hardware without GPUs is a clear practical strength that could promote broader adoption. The use of established public benchmarks for the detection module provides a reproducible starting point, and the overall concept of a cohesive detection-plus-education platform could stimulate further research in holistic cybersecurity solutions.
major comments (2)
- [Validation workshops] The validation of the LLM-driven educational component (described in the abstract and validation workshops section) rests entirely on qualitative feedback from brief workshops with professionals and students indicating that the Traffic Light system and conversational assistant are 'intuitive and effective.' No quantitative metrics are reported on knowledge retention, pre/post behavior change, simulated attack resistance over time, or comparison to control groups. This is load-bearing for the central claim that the framework 'meaningfully address[es] both technical and human-factor cybersecurity vulnerabilities,' as the education module's long-term impact remains unproven while the DNN component has benchmark support.
- [Experimental results] The abstract and experimental results claim the Enhanced DNN 'attains high detection accuracy while substantially lowering false positives relative to baseline models' and 'maintains strong recall' but provide no specific numeric values, error bars, confusion matrices, or detailed comparison tables against standard models on the same CIC-IDS2017/CIC-DDoS2019 datasets. If these details (including the exact definition and impact of the novel HTTP-layer features) are absent or insufficiently documented in the full experimental section, the technical superiority claim cannot be fully evaluated.
minor comments (2)
- [Abstract] The abstract relies on qualitative terms ('high detection accuracy,' 'strong recall') without referencing specific performance numbers or tables; adding these would improve clarity even if detailed results appear later in the paper.
- [Educational component] The fine-tuning procedure and prompt engineering for the quantized Phi-4 model are mentioned but not described in sufficient detail (e.g., dataset used for fine-tuning, hyperparameters, or evaluation metrics for the LLM component).
Simulated Author's Rebuttal
We thank the referee for their thorough and constructive review. We address each major comment below, committing to revisions that improve clarity, documentation, and scope without overstating our current results.
read point-by-point responses
-
Referee: [Validation workshops] The validation of the LLM-driven educational component (described in the abstract and validation workshops section) rests entirely on qualitative feedback from brief workshops with professionals and students indicating that the Traffic Light system and conversational assistant are 'intuitive and effective.' No quantitative metrics are reported on knowledge retention, pre/post behavior change, simulated attack resistance over time, or comparison to control groups. This is load-bearing for the central claim that the framework 'meaningfully address[es] both technical and human-factor cybersecurity vulnerabilities,' as the education module's long-term impact remains unproven while the DNN component has benchmark support.
Authors: We agree that the educational validation relies on qualitative workshop feedback and does not include quantitative metrics such as pre/post knowledge scores, retention tests, or controlled comparisons. In the revised manuscript we will expand the Validation Workshops section with additional details on participant demographics, workshop protocol, session duration, and representative feedback quotes. We will also add an explicit limitations paragraph noting that long-term behavioral impact studies lie outside the current scope of this platform paper. These changes will better frame the strength of the existing evidence. revision: partial
-
Referee: [Experimental results] The abstract and experimental results claim the Enhanced DNN 'attains high detection accuracy while substantially lowering false positives relative to baseline models' and 'maintains strong recall' but provide no specific numeric values, error bars, confusion matrices, or detailed comparison tables against standard models on the same CIC-IDS2017/CIC-DDoS2019 datasets. If these details (including the exact definition and impact of the novel HTTP-layer features) are absent or insufficiently documented in the full experimental section, the technical superiority claim cannot be fully evaluated.
Authors: We thank the referee for highlighting this gap. While the experimental section contains performance tables, we will revise the abstract and main text to prominently report key numeric results (accuracy, precision, recall, F1-score, false-positive rate) with direct baseline comparisons on both datasets. We will also insert confusion matrices, error bars from repeated runs or cross-validation, and a new subsection that precisely defines the novel HTTP-layer features together with an ablation study quantifying their contribution to improved detection. revision: yes
Circularity Check
No significant circularity detected
full rationale
The paper describes an empirical system integrating a DNN for threat detection (trained on public CIC-IDS2017 and CIC-DDoS2019 benchmarks with added HTTP features) and a quantized LLM for education, validated via workshops. No equations, derivations, or mathematical claims exist that could reduce predictions to inputs by construction. No self-citations, ansatzes, or uniqueness theorems are invoked as load-bearing elements. The central claim of a cohesive framework rests on independent empirical training results and qualitative feedback rather than any self-referential reduction, making the derivation chain self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption CIC-IDS2017 and CIC-DDoS2019 datasets contain representative samples of real-world attack traffic.
- domain assumption Quantized Phi-4 model fine-tuned on cybersecurity text will produce accurate and helpful responses for non-experts.
Reference graph
Works this paper leans on
-
[1]
Ramezan, C.A. (2023). Examining the Cyber Skills Gap: An Analysis of Cybersecurity Positions by Sub-Field. Journal of Information Systems Education, 34(1):94--105. https://jise.org/Volume34/n1/JISE2023v34n1pp94-105.html
work page 2023
-
[2]
ISC2 Cybersecurity Workforce Study 2023
ISC2 (2023). ISC2 Cybersecurity Workforce Study 2023. International Information System Security Certification Consortium, Alexandria, VA. https://www.isc2.org/research
work page 2023
-
[3]
Kucuk, M.F. and Uysal, I. (2022). Anomaly Detection in Self-Organizing Networks: Conventional Versus Contemporary Machine Learning. IEEE Access, 10:61744--61752. doi:10.1109/ACCESS.2022.3181910
-
[4]
The Cost of Malware Containment
Ponemon Institute (2023). The Cost of Malware Containment. Ponemon Institute LLC, Traverse City, MI
work page 2023
-
[5]
2025 Data Breach Investigations Report
Verizon (2025). 2025 Data Breach Investigations Report. Verizon Business, New York. https://www.verizon.com/business/resources/reports/dbir/
work page 2025
-
[6]
ENISA (2023). ENISA Threat Landscape 2023. European Union Agency for Cybersecurity, Luxembourg. doi:10.2824/782573
-
[7]
2024.The NIST Cybersecurity Framework 2.0
National Institute of Standards and Technology (2024). The NIST Cybersecurity Framework (CSF) 2.0. NIST Cybersecurity White Paper, U.S. Department of Commerce, Washington DC. doi:10.6028/NIST.CSWP.29
-
[8]
ResilMesh: Situation Aware enabled Cyber Resilience for Dispersed, Heterogenous Cyber Systems
ResilMesh Consortium (2023). ResilMesh: Situation Aware enabled Cyber Resilience for Dispersed, Heterogenous Cyber Systems. EU Horizon Europe Project 101119681, Technical Report
work page 2023
-
[9]
Nguyen, T., Sipola, T., and Hautam \"a ki, J. (2024). Machine Learning Applications of Quantum Computing: A Review. In Proceedings of the 23rd European Conference on Cyber Warfare and Security (ECCWS 2024), pp.\ 370--377. Academic Conferences International. doi:10.34190/eccws.23.1.2390
-
[10]
Sadlek, L., Hus \'a k, M., and C eleda, P. (2024). Hierarchical Modeling of Cyber Assets in Kill Chain Attack Graphs. In 2024 20th International Conference on Network and Service Management (CNSM), pp.\ 1--7. IEEE. doi:10.23919/CNSM62983.2024
-
[11]
Tantaroudas, N.D., Karachalios, I., and McCracken, A.J. (2026). SentinelSphere: AI-Driven Cybersecurity Platform Combining Threat Detection with Security Awareness. In Proceedings of the 21st International Conference on Availability, Reliability and Security (ARES 2025). ACM. doi:10.1145/3664476.3670446
-
[12]
Outside the closed world: On using machine learning for network intrusion detection
Sommer, R. and Paxson, V. (2010). Outside the Closed World: On Using Machine Learning for Network Intrusion Detection. In 2010 IEEE Symposium on Security and Privacy, pp.\ 305--316. IEEE. doi:10.1109/SP.2010.25
-
[13]
Yin, C., Zhu, Y., Fei, J., and He, X. (2017). A Deep Learning Approach for Intrusion Detection Using Recurrent Neural Networks. IEEE Access, 5:21954--21961. doi:10.1109/ACCESS.2017.2762418
-
[14]
Vinayakumar, R., Alazab, M., Soman, K.P., Poornachandran, P., Al-Nemrat, A., and Venkatraman, S. (2019). Deep Learning Approach for Intelligent Intrusion Detection System. IEEE Access, 7:41525--41550. doi:10.1109/ACCESS.2019.2895334
-
[15]
Sharafaldin, I., Lashkari, A.H., and Ghorbani, A.A. (2018). Toward Generating a New Intrusion Detection Dataset and Intrusion Traffic Characterisation. In Proceedings of the 4th International Conference on Information Systems Security and Privacy (ICISSP), pp.\ 108--116. doi:10.5220/0006639801080116
-
[16]
Sharafaldin, I., Lashkari, A.H., Hakak, S., and Ghorbani, A.A. (2019). Developing Realistic Distributed Denial of Service (DDoS) Attack Dataset and Taxonomy. In 2019 International Carnahan Conference on Security Technology (ICCST), pp.\ 1--8. IEEE. doi:10.1109/CCST.2019.8888419
-
[17]
Sharafaldin, I., Lashkari, A.H., and Ghorbani, A.A. (2018). A Detailed Analysis of the CICIDS2017 Data Set. In Information Systems Security and Privacy, pp.\ 172--188. Springer. doi:10.1007/978-3-030-25109-3_9
-
[18]
Khan, M.A., Iqbal, N., Jamil, H., and Kim, D.-H. (2023). An optimised ensemble prediction model using AutoML based on soft voting classifier for network intrusion detection. Journal of Network and Computer Applications, 212:103560. doi:10.1016/j.jnca.2022.103560
-
[19]
Ferrag, M.A., Maglaras, L., Moschoyiannis, S., and Janicke, H. (2020). Deep Learning for Cyber Security Intrusion Detection: Approaches, Datasets, and Comparative Study. Journal of Information Security and Applications, 50:102419. doi:10.1016/j.jisa.2019.102419
-
[20]
Aktar, S. and Nur, A.Y. (2023). Towards DDoS attack detection using deep learning approach. Computers & Security, 129:103251. doi:10.1016/j.cose.2023.103251
-
[21]
Motlagh, F.N., Hajizadeh, M., Majd, M., Najafi, P., Cheng, F., and Meinel, C. (2024). Large Language Models in Cybersecurity: State-of-the-Art. arXiv preprint arXiv:2402.00891. doi:10.48550/arXiv.2402.00891
-
[22]
Xu, H., Wang, S., Li, N., Zhao, Y., Chen, K., Wang, K., Liu, Y., Yu, T., and Wang, H. (2024). Large Language Models for Cyber Security: A Systematic Literature Review. arXiv preprint arXiv:2405.04760. doi:10.48550/arXiv.2405.04760
-
[23]
Zhang, J., Bu, H., Wen, H., Liu, Y., Fei, H., Xi, R., Li, L., Yang, Y., Zhu, H., and Meng, D. (2025). When LLMs Meet Cybersecurity: A Systematic Literature Review. Cybersecurity, 8:55. doi:10.1186/s42400-025-00357-y
-
[24]
Jaffal, N.O., Alkhanafseh, M., and Mohaisen, D. (2025). Large Language Models in Cybersecurity: A Survey of Applications, Vulnerabilities, and Defense Techniques. AI, 6(9):216. doi:10.3390/ai6090216
-
[25]
Atlam, H.F. (2025). LLMs in Cyber Security: Bridging Practice and Education. Big Data and Cognitive Computing, 9(7):184. doi:10.3390/bdcc9070184
-
[26]
Hassanin, M. and Moustafa, N. (2024). A Comprehensive Overview of Large Language Models (LLMs) for Cyber Defences: Opportunities and Directions. arXiv preprint arXiv:2405.14487. doi:10.48550/arXiv.2405.14487
-
[27]
Chhetri, C. (2024). Exploring Large Language Model-Powered Pedagogical Approaches to Cybersecurity Education. In Proceedings of the 25th Annual Conference on Information Technology Education (SIGITE '24), pp.\ 115--120. ACM. doi:10.1145/3686612.3686643
-
[28]
Hadlington, L. (2017). Human Factors in Cybersecurity: Examining the Link Between Internet Addiction, Impulsivity, Attitudes Towards Cybersecurity, and Risky Cybersecurity Behaviours. Heliyon, 3(7):e00346. doi:10.1016/j.heliyon.2017.e00346
-
[29]
Aldawood, H. and Skinner, G. (2019). Reviewing Cyber Security Social Engineering Training and Awareness Programs---Pitfalls and Ongoing Issues. Future Internet, 11(3):73. doi:10.3390/fi11030073
-
[30]
Bada, M., Sasse, A.M., and Nurse, J.R. (2019). Cyber Security Awareness Campaigns: Why do they fail to change behaviour? arXiv preprint arXiv:1901.02672. doi:10.48550/arXiv.1901.02672
-
[31]
Zhang-Kennedy, L. and Chiasson, S. (2021). A Systematic Review of Multimedia Tools for Cybersecurity Awareness and Education. ACM Computing Surveys, 54(1):12. doi:10.1145/3427920
-
[32]
Araujo, M.S.d., Machado, B.A.S., and Passos, F.U. (2024). Resilience in the Context of Cyber Security: A Review of the Fundamental Concepts and Relevance. Applied Sciences, 14(5):2116. doi:10.3390/app14052116
-
[33]
Ramos-Cruz, B., Andreu-Perez, J., and Mart \' nez, L. (2024). The Cybersecurity Mesh: A Comprehensive Survey of Involved Artificial Intelligence Methods, Cryptographic Protocols and Challenges for Future Research. arXiv preprint arXiv:2402.18373. doi:10.48550/arXiv.2402.18373
-
[34]
Somma, M., Flatscher, A., and Stojanovi \'c , B. (2024). Edge-Based Anomaly Detection: Enhancing Performance and Sustainability in Smart Water Distribution Systems. In 2024 32nd Telecommunications Forum (TELFOR), pp.\ 1--4. IEEE. doi:10.1109/TELFOR63250.2024
-
[35]
Bhatt, S., Manadhata, P.K., and Zomlot, L. (2014). The Operational Role of Security Information and Event Management Systems. IEEE Security & Privacy, 12(5):35--41. doi:10.1109/MSP.2014.103
-
[36]
Tantaroudas, N.D., Karachalios, I., and McCracken, A.J. (2026). SentinelSphere: An AI-driven cybersecurity platform integrating real-time threat detection with security awareness education [version 1; peer review: 2 approved with reservations]. Open Research Europe, 6:58. doi:10.12688/openreseurope.22957.1
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.