Context-aware Entity-Relation Extraction for Threat Intelligence Knowledge Graphs
Pith reviewed 2026-05-20 19:53 UTC · model grok-4.3
The pith
A pipeline framework combines SecureBERT+ embeddings with domain ontology knowledge to extract entities and relations from cybersecurity threat reports while reducing error propagation.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The CTiKG framework accurately extracts and classifies threat entities and their relationships from CTI reports by incorporating hybrid NLP models that leverage SecureBERT+ contextual embeddings and expert knowledge from a domain ontology to reduce misclassifications and mitigate cascading errors.
What carries the argument
The CTiKG pipeline architecture, which chains hybrid NLP models that fuse SecureBERT+ contextual embeddings with domain-ontology rules to extract and classify entity-relation triples from CTI text.
If this is right
- Higher NER and RE accuracy produces cleaner triples for constructing queryable cybersecurity knowledge graphs.
- Lower error propagation across the pipeline raises end-to-end reliability for real-time threat analysis.
- Validation on DNRTI and STUCCO benchmarks indicates the approach generalizes beyond the main test set.
- Public release of the DNRTI-AUG-STIX2 dataset supports direct replication and extension by others.
Where Pith is reading between the lines
- The same hybrid pattern could be tested on other specialized report domains where jargon and structure create similar extraction problems.
- If the ontology rules can be kept up to date, the framework might lower the ongoing manual curation burden for security teams.
- Success here suggests that lightweight domain injection into pre-trained language models can outperform purely data-driven baselines in narrow technical fields.
Load-bearing premise
Integrating SecureBERT+ embeddings with domain ontology knowledge will meaningfully reduce misclassifications and stop errors from cascading through the extraction pipeline.
What would settle it
Re-running the experiments on the DNRTI-AUG-STIX2 dataset and finding no improvement or a drop in precision, recall, or F1 for named-entity recognition and relation extraction compared with prior baselines would falsify the performance claim.
Figures
read the original abstract
Cybersecurity Knowledge Graphs (CKGs) unify diverse Cyber Threat Intelligence (CTI) sources into structured, queryable formats, offering scalable solutions for automating proactive and real-time security responses. Their increasing adoption has significantly enhanced the workflow and decision-making efficiency of security professionals. However, constructing CKGs requires extracting entity-relation triples from unstructured CTI reports, a task hindered by complex report structure, domain-specific language, and semantic ambiguity. As a result, existing pipeline-based approaches often suffer from error propagation, reducing extraction accuracy and limiting generalizability. This paper introduces the Context-aware Threat Intelligence Knowledge Graph (CTiKG) framework, a pipeline architecture designed to accurately extract and classify threat entities and their relationships from CTI reports. CTiKG incorporates hybrid NLP models that leverage SecureBERT+ contextual embeddings and expert knowledge from a domain ontology to reduce misclassifications and mitigate cascading errors. Experiments on the DNRTI-AUG-STIX2 dataset, which comprises 21 entity types aligned with STIX 2.1, demonstrate significant improvements over state-of-the-art baselines, yielding 3-4% gains in NER and up to 8% in RE performance, based on precision, recall, and F1-score. Additional validation on DNRTI and STUCCO benchmarks confirms the framework's robustness and practical applicability. All datasets, including the curated DNRTI-AUG-STIX2, are released on GitHub to foster reproducibility and further research.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces the Context-aware Threat Intelligence Knowledge Graph (CTiKG) framework, a pipeline architecture for extracting entity-relation triples from unstructured cyber threat intelligence (CTI) reports. It employs hybrid NLP models that combine SecureBERT+ contextual embeddings with expert knowledge from a domain ontology to reduce misclassifications and mitigate error propagation. Experiments on the newly curated DNRTI-AUG-STIX2 dataset (21 entity types aligned with STIX 2.1) report 3-4% gains in named entity recognition (NER) and up to 8% in relation extraction (RE) over state-of-the-art baselines, measured by precision, recall, and F1-score; additional results are shown on the DNRTI and STUCCO benchmarks. All datasets are released on GitHub.
Significance. If the performance gains prove robust under proper statistical controls, the work could meaningfully advance automated construction of cybersecurity knowledge graphs by addressing a practical bottleneck in CTI processing. The public release of the DNRTI-AUG-STIX2 dataset and the focus on a domain-specific embedding (SecureBERT+) constitute clear strengths for reproducibility and applicability. The central empirical claim, however, rests on small absolute improvements whose reliability cannot be assessed without variance estimates or significance testing.
major comments (1)
- [Abstract] Abstract and Experiments section: The claim of 'significant improvements' (3-4% NER, up to 8% RE) is load-bearing for the paper's contribution yet is presented without statistical significance tests, standard deviations across multiple random seeds, error bars, or details on baseline re-implementations and train-test split stability. On an augmented dataset, these omissions leave open the possibility that observed deltas fall within run-to-run noise.
minor comments (2)
- The description of the domain ontology integration and how expert knowledge is injected into the hybrid model could be expanded with a concrete example or pseudocode to clarify the pipeline.
- Table or figure captions for the benchmark results should explicitly state the number of runs and any hyperparameter search protocol used for the reported F1 scores.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback on the need for statistical rigor in validating the reported performance gains. We address the major comment below and will strengthen the manuscript accordingly.
read point-by-point responses
-
Referee: [Abstract] Abstract and Experiments section: The claim of 'significant improvements' (3-4% NER, up to 8% RE) is load-bearing for the paper's contribution yet is presented without statistical significance tests, standard deviations across multiple random seeds, error bars, or details on baseline re-implementations and train-test split stability. On an augmented dataset, these omissions leave open the possibility that observed deltas fall within run-to-run noise.
Authors: We agree that the absence of variance estimates and significance testing leaves the robustness of the small absolute gains open to question. In the revised manuscript we will add: results from at least five independent runs with different random seeds, reporting means and standard deviations for all metrics; error bars on all bar charts; paired statistical significance tests (e.g., t-test or Wilcoxon signed-rank) with p-values comparing CTiKG to each baseline; and explicit documentation of baseline re-implementations together with the exact train-test split procedure used for DNRTI-AUG-STIX2. These additions will confirm that the observed 3-4 % NER and up to 8 % RE improvements exceed run-to-run variability. revision: yes
Circularity Check
No significant circularity: empirical application study without derivation chain
full rationale
The paper describes a pipeline framework (CTiKG) for NER and RE on CTI reports, using SecureBERT+ embeddings plus a domain ontology. No equations, first-principles derivations, or predictions are presented that reduce by construction to fitted parameters or input data. Reported 3-4% NER and 8% RE gains are empirical results on DNRTI-AUG-STIX2 and other benchmarks; they are not tautological with any model definition or self-citation. The work is self-contained as an applied ML study with no load-bearing self-citations, uniqueness theorems, or ansatz smuggling. This is the expected non-finding for an empirical application paper.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Hybrid models combining contextual embeddings and domain ontologies reduce misclassifications in entity-relation extraction pipelines.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
hybrid NLP models that leverage SecureBERT+ contextual embeddings and expert knowledge from a domain ontology
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
SecureBERT+-BiGRU-CRF model ... CRF layer to enforce valid tag transitions
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
MITRE: MITRE Response to Cyber Attack in One of Its R&D Netw orks, https://www.mitre.org/news-insights/news-release/mitre-response-cyber-attack- one-its-rd-networks (accessed June 02, 2024)
work page 2024
-
[2]
Crumpton, L., Clancy, C.: Advanced Cyber Threats Impact E ven the Most Pre- pared, https://medium.com/mitre-engenuity/advanced-c yber-threats-impact-even- the-most-prepared-56444e980dc8 (accessed June 02, 2024)
work page 2024
-
[3]
Pingle, A., Piplai, A., Mittal, S., Joshi, A., Holt, J., Za k, R.: RelExt: relation ex- traction using deep learning approaches for cybersecurity knowledge graph improve- ment. ASONAM ’19: Proceedings of the 2019 IEEE/ACM Internat ional Conference on Advances in Social Networks Analysis and Mining, vol. 2, p p. 879-886, ACM, Vancouver, British Columbia, C...
work page 2019
-
[4]
Sarhan, I., Spruit, M.: Open-CyKG: An Open Cyber Threat In telli- gence Knowledge Graph. Knowledge-Based Systems, vol. 233, (2021), https://doi.org/10.1016/j.knosys.2021.107524
-
[5]
2022 the 7th Inter national Conference on Big Data Analytics (ICBDA), pp
Zuo, J., Gao, Y., Li, X., Yuan, J.: An End-to-end Entity and Relation Joint Extrac- tion Model for Cyber Threat Intelligence. 2022 the 7th Inter national Conference on Big Data Analytics (ICBDA), pp. 204-209. IEEE, Guangzhou , China (2022), https://doi.org/10.1109/ICBDA55095.2022.9760342 CTiKG Framework 15
-
[6]
Zhong, Z., Chen, D.: A Frustratingly Easy Approach for Ent ity and Relation Ex- traction, In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human La nguage Technologies, ACL, pp. 50-61, 2021
work page 2021
-
[7]
Yan, Z., Jia, Z., Tu, K.: An Empirical Study of Pipeline vs. Joint approaches to Entity and Relation Extraction, In Proceedings of the 2nd Co nference of the Asia- Pacific Chapter of the Association for Computational Lingui stics and the 12th In- ternational Joint Conference on Natural Language Processi ng, ACL, pp. 437-443, 2022
work page 2022
-
[8]
Ahmed, K., Khurshid, S., K., Hina, S.: CyberEntRel: Joint extraction of cyber entities and relations using deep learning. Computers & Sec urity, vol. 136, (2024), https://doi.org/10.1016/j.cose.2023.103579
-
[9]
Information and Communications Security: 23rd Inte rnational Con- ference, ICICS 2021, pp
Guo, Y., Liu, Z., Huang, C., Liu, J., Jing, W., Wang, Z., Wan g, Y.: CyberRel: Joint Entity and Relation Extraction for Cyberse curity Con- cepts. Information and Communications Security: 23rd Inte rnational Con- ference, ICICS 2021, pp. 447—463, Springer, Chongqing, Chi na (2021), https://doi.org/10.1007/978-3-030-86890-1_25
-
[10]
Mouiche, I., Saad, S.: Entity and relation extractions f or threat in- telligence knowledge graphs. Computers & Security, vol. 14 8, (2025), https://doi.org/10.1016/j.cose.2024.104120
-
[11]
Hugging Face, https://huggingface.co/ehsanaghaei/SecureBERT_Plu, last ac- cessed 2025/02/25
work page 2025
-
[12]
Security and Privacy in C ommunication Net- works, vol
Aghaei, E., Niu, X., Shadid, W., Al-Shaer, E.: SecureBER T: A Domain-Specific Language Model for Cybersecurity. Security and Privacy in C ommunication Net- works, vol. 462, (2023) https://doi.org/10.1109/TrustCom50675.2020.00083
-
[13]
In Proceeding s of NAACL-HLT, 2016
Lample, G., Ballesteros, M., Subramanian, S., Kawakami , K., Dyer, C.: Neural Architectures for Named Entity Recognition. In Proceeding s of NAACL-HLT, 2016
work page 2016
-
[14]
Security and Privacy in C ommunication Net- works, vol
Wang, X. et al.: DNRTI: A Large-Scale Dataset for Named En tity Recognition in Threat Intelligence. 2020 IEEE 19th Interna tional Con- ference on Trust, Security and Privacy in Computing and Comm uni- cations (TrustCom), pp. 1842-1848. IEEE, Guangzhou, China , (2020), https://doi.org/10.1109/TrustCom50675.2020.00252
-
[15]
Bridges, R. A., Jones, C. L., Iannacone, M. D., Goodall, J , R.: Automatic Labeling for Entity Extraction in Cyber Security. The Third ASE Inter national Conference on Cyber Security 2014, (2014)
work page 2014
-
[16]
Wang, X., Liu, Z., Liu, J.: Information extraction of cyb ersecurity concepts: an lstm approach. Computers & Security, vol. 144, (2024)
work page 2024
-
[17]
Guo, Z., Liu, Z., Huang, C., Wang, N., Min, H., Guo, W., Liu , J.: A framework for threat intelligence extraction and fusion. Computers & Security, vol. 132, 2024, https://doi.org/10.1016/j.cose.2023.103371
-
[18]
Bilayer-induced asymmetric quantum Hall effect in epitaxial graphene
Liu, Y., Han, X., Zuo, W., Lv, H., Guo, J.: CTI-JE: A Joint E xtrac- tion Framework of Entities and Relations in Unstructured Cy ber Threat In- telligence. 27th International Conference on Computer Sup ported Coopera- tive Work in Design (CSCWD), pp. 2728-2733. IEEE, Tianjin, C hina (2024), https://doi.org/10.1109/CSCWD61410.2024.10580210
work page internal anchor Pith review Pith/arXiv arXiv doi:10.1109/cscwd61410.2024.10580210 2024
-
[19]
Interpretability in mapping weeds and crops from drone images
Lv, H., Han, X., Cui, H., Wang, P., Zuo, W., Zhou, Z.: Joint Extrac- tion of Entities and Relationships from Cyber Threat Intell igence based on Task-specific Fourier Network. 2024 International Joint Conference on Neural Networks (IJCNN), pp. 1-8, IEEE, Yokohama, Japan, 2024, https://doi.org/10.1109/IJCNN60899.2024.10650942 16 I. Mouiche and S. Saad
-
[20]
in IEEE Interne t of Things Journal, pp
Zhu, F., Cheng, Z., Li, P., Xu, H.: ITIRel: Joint Entity an d Relation Extraction for Internet of Things Threat Intelligence. in IEEE Interne t of Things Journal, pp. 20867-20878, 2024, https://doi.org/10.1109/JIOT.2024.3373799
-
[21]
TechRxiv , 2024, https://doi.org/10.36227/techrxiv.174286575.55673704/v1
Mouiche, I., Saad, S.: TIJERE: A Novel Threat Intelligen ce Joint Ex- traction Model based on Analyst Expert Knowledge. TechRxiv , 2024, https://doi.org/10.36227/techrxiv.174286575.55673704/v1
-
[22]
Gasmi, H., Laval, J., Bouras, A.: Information extractio n of cybersecurity concepts: an lstm approach. Applied Sciences, vol. 9, (2019)
work page 2019
-
[23]
Zhao, J., Yan, Q., Liu, X., Li, B., Zuo, G.: Cyber threat in telligence modeling based on heterogeneous graph convolutional network. In: In Proceedings of the 23rd international symposium on research in attacks, intrusion s and defenses (RAID 2020), pp. 241–256, USENIX, San Sebastian (2020)
work page 2020
-
[24]
Jo, H., Lee, Y., Shin, S.:Vulcan: Automatic extraction a nd analysis of cyber threat intelligence from unstructured text. Computers & Security , vol. 120, (2022)
work page 2022
-
[25]
Marchiori, F., Conti, M., Verde, N., V.: STIXnet: A Novel and Modular Solution for Extracting All STIX Objects in CTI Reports. ARES ’23: Pro ceedings of the 18th International Conference on A vailability, Reliabili ty, and Security, 2023, doi: 10.1145/3600160.3600182
-
[26]
, Extracting Information about Security Vulnerabilities from Web Text
Mulwad, V., Li, W., Joshi, A., Finin, T., Viswanathan, K. , Extracting Information about Security Vulnerabilities from Web Text. 2011 IEEE/WI C/ACM International Conferences on Web Intelligence and Intelligent Agent Tech nology, Lyon, France, pp. 257-260, 2011, doi: 10.1109/WI-IAT.2011.26
-
[27]
Securit y and Communication Networks, (2022), https://doi.org/10.1155/2022/8477260
Li, Y., Guo, Y., Fang, C., Liu, Y., Chen, Q.: A Novel Threat Intelligence Informa- tion Extraction System Combining Multiple Models. Securit y and Communication Networks, (2022), https://doi.org/10.1155/2022/8477260
-
[28]
Piplai, A., Mittal, S., Joshi, A., Finin, T., Holt, J., Za k, R.: Creating Cybersecurity Knowledge Graphs From Malware After Action Reports. in IEEE Access, vol. 8, pp. 211691-211703, (2020)
work page 2020
-
[29]
ICML ’01: Proceedings of the Eighteenth International Conference on Machine Learning, pp
Lafferty, JJ., McCallum, A., Pereira, F.: Conditional Ra ndom Fields: Probabilistic Models for Segmenting and Labeling Sequence Data. ICML ’01: Proceedings of the Eighteenth International Conference on Machine Learning, pp. 282-289, 2001
work page 2001
-
[30]
In Proceedings of the 21st In ternational Conference on Security and Cryptography, vol
Mouiche, I., Saad, S.: TI-NERmerger: Semi-automated Fr amework for Integrating NER Datasets in Cybersecurity. In Proceedings of the 21st In ternational Conference on Security and Cryptography, vol. 1, pp. 357–370, SciTePre ss, Dijon, France (2024)
work page 2024
-
[31]
OASIS OPEN, https://docs.oasis-open.org/cti/stix/v 2.1/cs02/stix-v2.1- cs02.html, last accessed 2025/02/10
work page 2025
-
[32]
et al.: Scikit-learn: Mach ine learning in Python
Pedregosa, F., Varoquaux, G. et al.: Scikit-learn: Mach ine learning in Python. The Journal of Machine Learning Research, vol. 12, PP. 2825–283 0, 2011
work page 2011
-
[33]
Proceedings of the AAAI Workshop on Artific ial Intelligence for Cyber Security, pp
Syed, Z., Padia, A., Finin, T., Mathews, L., Joshi, A.: UC O: A Unified Cyberse- curity Ontology. Proceedings of the AAAI Workshop on Artific ial Intelligence for Cyber Security, pp. 195–202, AAAI Press, (2016)
work page 2016
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.