HunterAgent: Neuro-Symbolic Attack Trace Reconstruction under Anti-Forensics

Bailing Wang; Guangze Zhao; Hongri Liu; Weilin Gai; Yongzheng Zhang; Yuliang Wei

arxiv: 2605.29269 · v1 · pith:HCCIJCICnew · submitted 2026-05-28 · 💻 cs.CR

HunterAgent: Neuro-Symbolic Attack Trace Reconstruction under Anti-Forensics

Guangze Zhao , Yongzheng Zhang , Weilin Gai , Hongri Liu , Yuliang Wei , Bailing Wang This is my paper

Pith reviewed 2026-06-29 07:02 UTC · model grok-4.3

classification 💻 cs.CR

keywords attack trace reconstructionneuro-symbolic AIanti-forensicsprovenance graphsLLM agentsincident responsecyber securitylog wiping

0 comments

The pith

HunterAgent reconstructs severed attack traces by having an LLM propose links that a symbolic verifier then grounds in surviving telemetry.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces HunterAgent as a neuro-symbolic system that treats attack trace reconstruction as a cost-bounded graph search problem under partial log observability. An LLM generates semantic hypotheses inside a typed ontology while a verifier enforces identifier collisions and prunes schema violations. This combination is meant to avoid the fluent but physically impossible narratives that pure LLM agents produce when logs have been wiped or spoofed. The approach includes a length-discounted epistemic budget that forces the search to halt rather than drift. Results on public and in-house datasets show substantially higher F1 scores and far lower hallucination rates than prior agentic or graph-based baselines.

Core claim

HunterAgent reframes trace reconstruction as cost-bounded heuristic graph search under partial observability. It uses an asymmetric Generator-Verifier pipeline in which the LLM proposes semantic hypotheses within a typed ontology and the verifier grounds each hypothesis through identifier-level collisions on surviving orthogonal telemetry. Hops are scored by a calibrated cost that mixes semantic divergence with OS temporal potential; schema violations are hard-pruned. A length-discounted epistemic budget prevents inferential drift and forces graceful halting. Under strict LOFO cross-validation the system reaches 86.1 percent mean F1, reduces path-level hallucination from 61.5 percent to 6.4

What carries the argument

Asymmetric Generator-Verifier pipeline that scores candidate hops with a cost combining semantic divergence and OS temporal potential while hard-pruning schema violations.

If this is right

Precision stays at or above 84 percent even when 70 percent of logs are wiped.
Path-level hallucination falls from 61.5 percent to 6.4 percent relative to unconstrained LLM agents.
The system halts safely without fabricating links in 95.7 percent of heavy-wiping cases.
Mean F1 reaches 86.1 percent across three public benchmarks plus an in-house 40-trace set, exceeding the strongest prior agentic baseline by 26.7 F1 points.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same generator-verifier split could be applied to reconstructing causal sequences in other domains that produce partially erased event logs, such as distributed system debugging.
Adding more types of orthogonal telemetry as verifiers would likely raise recall without lowering the precision floor.
The calibrated hop-cost function could be replaced by a learned model trained on verified traces, provided the verifier layer remains to enforce OS constraints.

Load-bearing premise

At least one orthogonal telemetry source survives the anti-forensic activity.

What would settle it

Run the system on a dataset in which every orthogonal telemetry source (process IDs, file hashes, network flows) has been fully erased; measure whether precision remains above 84 percent or the system halts without producing a path.

Figures

Figures reproduced from arXiv: 2605.29269 by Bailing Wang, Guangze Zhao, Hongri Liu, Weilin Gai, Yongzheng Zhang, Yuliang Wei.

**Figure 2.** Figure 2: Overall architecture of HunterAgent. The framework operates as an Asymmetric Generator–Verifier pipeline: fragmented observations (Step 1) are [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 3.** Figure 3: Ablation of the semantic Generator on D4. Replacing the LLM by a [PITH_FULL_IMAGE:figures/full_fig_p015_3.png] view at source ↗

**Figure 4.** Figure 4: Effect of varying the epistemic budget Bmax on D4. Unbounded (∞) collapses into an unconstrained agent; overly strict (90-th percentile) halts prematurely; the 99-th-percentile calibration is the empirical sweet spot [PITH_FULL_IMAGE:figures/full_fig_p016_4.png] view at source ↗

**Figure 6.** Figure 6: Reconstructing a severed Cobalt Strike trace. Deterministic graphs [PITH_FULL_IMAGE:figures/full_fig_p016_6.png] view at source ↗

**Figure 5.** Figure 5: Inflation due to evaluation leakage on D4. Random K-Fold artificially [PITH_FULL_IMAGE:figures/full_fig_p016_5.png] view at source ↗

**Figure 7.** Figure 7: F1 vs. causal path length L. Deterministic baselines exhibit a steep “precision–length cliff”; ReAct degrades due to autoregressive drift; HunterAgent’s per-hop budget caps drift accumulation regardless of L. TABLE XX F1 (±STD OVER 5 SEEDS) BY CAUSAL PATH LENGTH L, WEIGHTED ACROSS D1†–D4. Path length MAGIC KAIROS ReAct HunterAgent L = 1–2 62.4 ±1.3 78.5 ±1.1 71.5 ±1.4 93.2 ±0.5 L = 3–5 38.7 ±1.5 71.0 ±1.2 … view at source ↗

read the original abstract

Modern alert-triage systems reduce SOC burden by filtering false positives, but flagging a high-risk alert is only the start of incident response. Threat hunting requires reconstructing causal attack chains across heterogeneous, partially corrupted logs. Against APTs using anti-forensics (parent-PID spoofing, log wiping, fileless execution), provenance graphs split into disjoint subgraphs and fail. Unconstrained LLM agents fabricate causal links violating OS physics, producing fluent but forensically inadmissible narratives. We propose HunterAgent, a neuro-symbolic framework that reframes trace reconstruction as cost-bounded heuristic graph search under partial observability. It uses an asymmetric Generator-Verifier pipeline: the LLM proposes semantic hypotheses within a typed ontology, while a verifier grounds each via identifier-level collisions on surviving orthogonal telemetry. To resolve severed traces, we score hops using a calibrated cost combining semantic divergence and OS temporal potential; schema violations are hard-pruned. A length-discounted epistemic budget prevents inferential drift and forces graceful halting. Under strict LOFO cross-validation on three public benchmarks and an in-house 40-trace dataset, HunterAgent achieves 86.1% mean F1, outperforming the top agentic baseline by 26.7 F1 and KAIROS by 17.1 F1, while cutting path-level hallucination from 61.5% to 6.4%. Under 70% log wiping, recall drops but precision stays >=84%, with 95.7% halting safely. All results hold under the realistic assumption that at least one orthogonal telemetry source survives.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

HunterAgent gives a concrete generator-verifier pipeline for repairing severed provenance graphs when at least one orthogonal telemetry source survives, but the reported gains rest on that untested boundary condition.

read the letter

The paper's central move is to treat trace reconstruction as cost-bounded search: an LLM proposes typed hypotheses about missing links, a verifier checks them via identifier collisions against whatever orthogonal logs remain, and a length-discounted budget plus hard schema pruning keeps the search from drifting. That combination is presented as new.

It does a few things cleanly. The asymmetric pipeline separates fluent proposal from grounding, the cost function mixes semantic divergence with OS temporal potential, and the numbers on three public benchmarks plus their 40-trace set show clear lifts over the top agent baseline and KAIROS. Precision holds above 84% even at 70% wiping, with most runs halting safely.

The load-bearing assumption is stated plainly in the abstract: everything is conditioned on at least one surviving orthogonal source. No results appear for the zero-survival case, where the verifier has nothing to collide against and the system reverts to the unconstrained LLM the authors criticize for fabricating OS-violating links. That regime is realistic for sophisticated anti-forensics and is not explored.

The abstract also gives no protocol details on how the cost function was calibrated, whether the length-discount factor was fixed before seeing the test sets, or what the error bars look like under LOFO. Those omissions make it hard to judge robustness.

This is for forensics and threat-hunting researchers who already work with mixed LLM and symbolic tools and want a practical way to handle partial log corruption. It is worth sending to peer review because the framing is explicit, the benchmarks are public, and the performance delta is large under the stated conditions, even if the zero-survival boundary and experimental controls need more attention.

Referee Report

2 major / 2 minor

Summary. The paper presents HunterAgent, a neuro-symbolic framework for reconstructing causal attack traces from heterogeneous, partially corrupted logs under anti-forensics (e.g., parent-PID spoofing, log wiping). It uses an asymmetric Generator-Verifier pipeline in which an LLM proposes semantic hypotheses within a typed ontology while a verifier grounds links via identifier-level collisions on surviving orthogonal telemetry; hops are scored by a calibrated cost combining semantic divergence and OS temporal potential, with hard pruning of schema violations and a length-discounted epistemic budget to enforce graceful halting. Under LOFO cross-validation on three public benchmarks plus an in-house 40-trace dataset, it reports 86.1% mean F1 (outperforming the top agentic baseline by 26.7 F1 and KAIROS by 17.1 F1), reduces path-level hallucination from 61.5% to 6.4%, and maintains precision >=84% with 95.7% safe halting under 70% log wiping—all conditioned on the assumption that at least one orthogonal telemetry source survives.

Significance. If the reported improvements hold, the work supplies a concrete, cost-bounded neuro-symbolic method that demonstrably reduces hallucination relative to unconstrained LLM agents while preserving forensic admissibility constraints. The explicit LOFO cross-validation on public benchmarks, the quantitative hallucination metric, and the graceful-degradation results under partial wiping constitute measurable strengths that could inform practical SOC tooling.

major comments (2)

[Abstract] Abstract (final sentence) and the description of the verifier: the central performance claims (86.1% F1, 6.4% hallucination, 95.7% safe halting) and the neuro-symbolic guarantee are explicitly conditioned on 'at least one orthogonal telemetry source survives.' No evaluation or ablation is supplied for the zero-survival regime; in that boundary case the verifier cannot operate via identifier-level collisions and the system reduces to the unconstrained LLM agent the introduction criticizes for fabricating OS-violating links. This assumption is load-bearing for the claimed robustness.
[Abstract / Methods (cost function definition)] The cost function (semantic divergence + OS temporal potential) is described as 'calibrated' with a free length-discount factor for the epistemic budget, yet no pre-specified parameter values, external grounding procedure, or sensitivity analysis is provided; this leaves open whether the reported F1 gains are robust or post-hoc tuned on the evaluation sets.

minor comments (2)

[Abstract] The abstract supplies no experimental protocol details, error bars, or verification that the cost function and epistemic budget were not tuned post-hoc; these should be added to the evaluation section for reproducibility.
[Verifier description] Notation for the typed ontology and the precise definition of 'identifier-level collisions' should be formalized with an equation or pseudocode in the verifier subsection.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive and detailed comments. We address each major point below and will make the indicated revisions to improve clarity and completeness.

read point-by-point responses

Referee: [Abstract] Abstract (final sentence) and the description of the verifier: the central performance claims (86.1% F1, 6.4% hallucination, 95.7% safe halting) and the neuro-symbolic guarantee are explicitly conditioned on 'at least one orthogonal telemetry source survives.' No evaluation or ablation is supplied for the zero-survival regime; in that boundary case the verifier cannot operate via identifier-level collisions and the system reduces to the unconstrained LLM agent the introduction criticizes for fabricating OS-violating links. This assumption is load-bearing for the claimed robustness.

Authors: We acknowledge that all reported results and the neuro-symbolic guarantee are conditioned on survival of at least one orthogonal telemetry source, as stated in the abstract. This reflects the verifier's reliance on identifier-level collisions, which cannot occur with zero survival; in that regime the system would reduce to the LLM generator alone. We agree the boundary case should be addressed explicitly. We will revise the abstract's final sentence for greater precision and add a limitations paragraph clarifying that complete log wiping (zero survival) falls outside the evaluated scope, as forensic reconstruction under that condition is not feasible within the current verifier design. revision: yes
Referee: [Abstract / Methods (cost function definition)] The cost function (semantic divergence + OS temporal potential) is described as 'calibrated' with a free length-discount factor for the epistemic budget, yet no pre-specified parameter values, external grounding procedure, or sensitivity analysis is provided; this leaves open whether the reported F1 gains are robust or post-hoc tuned on the evaluation sets.

Authors: The cost parameters were pre-specified via calibration on a held-out development set disjoint from all evaluation benchmarks. We will expand the Methods section to report the exact parameter values, the calibration procedure, and a sensitivity analysis across plausible ranges of the length-discount factor, confirming that the reported F1 improvements remain stable. revision: yes

Circularity Check

0 steps flagged

No circularity; results benchmarked on external public datasets with explicit assumption stated

full rationale

The provided abstract and text report empirical performance (86.1% F1, etc.) under LOFO cross-validation on three public benchmarks plus an in-house dataset. The key assumption ('at least one orthogonal telemetry source survives') is stated explicitly as a boundary condition rather than derived. No equations, self-citations, fitted parameters renamed as predictions, or ansatzes are quoted that would reduce any claimed derivation to its own inputs by construction. The cost function is referenced but not shown in a form that exhibits self-definitional or fitted-input circularity. This is the normal case of a paper whose central claims rest on external benchmarks rather than internal redefinition.

Axiom & Free-Parameter Ledger

1 free parameters · 0 axioms · 0 invented entities

Abstract-only review yields minimal ledger entries; the cost function and epistemic budget are described at high level without explicit parameter values or derivation.

free parameters (1)

length-discount factor for epistemic budget
Referenced as preventing inferential drift but no value or fitting procedure given in abstract.

pith-pipeline@v0.9.1-grok · 5828 in / 1131 out tokens · 32937 ms · 2026-06-29T07:02:56.297476+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

47 extracted references · 19 canonical work pages · 5 internal anchors

[1]

Nodoze: Combatting threat alert fatigue with automated provenance triage,

W. U. Hassan, S. Guo, D. Li, Z. Chen, K. Jee, Z. Li, and A. Bates, “Nodoze: Combatting threat alert fatigue with automated provenance triage,” innetwork and distributed systems security symposium, 2019

2019
[2]

Holmes: real-time apt detection through correlation of suspicious information flows,

S. M. Milajerdi, R. Gjomemo, B. Eshete, R. Sekar, and V . Venkatakrish- nan, “Holmes: real-time apt detection through correlation of suspicious information flows,” in2019 IEEE symposium on security and privacy (SP). IEEE, 2019, pp. 1137–1152

2019
[3]

Selfprompt: Au- tonomously evaluating llm robustness via domain-constrained knowl- edge guidelines and refined adversarial prompts,

A. Pei, Z. Yang, S. Zhu, R. Cheng, and J. Jia, “Selfprompt: Au- tonomously evaluating llm robustness via domain-constrained knowl- edge guidelines and refined adversarial prompts,” inProceedings of the 31st International Conference on Computational Linguistics, 2025, pp. 6840–6854

2025
[4]

Prov2vec: Learning provenance graph representation for unsupervised apt detection,

B. Bhattarai and H. H. Huang, “Prov2vec: Learning provenance graph representation for unsupervised apt detection,”arXiv preprint arXiv:2310.00843, 2023

work page arXiv 2023
[5]

You are what you do: Hunting stealthy malware via data provenance analysis

Q. Wang, W. U. Hassan, D. Li, K. Jee, X. Yu, K. Zou, J. Rhee, Z. Chen, W. Cheng, C. A. Gunteret al., “You are what you do: Hunting stealthy malware via data provenance analysis.” inNDSS, 2020

2020
[6]

{SLEUTH}: Real- time attack scenario reconstruction from{COTS}audit data,

M. N. Hossain, S. M. Milajerdi, J. Wang, B. Eshete, R. Gjomemo, R. Sekar, S. Stoller, and V . Venkatakrishnan, “{SLEUTH}: Real- time attack scenario reconstruction from{COTS}audit data,” in26th USENIX Security Symposium (USENIX Security 17), 2017, pp. 487–504

2017
[7]

Tactical provenance analysis for endpoint detection and response systems,

W. U. Hassan, A. Bates, and D. Marino, “Tactical provenance analysis for endpoint detection and response systems,” in2020 IEEE symposium on security and privacy (SP). IEEE, 2020, pp. 1172–1189

2020
[8]

Sometimes, you aren’t what you do: Mimicry attacks against provenance graph host intrusion detection systems,

A. Goyal, X. Han, G. Wang, and A. Bates, “Sometimes, you aren’t what you do: Mimicry attacks against provenance graph host intrusion detection systems,” in30th Network and Distributed System Security Symposium, 2023

2023
[9]

Graph-based anomaly apt attack detection via threat intelli- gence,

C.-I. Fan, C.-H. Shie, Y .-C. Chang, T. Ban, T. Morikawa, and T. Taka- hashi, “Graph-based anomaly apt attack detection via threat intelli- gence,”IEEE Transactions on Emerging Topics in Computing, 2026

2026
[10]

React: Synergizing reasoning and acting in language models,

S. Yao, J. Zhao, D. Yu, N. Du, I. Shafran, K. R. Narasimhan, and Y . Cao, “React: Synergizing reasoning and acting in language models,” inThe eleventh international conference on learning representations, 2022

2022
[11]

Application of large language models in cybersecurity: A systematic literature review,

I. Hasanov, S. Virtanen, A. Hakkala, and J. Isoaho, “Application of large language models in cybersecurity: A systematic literature review,”IEEE access, vol. 12, pp. 176 751–176 778, 2024

2024
[12]

Beyond detection: large language models and next-generation cybersecurity,

A. Ali and M. C. Ghanem, “Beyond detection: large language models and next-generation cybersecurity,”SHIFRA, vol. 2025, pp. 81–97, 2025

2025
[13]

Heuristic-induced multimodal risk distribution jailbreak attack for multimodal large language models,

T. Ma, X. Jia, R. Duan, X. Li, Y . Huang, X. Jia, Z. Chu, and W. Ren, “Heuristic-induced multimodal risk distribution jailbreak attack for multimodal large language models,” inProceedings of the IEEE/CVF International Conference on Computer Vision, 2025, pp. 2686–2696

2025
[14]

Omnisafebench-mm: A unified benchmark and toolbox for multimodal jailbreak attack-defense evaluation,

X. Jia, J. Liao, Q. Guo, T. Ma, S. Qin, R. Duan, T. Li, Y . Huang, Z. Zeng, D. Wuet al., “Omnisafebench-mm: A unified benchmark and toolbox for multimodal jailbreak attack-defense evaluation,”arXiv preprint arXiv:2512.06589, 2025

work page arXiv 2025
[15]

Reinforcement learning from multi-role debates as feedback for bias mitigation in llms,

R. Cheng, H. Ma, S. Cao, J. Li, A. Pei, Z. Wang, P. Ji, H. Wang, and J. Huo, “Reinforcement learning from multi-role debates as feedback for bias mitigation in llms,”arXiv preprint arXiv:2404.10160, 2024

work page arXiv 2024
[16]

Agr: Age group fairness reward for bias mitigation in llms,

S. Cao, R. Cheng, and Z. Wang, “Agr: Age group fairness reward for bias mitigation in llms,” inICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2025, pp. 1–5

2025
[17]

Oyster-i: Beyond refusal–constructive safety alignment for responsible language models,

R. Duan, J. Liu, X. Jia, S. Zhao, R. Cheng, F. Wang, C. Wei, Y . Xie, C. Liu, D. Liet al., “Oyster-i: Beyond refusal–constructive safety alignment for responsible language models,”arXiv preprint arXiv:2509.01909, 2025

work page arXiv 2025
[18]

Cheng, H

R. Cheng, H. Ma, W. Wang, R. Duan, J. Liu, X. Jia, S. Qin, X. Cao, Y . Liu, and X. Jia, “Inverse reinforcement learning with dynamic reward scaling for llm alignment,”arXiv preprint arXiv:2503.18991, 2025

work page arXiv 2025
[19]

Steering the Verifiability of Multimodal AI Hallucinations

J. Pang, R. Cheng, Z. Ye, X. Ma, Z. Wu, X. Huang, and Y .-G. Jiang, “Steering the verifiability of multimodal ai hallucinations,”arXiv preprint arXiv:2604.06714, 2026

work page internal anchor Pith review Pith/arXiv arXiv 2026
[20]

From Local to Global: A Graph RAG Approach to Query-Focused Summarization

D. Edge, H. Trinh, N. Cheng, J. Bradley, A. Chao, A. Mody, S. Truitt, D. Metropolitansky, R. O. Ness, and J. Larson, “From local to global: A graph rag approach to query-focused summarization,”arXiv preprint arXiv:2404.16130, 2024

work page internal anchor Pith review Pith/arXiv arXiv 2024
[21]

Neuro-Symbolic AI for Cybersecurity: State of the Art, Challenges, and Opportunities

S. B. Hakim, M. Adil, A. Velasquez, S. Xu, and H. H. Song, “Neuro- symbolic ai for cybersecurity: State of the art, challenges, and opportu- nities,”arXiv preprint arXiv:2509.06921, 2025

work page internal anchor Pith review Pith/arXiv arXiv 2025
[22]

Experimenting with neurosymbolic ai for defending against cyber attacks,

M. W. Eckhoff, J. Halvorsen, B. J. Hansen, M. Eian, V . Mavroeidis, R. A. Chetwyn, G. Skjøtskift, and G. Grov, “Experimenting with neurosymbolic ai for defending against cyber attacks,”Neurosymbolic Artificial Intelligence Journal, 2025

2025
[23]

Cheng, H

R. Cheng, H. Ma, T. Ma, and H. Zhang, “Ecoalign: An economi- cally rational framework for efficient lvlm alignment,”arXiv preprint arXiv:2511.11301, 2025

work page arXiv 2025
[24]

Omegalog: High-fidelity attack investigation via transparent multi-layer log analy- sis,

W. U. Hassan, M. A. Noureddine, P. Datta, and A. Bates, “Omegalog: High-fidelity attack investigation via transparent multi-layer log analy- sis,” inNetwork and distributed system security symposium, 2020

2020
[25]

Kairos: Practical intrusion detection and investigation using whole- system provenance,

Z. Cheng, Q. Lv, J. Liang, Y . Wang, D. Sun, T. Pasquier, and X. Han, “Kairos: Practical intrusion detection and investigation using whole- system provenance,” in2024 IEEE Symposium on Security and Privacy (SP). IEEE, 2024, pp. 3533–3551

2024
[26]

Nodlink: An online system for fine-grained apt attack detection and investigation,

S. Li, F. Dong, X. Xiao, H. Wang, F. Shao, J. Chen, Y . Guo, X. Chen, and D. Li, “Nodlink: An online system for fine-grained apt attack detection and investigation,”arXiv preprint arXiv:2311.02331, 2023

work page arXiv 2023
[27]

{ATLAS}: A sequence-based learning approach for attack investigation,

A. Alsaheel, Y . Nan, S. Ma, L. Yu, G. Walkup, Z. B. Celik, X. Zhang, and D. Xu, “{ATLAS}: A sequence-based learning approach for attack investigation,” in30th USENIX security symposium (USENIX security 21), 2021, pp. 3005–3022

2021
[28]

Unicorn: Runtime provenance-based detector for advanced persistent threats,

X. Han, T. Pasquier, A. Bates, J. Mickens, and M. Seltzer, “Unicorn: Runtime provenance-based detector for advanced persistent threats,” arXiv preprint arXiv:2001.01525, 2020

work page arXiv 2001
[29]

Alert prioritization techniques in security monitoring: A focus on severity averaging and alert entities,

C. Bassey, S. Idowu, and C. Ojo, “Alert prioritization techniques in security monitoring: A focus on severity averaging and alert entities,” Saudi J. Eng. Technol., vol. 9, no. 07, pp. 334–339, 2024

2024
[30]

Information- dense reasoning for efficient and auditable security alert triage,

G. Zhao, Y . Zhang, C. Tian, D. Xie, H. Liu, and B. Wang, “Information- dense reasoning for efficient and auditable security alert triage,”arXiv preprint arXiv:2512.08169, 2025

work page arXiv 2025
[31]

Pbi-attack: Prior-guided bimodal interactive black-box jailbreak attack for toxicity maximization,

R. Cheng, Y . Ding, S. Cao, R. Duan, X. Jia, S. Yuan, S. Qin, Z. Wang, and X. Jia, “Pbi-attack: Prior-guided bimodal interactive black-box jailbreak attack for toxicity maximization,” inProceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025, pp. 609–628

2025
[32]

Tuni: A textual unimodal detector for identity inference in clip models,

S. Li, R. Cheng, and X. Jia, “Tuni: A textual unimodal detector for identity inference in clip models,” inProceedings of the Sixth Workshop on Privacy in Natural Language Processing, 2025, pp. 1–13

2025
[33]

Gibberish is all you need for membership inference detection in contrastive language-audio pretraining,

R. Cheng, Y . Ding, S. Cao, and Z. Wang, “Gibberish is all you need for membership inference detection in contrastive language-audio pretraining,” inProceedings of the 2025 International Conference on Multimedia Retrieval, 2025, pp. 108–116

2025
[34]

Strata-sword: A hierarchical safety evaluation towards llms based on reasoning complexity of jailbreak instructions,

S. Zhao, R. Duan, J. Liu, X. Jia, F. Wang, C. Wei, R. Cheng, Y . Xie, C. Liu, Q. Guoet al., “Strata-sword: A hierarchical safety evaluation towards llms based on reasoning complexity of jailbreak instructions,” arXiv preprint arXiv:2509.01444, 2025

work page arXiv 2025
[35]

Membership Inference for Contrastive Pre-training Models with Text-only PII Queries

R. Cheng, Y . Ding, J. Zhao, H. Zhang, H. Ma, T. Zhang, Y . Huang, and X. Li, “Membership inference for contrastive pre-training models with text-only pii queries,”arXiv preprint arXiv:2603.14222, 2026

work page internal anchor Pith review Pith/arXiv arXiv 2026
[36]

Pentestgpt: An llm-empowered automatic penetration testing tool.arXiv preprint arXiv:2308.06782, 2024

G. Deng, Y . Liu, V . Mayoral-Vilches, P. Liu, Y . Li, Y . Xu, T. Zhang, Y . Liu, M. Pinzger, and S. Rass, “Pentestgpt: An llm-empowered automatic penetration testing tool,”arXiv preprint arXiv:2308.06782, 2023

work page arXiv 2023
[37]

Secllm: Enhancing security smell detection in iac with large language models,

G. De Vito, F. Palomba, and F. Ferrucci, “Secllm: Enhancing security smell detection in iac with large language models,”IEEE Access, vol. 13, pp. 204 480–204 498, 2025

2025
[38]

Craken: Cybersecurity llm agent with knowledge-based execution,

M. Shao, H. Xi, N. Rani, M. Udeshi, V . S. C. Putrevu, K. Milner, B. Dolan-Gavitt, S. K. Shukla, P. Krishnamurthy, F. Khorramiet al., “Craken: Cybersecurity llm agent with knowledge-based execution,” arXiv preprint arXiv:2505.17107, 2025

work page arXiv 2025
[39]

Llm-driven provenance forensics for threat investigation and detection,

K. Mukherjee and M. Kantarcioglu, “Llm-driven provenance forensics for threat investigation and detection,”arXiv preprint arXiv:2508.21323, 2025

work page arXiv 2025
[40]

arXiv preprint arXiv:2509.18970 , year=

X. Lin, Y . Ning, J. Zhang, Y . Dong, Y . Liu, Y . Wu, X. Qi, N. Sun, Y . Shang, K. Wanget al., “Llm-based agents suffer from hallucina- tions: A survey of taxonomy, methods, and directions,”arXiv preprint arXiv:2509.18970, 2025

work page arXiv 2025
[41]

The emerged security and privacy of llm agent: A survey with case studies,

F. He, T. Zhu, D. Ye, B. Liu, W. Zhou, and P. S. Yu, “The emerged security and privacy of llm agent: A survey with case studies,”ACM Computing Surveys, vol. 58, no. 6, pp. 1–36, 2025

2025
[42]

Ecoalign: An economically rational framework for efficient lvlm alignment,

R. Cheng, H. Ma, T. Ma, and H. Zhang, “Ecoalign: An economically rational framework for efficient lvlm alignment,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026

2026
[43]

Adversarial Orthogonal Disentanglement for LVLM Hallucination Mitigation

R. Cheng, H. Ma, Z. Hai, Y . Huang, R. Duan, T. Zhang, X. Yang, Z. Ye, and X. Ma, “Adversarial orthogonal disentanglement for lvlm hallucination mitigation,” 2026. [Online]. Available: https://arxiv.org/abs/2605.25377

work page internal anchor Pith review Pith/arXiv arXiv 2026
[44]

Mitre att&ck: Design and philosophy,

B. E. Strom, A. Applebaum, D. P. Miller, K. C. Nickels, A. G. Pennington, and C. B. Thomas, “Mitre att&ck: Design and philosophy,” 2018

2018
[45]

Analyzing the usefulness of the darpa transparent computing e5 dataset in apt detection research,

G. Ouyang, Y . Huang, and C. Zhang, “Analyzing the usefulness of the darpa transparent computing e5 dataset in apt detection research,” inInternational Conference on Computer, Artificial Intelligence, and Control Engineering (CAICE 2022), vol. 12288. SPIE, 2022, pp. 400– 409

2022
[46]

Analyzing the usefulness of the darpa optc dataset in cyber threat detection research,

M. M. Anjum, S. Iqbal, and B. Hamelin, “Analyzing the usefulness of the darpa optc dataset in cyber threat detection research,” inProceedings of the 26th ACM symposium on access control models and technologies, 2021, pp. 27–32

2021
[47]

Inductive representation learning on large graphs,

W. L. Hamilton, R. Ying, and J. Leskovec, “Inductive representation learning on large graphs,” inAdvances in Neural Information Processing Systems (NeurIPS), 2017. APPENDIXA EVADEKITPERTURBATIONPSEUDOCODE EVADEKITtakes a labelled traceT= ( ˆG,L)from any of the public benchmarks (D1, D2, D3) and produces the perturbed traceT † = ( ˆG†,L)visible to the SUT,...

2017

[1] [1]

Nodoze: Combatting threat alert fatigue with automated provenance triage,

W. U. Hassan, S. Guo, D. Li, Z. Chen, K. Jee, Z. Li, and A. Bates, “Nodoze: Combatting threat alert fatigue with automated provenance triage,” innetwork and distributed systems security symposium, 2019

2019

[2] [2]

Holmes: real-time apt detection through correlation of suspicious information flows,

S. M. Milajerdi, R. Gjomemo, B. Eshete, R. Sekar, and V . Venkatakrish- nan, “Holmes: real-time apt detection through correlation of suspicious information flows,” in2019 IEEE symposium on security and privacy (SP). IEEE, 2019, pp. 1137–1152

2019

[3] [3]

Selfprompt: Au- tonomously evaluating llm robustness via domain-constrained knowl- edge guidelines and refined adversarial prompts,

A. Pei, Z. Yang, S. Zhu, R. Cheng, and J. Jia, “Selfprompt: Au- tonomously evaluating llm robustness via domain-constrained knowl- edge guidelines and refined adversarial prompts,” inProceedings of the 31st International Conference on Computational Linguistics, 2025, pp. 6840–6854

2025

[4] [4]

Prov2vec: Learning provenance graph representation for unsupervised apt detection,

B. Bhattarai and H. H. Huang, “Prov2vec: Learning provenance graph representation for unsupervised apt detection,”arXiv preprint arXiv:2310.00843, 2023

work page arXiv 2023

[5] [5]

You are what you do: Hunting stealthy malware via data provenance analysis

Q. Wang, W. U. Hassan, D. Li, K. Jee, X. Yu, K. Zou, J. Rhee, Z. Chen, W. Cheng, C. A. Gunteret al., “You are what you do: Hunting stealthy malware via data provenance analysis.” inNDSS, 2020

2020

[6] [6]

{SLEUTH}: Real- time attack scenario reconstruction from{COTS}audit data,

M. N. Hossain, S. M. Milajerdi, J. Wang, B. Eshete, R. Gjomemo, R. Sekar, S. Stoller, and V . Venkatakrishnan, “{SLEUTH}: Real- time attack scenario reconstruction from{COTS}audit data,” in26th USENIX Security Symposium (USENIX Security 17), 2017, pp. 487–504

2017

[7] [7]

Tactical provenance analysis for endpoint detection and response systems,

W. U. Hassan, A. Bates, and D. Marino, “Tactical provenance analysis for endpoint detection and response systems,” in2020 IEEE symposium on security and privacy (SP). IEEE, 2020, pp. 1172–1189

2020

[8] [8]

Sometimes, you aren’t what you do: Mimicry attacks against provenance graph host intrusion detection systems,

A. Goyal, X. Han, G. Wang, and A. Bates, “Sometimes, you aren’t what you do: Mimicry attacks against provenance graph host intrusion detection systems,” in30th Network and Distributed System Security Symposium, 2023

2023

[9] [9]

Graph-based anomaly apt attack detection via threat intelli- gence,

C.-I. Fan, C.-H. Shie, Y .-C. Chang, T. Ban, T. Morikawa, and T. Taka- hashi, “Graph-based anomaly apt attack detection via threat intelli- gence,”IEEE Transactions on Emerging Topics in Computing, 2026

2026

[10] [10]

React: Synergizing reasoning and acting in language models,

S. Yao, J. Zhao, D. Yu, N. Du, I. Shafran, K. R. Narasimhan, and Y . Cao, “React: Synergizing reasoning and acting in language models,” inThe eleventh international conference on learning representations, 2022

2022

[11] [11]

Application of large language models in cybersecurity: A systematic literature review,

I. Hasanov, S. Virtanen, A. Hakkala, and J. Isoaho, “Application of large language models in cybersecurity: A systematic literature review,”IEEE access, vol. 12, pp. 176 751–176 778, 2024

2024

[12] [12]

Beyond detection: large language models and next-generation cybersecurity,

A. Ali and M. C. Ghanem, “Beyond detection: large language models and next-generation cybersecurity,”SHIFRA, vol. 2025, pp. 81–97, 2025

2025

[13] [13]

Heuristic-induced multimodal risk distribution jailbreak attack for multimodal large language models,

T. Ma, X. Jia, R. Duan, X. Li, Y . Huang, X. Jia, Z. Chu, and W. Ren, “Heuristic-induced multimodal risk distribution jailbreak attack for multimodal large language models,” inProceedings of the IEEE/CVF International Conference on Computer Vision, 2025, pp. 2686–2696

2025

[14] [14]

Omnisafebench-mm: A unified benchmark and toolbox for multimodal jailbreak attack-defense evaluation,

X. Jia, J. Liao, Q. Guo, T. Ma, S. Qin, R. Duan, T. Li, Y . Huang, Z. Zeng, D. Wuet al., “Omnisafebench-mm: A unified benchmark and toolbox for multimodal jailbreak attack-defense evaluation,”arXiv preprint arXiv:2512.06589, 2025

work page arXiv 2025

[15] [15]

Reinforcement learning from multi-role debates as feedback for bias mitigation in llms,

R. Cheng, H. Ma, S. Cao, J. Li, A. Pei, Z. Wang, P. Ji, H. Wang, and J. Huo, “Reinforcement learning from multi-role debates as feedback for bias mitigation in llms,”arXiv preprint arXiv:2404.10160, 2024

work page arXiv 2024

[16] [16]

Agr: Age group fairness reward for bias mitigation in llms,

S. Cao, R. Cheng, and Z. Wang, “Agr: Age group fairness reward for bias mitigation in llms,” inICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2025, pp. 1–5

2025

[17] [17]

Oyster-i: Beyond refusal–constructive safety alignment for responsible language models,

R. Duan, J. Liu, X. Jia, S. Zhao, R. Cheng, F. Wang, C. Wei, Y . Xie, C. Liu, D. Liet al., “Oyster-i: Beyond refusal–constructive safety alignment for responsible language models,”arXiv preprint arXiv:2509.01909, 2025

work page arXiv 2025

[18] [18]

Cheng, H

R. Cheng, H. Ma, W. Wang, R. Duan, J. Liu, X. Jia, S. Qin, X. Cao, Y . Liu, and X. Jia, “Inverse reinforcement learning with dynamic reward scaling for llm alignment,”arXiv preprint arXiv:2503.18991, 2025

work page arXiv 2025

[19] [19]

Steering the Verifiability of Multimodal AI Hallucinations

J. Pang, R. Cheng, Z. Ye, X. Ma, Z. Wu, X. Huang, and Y .-G. Jiang, “Steering the verifiability of multimodal ai hallucinations,”arXiv preprint arXiv:2604.06714, 2026

work page internal anchor Pith review Pith/arXiv arXiv 2026

[20] [20]

From Local to Global: A Graph RAG Approach to Query-Focused Summarization

D. Edge, H. Trinh, N. Cheng, J. Bradley, A. Chao, A. Mody, S. Truitt, D. Metropolitansky, R. O. Ness, and J. Larson, “From local to global: A graph rag approach to query-focused summarization,”arXiv preprint arXiv:2404.16130, 2024

work page internal anchor Pith review Pith/arXiv arXiv 2024

[21] [21]

Neuro-Symbolic AI for Cybersecurity: State of the Art, Challenges, and Opportunities

S. B. Hakim, M. Adil, A. Velasquez, S. Xu, and H. H. Song, “Neuro- symbolic ai for cybersecurity: State of the art, challenges, and opportu- nities,”arXiv preprint arXiv:2509.06921, 2025

work page internal anchor Pith review Pith/arXiv arXiv 2025

[22] [22]

Experimenting with neurosymbolic ai for defending against cyber attacks,

M. W. Eckhoff, J. Halvorsen, B. J. Hansen, M. Eian, V . Mavroeidis, R. A. Chetwyn, G. Skjøtskift, and G. Grov, “Experimenting with neurosymbolic ai for defending against cyber attacks,”Neurosymbolic Artificial Intelligence Journal, 2025

2025

[23] [23]

Cheng, H

R. Cheng, H. Ma, T. Ma, and H. Zhang, “Ecoalign: An economi- cally rational framework for efficient lvlm alignment,”arXiv preprint arXiv:2511.11301, 2025

work page arXiv 2025

[24] [24]

Omegalog: High-fidelity attack investigation via transparent multi-layer log analy- sis,

W. U. Hassan, M. A. Noureddine, P. Datta, and A. Bates, “Omegalog: High-fidelity attack investigation via transparent multi-layer log analy- sis,” inNetwork and distributed system security symposium, 2020

2020

[25] [25]

Kairos: Practical intrusion detection and investigation using whole- system provenance,

Z. Cheng, Q. Lv, J. Liang, Y . Wang, D. Sun, T. Pasquier, and X. Han, “Kairos: Practical intrusion detection and investigation using whole- system provenance,” in2024 IEEE Symposium on Security and Privacy (SP). IEEE, 2024, pp. 3533–3551

2024

[26] [26]

Nodlink: An online system for fine-grained apt attack detection and investigation,

S. Li, F. Dong, X. Xiao, H. Wang, F. Shao, J. Chen, Y . Guo, X. Chen, and D. Li, “Nodlink: An online system for fine-grained apt attack detection and investigation,”arXiv preprint arXiv:2311.02331, 2023

work page arXiv 2023

[27] [27]

{ATLAS}: A sequence-based learning approach for attack investigation,

A. Alsaheel, Y . Nan, S. Ma, L. Yu, G. Walkup, Z. B. Celik, X. Zhang, and D. Xu, “{ATLAS}: A sequence-based learning approach for attack investigation,” in30th USENIX security symposium (USENIX security 21), 2021, pp. 3005–3022

2021

[28] [28]

Unicorn: Runtime provenance-based detector for advanced persistent threats,

X. Han, T. Pasquier, A. Bates, J. Mickens, and M. Seltzer, “Unicorn: Runtime provenance-based detector for advanced persistent threats,” arXiv preprint arXiv:2001.01525, 2020

work page arXiv 2001

[29] [29]

Alert prioritization techniques in security monitoring: A focus on severity averaging and alert entities,

C. Bassey, S. Idowu, and C. Ojo, “Alert prioritization techniques in security monitoring: A focus on severity averaging and alert entities,” Saudi J. Eng. Technol., vol. 9, no. 07, pp. 334–339, 2024

2024

[30] [30]

Information- dense reasoning for efficient and auditable security alert triage,

G. Zhao, Y . Zhang, C. Tian, D. Xie, H. Liu, and B. Wang, “Information- dense reasoning for efficient and auditable security alert triage,”arXiv preprint arXiv:2512.08169, 2025

work page arXiv 2025

[31] [31]

Pbi-attack: Prior-guided bimodal interactive black-box jailbreak attack for toxicity maximization,

R. Cheng, Y . Ding, S. Cao, R. Duan, X. Jia, S. Yuan, S. Qin, Z. Wang, and X. Jia, “Pbi-attack: Prior-guided bimodal interactive black-box jailbreak attack for toxicity maximization,” inProceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025, pp. 609–628

2025

[32] [32]

Tuni: A textual unimodal detector for identity inference in clip models,

S. Li, R. Cheng, and X. Jia, “Tuni: A textual unimodal detector for identity inference in clip models,” inProceedings of the Sixth Workshop on Privacy in Natural Language Processing, 2025, pp. 1–13

2025

[33] [33]

Gibberish is all you need for membership inference detection in contrastive language-audio pretraining,

R. Cheng, Y . Ding, S. Cao, and Z. Wang, “Gibberish is all you need for membership inference detection in contrastive language-audio pretraining,” inProceedings of the 2025 International Conference on Multimedia Retrieval, 2025, pp. 108–116

2025

[34] [34]

Strata-sword: A hierarchical safety evaluation towards llms based on reasoning complexity of jailbreak instructions,

S. Zhao, R. Duan, J. Liu, X. Jia, F. Wang, C. Wei, R. Cheng, Y . Xie, C. Liu, Q. Guoet al., “Strata-sword: A hierarchical safety evaluation towards llms based on reasoning complexity of jailbreak instructions,” arXiv preprint arXiv:2509.01444, 2025

work page arXiv 2025

[35] [35]

Membership Inference for Contrastive Pre-training Models with Text-only PII Queries

R. Cheng, Y . Ding, J. Zhao, H. Zhang, H. Ma, T. Zhang, Y . Huang, and X. Li, “Membership inference for contrastive pre-training models with text-only pii queries,”arXiv preprint arXiv:2603.14222, 2026

work page internal anchor Pith review Pith/arXiv arXiv 2026

[36] [36]

Pentestgpt: An llm-empowered automatic penetration testing tool.arXiv preprint arXiv:2308.06782, 2024

G. Deng, Y . Liu, V . Mayoral-Vilches, P. Liu, Y . Li, Y . Xu, T. Zhang, Y . Liu, M. Pinzger, and S. Rass, “Pentestgpt: An llm-empowered automatic penetration testing tool,”arXiv preprint arXiv:2308.06782, 2023

work page arXiv 2023

[37] [37]

Secllm: Enhancing security smell detection in iac with large language models,

G. De Vito, F. Palomba, and F. Ferrucci, “Secllm: Enhancing security smell detection in iac with large language models,”IEEE Access, vol. 13, pp. 204 480–204 498, 2025

2025

[38] [38]

Craken: Cybersecurity llm agent with knowledge-based execution,

M. Shao, H. Xi, N. Rani, M. Udeshi, V . S. C. Putrevu, K. Milner, B. Dolan-Gavitt, S. K. Shukla, P. Krishnamurthy, F. Khorramiet al., “Craken: Cybersecurity llm agent with knowledge-based execution,” arXiv preprint arXiv:2505.17107, 2025

work page arXiv 2025

[39] [39]

Llm-driven provenance forensics for threat investigation and detection,

K. Mukherjee and M. Kantarcioglu, “Llm-driven provenance forensics for threat investigation and detection,”arXiv preprint arXiv:2508.21323, 2025

work page arXiv 2025

[40] [40]

arXiv preprint arXiv:2509.18970 , year=

X. Lin, Y . Ning, J. Zhang, Y . Dong, Y . Liu, Y . Wu, X. Qi, N. Sun, Y . Shang, K. Wanget al., “Llm-based agents suffer from hallucina- tions: A survey of taxonomy, methods, and directions,”arXiv preprint arXiv:2509.18970, 2025

work page arXiv 2025

[41] [41]

The emerged security and privacy of llm agent: A survey with case studies,

F. He, T. Zhu, D. Ye, B. Liu, W. Zhou, and P. S. Yu, “The emerged security and privacy of llm agent: A survey with case studies,”ACM Computing Surveys, vol. 58, no. 6, pp. 1–36, 2025

2025

[42] [42]

Ecoalign: An economically rational framework for efficient lvlm alignment,

R. Cheng, H. Ma, T. Ma, and H. Zhang, “Ecoalign: An economically rational framework for efficient lvlm alignment,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026

2026

[43] [43]

Adversarial Orthogonal Disentanglement for LVLM Hallucination Mitigation

R. Cheng, H. Ma, Z. Hai, Y . Huang, R. Duan, T. Zhang, X. Yang, Z. Ye, and X. Ma, “Adversarial orthogonal disentanglement for lvlm hallucination mitigation,” 2026. [Online]. Available: https://arxiv.org/abs/2605.25377

work page internal anchor Pith review Pith/arXiv arXiv 2026

[44] [44]

Mitre att&ck: Design and philosophy,

B. E. Strom, A. Applebaum, D. P. Miller, K. C. Nickels, A. G. Pennington, and C. B. Thomas, “Mitre att&ck: Design and philosophy,” 2018

2018

[45] [45]

Analyzing the usefulness of the darpa transparent computing e5 dataset in apt detection research,

G. Ouyang, Y . Huang, and C. Zhang, “Analyzing the usefulness of the darpa transparent computing e5 dataset in apt detection research,” inInternational Conference on Computer, Artificial Intelligence, and Control Engineering (CAICE 2022), vol. 12288. SPIE, 2022, pp. 400– 409

2022

[46] [46]

Analyzing the usefulness of the darpa optc dataset in cyber threat detection research,

M. M. Anjum, S. Iqbal, and B. Hamelin, “Analyzing the usefulness of the darpa optc dataset in cyber threat detection research,” inProceedings of the 26th ACM symposium on access control models and technologies, 2021, pp. 27–32

2021

[47] [47]

Inductive representation learning on large graphs,

W. L. Hamilton, R. Ying, and J. Leskovec, “Inductive representation learning on large graphs,” inAdvances in Neural Information Processing Systems (NeurIPS), 2017. APPENDIXA EVADEKITPERTURBATIONPSEUDOCODE EVADEKITtakes a labelled traceT= ( ˆG,L)from any of the public benchmarks (D1, D2, D3) and produces the perturbed traceT † = ( ˆG†,L)visible to the SUT,...

2017