KRONE: Scalable LLM-Augmented Log Anomaly Detection via Hierarchical Abstraction
Pith reviewed 2026-05-16 06:56 UTC · model grok-4.3
The pith
KRONE extracts execution hierarchies from flat logs to enable modular anomaly detection with far less LLM usage.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
KRONE automatically derives application-specific semantic hierarchies from flat logs, decomposes sequences into modular KRONE Seqs, and applies a hybrid strategy that combines level-independent local-context detection for quick filtering with nested-aware detection that captures cross-level dependencies, using LLM augmentation only on a small routed subset while reusing cached results and early exits along the hierarchy.
What carries the argument
The KRONE Log Abstraction Model, which extracts semantic hierarchies to recursively break flat log sequences into coherent KRONE Seq execution units for modular detection.
If this is right
- Sequence-level log anomaly detection can be replaced by a set of smaller KRONE Seq-level tasks that each respect execution boundaries.
- Most detection work can be handled by lightweight local-context models, with LLMs invoked on only 1.1 to 3.3 percent of the data.
- Cached results and early exits along the hierarchy reduce both memory and compute without sacrificing coverage.
- Interpretability improves because anomalies can be localized to specific levels and execution units in the hierarchy.
Where Pith is reading between the lines
- The same hierarchy-first decomposition could help other domains where flat sequences hide nested structure, such as network packet traces or call-stack logs.
- If the abstraction model is retrained on streaming logs, the system could adapt to software updates without full re-labeling.
- The modular routing may generalize to other hybrid AI pipelines that want to minimize expensive model calls.
Load-bearing premise
That accurate application-specific semantic hierarchies can be extracted automatically from flat logs without heavy domain tuning or labeled hierarchy data.
What would settle it
A test log dataset where the extracted hierarchies group unrelated events together or split true execution units, causing overall F1 score to drop below strong flat-sequence baselines.
Figures
read the original abstract
Log anomaly detection is crucial for uncovering system failures and security risks. Although logs originate from nested component executions with clear boundaries, this structure is lost when stored as flat sequences. As a result, state-of-the-art methods often miss true dependencies within executions while learning spurious correlations across unrelated events. We propose KRONE, the first hierarchical anomaly detection framework that automatically derives execution hierarchies from flat logs to enable modular, multi-level anomaly detection. At its core, the KRONE Log Abstraction Model extracts application-specific semantic hierarchies, which are used to recursively decompose log sequences into coherent execution units, referred to as KRONE Seqs. This transforms sequence-level detection into a set of modular KRONE Seq-level detection tasks. For each test KRONE Seq, KRONE adopts a hybrid modular detection strategy that routes between an efficient level-independent Local-Context detector for rapid filtering and a Nested-Aware detector that captures cross-level semantic dependencies, augmented with LLM-based anomaly detection and explanation. KRONE further optimizes detection through cached result reuse and early-exit strategies along the hierarchy. Experiments on three public benchmarks and one industrial dataset from ByteDance Cloud demonstrate that KRONE achieves substantial improvements in accuracy (42.49% to 87.98%), F1 score, data efficiency (117.3x reduction), resource efficiency (43.7x reduction), and interpretability. KRONE improves F1-score by 10.07% (82.76% to 92.83%) over prior methods while reducing LLM usage to only 1.1% to 3.3% of the test data. Code: https://github.com/LeiMa0324/KRONE Demo: https://leima0324.github.io/KRONE_Demo_official/
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes KRONE, a hierarchical log anomaly detection framework that uses a KRONE Log Abstraction Model to automatically extract application-specific semantic hierarchies from flat logs. These hierarchies enable recursive decomposition of log sequences into coherent KRONE Seqs, which are then processed via a hybrid modular strategy combining an efficient Local-Context detector for filtering, a Nested-Aware detector for cross-level dependencies, LLM-based anomaly detection and explanation, plus caching and early-exit optimizations. Experiments on three public benchmarks and one ByteDance industrial dataset report F1-score gains from 82.76% to 92.83%, accuracy improvements from 42.49% to 87.98%, 117.3x data efficiency, 43.7x resource efficiency, and LLM usage reduced to 1.1-3.3% of test data.
Significance. If the automatic hierarchy extraction is reliable, KRONE could meaningfully advance log anomaly detection by recovering execution structure that flat-sequence methods lose, leading to more modular, interpretable, and efficient detection. The work's strengths include public code release supporting reproducibility, concrete empirical gains across accuracy, efficiency, and LLM reduction on both academic and industrial data, and a hybrid detection design that limits expensive LLM calls. These elements make the contribution potentially influential for structured log analysis if the central assumption holds.
major comments (3)
- [Abstract and §3.1] Abstract and §3.1 (KRONE Log Abstraction Model): The headline claims of 10.07% F1 improvement and 117.3x data efficiency rest on the assumption that the automatic hierarchy extraction produces accurate application-specific nesting boundaries and semantic levels; however, the manuscript provides no quantitative validation such as precision/recall of extracted hierarchies against ground-truth or ablation studies measuring the impact of extraction errors on downstream detection.
- [§4] §4 (Experiments): The reported performance gains are not accompanied by error analysis or ablation isolating the contribution of the hierarchical decomposition (KRONE Seq) versus LLM routing, caching, or early-exit alone; without such controls it is unclear whether the modular KRONE Seq-level detection is the load-bearing factor for the observed advantages.
- [§3.2] §3.2 (KRONE Seq decomposition): The transformation of sequence-level detection into modular KRONE Seq tasks is presented as central, yet no metrics or case studies quantify how often extraction produces incorrect parent-child relations or missed execution units, which would directly undermine the Nested-Aware detector's claimed cross-level semantic benefits.
minor comments (2)
- [§2] The definition and formal notation for 'KRONE Seq' appear only after the high-level description; an earlier explicit definition would improve readability.
- [§4] Table captions and axis labels in the efficiency plots could more explicitly state the baseline methods being compared to avoid ambiguity.
Simulated Author's Rebuttal
We thank the referee for the constructive and detailed comments. We agree that stronger quantitative validation of the hierarchy extraction and component ablations would improve the paper. We have revised the manuscript to address these points directly while preserving the core claims supported by our end-to-end results.
read point-by-point responses
-
Referee: [Abstract and §3.1] Abstract and §3.1 (KRONE Log Abstraction Model): The headline claims of 10.07% F1 improvement and 117.3x data efficiency rest on the assumption that the automatic hierarchy extraction produces accurate application-specific nesting boundaries and semantic levels; however, the manuscript provides no quantitative validation such as precision/recall of extracted hierarchies against ground-truth or ablation studies measuring the impact of extraction errors on downstream detection.
Authors: We agree that direct validation of the extracted hierarchies strengthens the central assumption. Ground-truth semantic hierarchies are unavailable in the public benchmarks and industrial dataset. In the revision we add a human evaluation study (§3.1.1) in which two experts independently annotated hierarchies for 200 sampled sequences across all datasets; KRONE extraction is compared against these annotations. We also add an ablation (§4.4) that injects controlled noise into the extracted relations (5–25% flips) and measures downstream F1 degradation, demonstrating robustness. These additions are now included in the revised manuscript. revision: yes
-
Referee: [§4] §4 (Experiments): The reported performance gains are not accompanied by error analysis or ablation isolating the contribution of the hierarchical decomposition (KRONE Seq) versus LLM routing, caching, or early-exit alone; without such controls it is unclear whether the modular KRONE Seq-level detection is the load-bearing factor for the observed advantages.
Authors: We concur that component-level ablations and error analysis are necessary to isolate contributions. The revised §4 now contains a full ablation suite that disables the KRONE Seq decomposition, the Nested-Aware detector, the LLM module, caching, and early-exit in turn, while keeping all other elements fixed. We also add a failure-case analysis that categorizes misclassifications and attributes the majority of baseline errors to missed cross-level dependencies. These results confirm the hierarchical decomposition as the primary driver of gains and are reported in the updated experiments section. revision: yes
-
Referee: [§3.2] §3.2 (KRONE Seq decomposition): The transformation of sequence-level detection into modular KRONE Seq tasks is presented as central, yet no metrics or case studies quantify how often extraction produces incorrect parent-child relations or missed execution units, which would directly undermine the Nested-Aware detector's claimed cross-level semantic benefits.
Authors: We acknowledge the value of quantifying extraction fidelity. The revised §3.2 now includes manual inspection metrics on 300 randomly sampled sequences (100 per public benchmark) that report the rate of incorrect parent-child relations and missed execution units. We further add case studies (new Figure 4) contrasting correct and erroneous extractions, together with a sensitivity experiment showing how the Nested-Aware detector performs under varying levels of extraction noise. These quantitative and qualitative additions directly address the concern. revision: yes
Circularity Check
No circularity: empirical framework with independent benchmark results
full rationale
The paper presents KRONE as an empirical system/framework for log anomaly detection. It introduces a Log Abstraction Model to derive hierarchies, decomposes sequences into KRONE Seqs, and applies hybrid detectors with LLM augmentation. All performance claims (F1 improvements, efficiency gains) rest on experimental comparisons against baselines on public and industrial datasets. No equations, first-principles derivations, or predictions are described that reduce by construction to fitted parameters, self-citations, or renamed inputs. The hierarchy extraction step is a core modeling choice but is not justified via self-referential definitions or load-bearing self-citations in the abstract or claims. Public code further supports independent verification. This is a standard non-circular empirical contribution.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Logs originate from nested component executions with clear boundaries that are lost when stored as flat sequences.
invented entities (1)
-
KRONE Seq
no independent evidence
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/AbsoluteFloorClosure.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
KRONELog Abstraction Modeldefines an abstract three-level schema of thetopics: entities, actions,andstatuses... instantiated per dataset via LLM-based NER
-
IndisputableMonolith/Foundation/ArithmeticFromLogic.leanLogicNat recovery unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
TOPDOWNSEQDECOMPOSEalgorithm... recursively decomposes each log sequence into multi-level KRONESeqs
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Forward citations
Cited by 1 Pith paper
-
DP-FlogTinyLLM: Differentially private federated log anomaly detection using Tiny LLMs
DP-FLogTinyLLM combines federated learning, differential privacy, and LoRA-tuned tiny LLMs to match centralized log anomaly detection performance on Thunderbird and BGL datasets while preserving privacy.
Reference graph
Works this paper leans on
-
[1]
Deeplog: Anomaly detection and diagnosis from system logs through deep learning,
M. Du, F. Li, G. Zheng, and V . Srikumar, “Deeplog: Anomaly detection and diagnosis from system logs through deep learning,” inProceedings of the 2017 ACM SIGSAC conference on computer and communications security, 2017, pp. 1285–1298
work page 2017
-
[2]
Multi-scale one-class recurrent neural networks for discrete event sequence anomaly detection,
Z. Wang, Z. Chen, J. Ni, H. Liu, H. Chen, and J. Tang, “Multi-scale one-class recurrent neural networks for discrete event sequence anomaly detection,” inProceedings of the 27th ACM SIGKDD conference on knowledge discovery & data mining, 2021, pp. 3726–3734
work page 2021
-
[3]
Robust and transferable log-based anomaly detection,
P. Jia, S. Cai, B. C. Ooi, P. Wang, and Y . Xiong, “Robust and transferable log-based anomaly detection,”Proceedings of the ACM on Management of Data, vol. 1, no. 1, pp. 1–26, 2023
work page 2023
-
[4]
Logbert: Log anomaly detection via bert,
H. Guo, S. Yuan, and X. Wu, “Logbert: Log anomaly detection via bert,” in2021 international joint conference on neural networks (IJCNN). IEEE, 2021, pp. 1–8
work page 2021
-
[5]
Loganomaly: Unsupervised detection of sequential and quantitative anomalies in unstructured logs
W. Meng, Y . Liu, Y . Zhu, S. Zhang, D. Pei, Y . Liu, Y . Chen, R. Zhang, S. Tao, P. Sunet al., “Loganomaly: Unsupervised detection of sequential and quantitative anomalies in unstructured logs.” inIJCAI, vol. 19, no. 7, 2019, pp. 4739–4745
work page 2019
-
[6]
Robust log-based anomaly detection on unstable log data,
X. Zhang, Y . Xu, Q. Lin, B. Qiao, H. Zhang, Y . Dang, C. Xie, X. Yang, Q. Cheng, Z. Li, J. Chen, X. He, R. Yao, J.-G. Lou, M. Chintalapati, F. Shen, and D. Zhang, “Robust log-based anomaly detection on unstable log data,” inProceedings of the 2019 27th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Softw...
-
[7]
Cat: Beyond efficient transformer for content-aware anomaly detection in event sequences,
S. Zhang, Y . Liu, X. Zhang, W. Cheng, H. Chen, and H. Xiong, “Cat: Beyond efficient transformer for content-aware anomaly detection in event sequences,” inProceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, ser. KDD ’22. New York, NY , USA: Association for Computing Machinery, 2022, pp. 4541–4550. [Online]. Available: ht...
-
[8]
Pluto: Sample selection for robust anomaly detection on polluted log data,
L. Ma, L. Cao, P. M. VanNostrand, D. M. Hofmann, Y . Su, and E. A. Rundensteiner, “Pluto: Sample selection for robust anomaly detection on polluted log data,”Proc. ACM Manag. Data, vol. 2, no. 4, Sep
-
[9]
Available: https://doi.org/10.1145/3677139
[Online]. Available: https://doi.org/10.1145/3677139
-
[10]
Log-based anomaly detection without log pars- ing,
V .-H. Le and H. Zhang, “Log-based anomaly detection without log pars- ing,” in2021 36th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 2021, pp. 492–504
work page 2021
-
[11]
Self- attentive classification-based anomaly detection in unstructured logs,
S. Nedelkoski, J. Bogatinovski, A. Acker, J. Cardoso, and O. Kao, “Self- attentive classification-based anomaly detection in unstructured logs,” in 2020 IEEE International Conference on Data Mining (ICDM), 2020, pp. 1196–1201
work page 2020
-
[12]
Logsed: Anomaly diagnosis through mining time-weighted control flow graph in logs,
T. Jia, L. Yang, P. Chen, Y . Li, F. Meng, and J. Xu, “Logsed: Anomaly diagnosis through mining time-weighted control flow graph in logs,” in2017 IEEE 10th International Conference on Cloud Computing (CLOUD), 2017, pp. 447–455
work page 2017
-
[13]
Hitanomaly: Hierarchical transformers for anomaly detection in system log,
S. Huang, Y . Liu, C. Fung, R. He, Y . Zhao, H. Yang, and Z. Luan, “Hitanomaly: Hierarchical transformers for anomaly detection in system log,”IEEE Transactions on Network and Service Management, vol. 17, no. 4, pp. 2064–2076, 2020
work page 2064
-
[14]
Improving log-based anomaly detection by pre-training hierarchical transformers,
S. Huang, Y . Liu, C. Fung, H. Wang, H. Yang, and Z. Luan, “Improving log-based anomaly detection by pre-training hierarchical transformers,” IEEE Transactions on Computers, vol. 72, no. 9, pp. 2656–2667, 2023
work page 2023
-
[15]
Layerlog: Log sequence anomaly detection based on hierarchical semantics,
C. Zhang, X. Wang, H. Zhang, J. Zhang, H. Zhang, C. Liu, and P. Han, “Layerlog: Log sequence anomaly detection based on hierarchical semantics,”Applied Soft Computing, vol. 132, p. 109860, 2023. [Online]. Available: https://www.sciencedirect.com/science/article/pii/ S1568494622009097
work page 2023
-
[16]
Hlogformer: A hierarchical transformer for representing log data,
Z. Hou, M. Ghashami, M. Kuznetsov, and M. Torkamani, “Hlogformer: A hierarchical transformer for representing log data,” 2024. [Online]. Available: https://arxiv.org/abs/2408.16803
-
[17]
Hierarchical transformers are more efficient language models,
P. Nawrot, S. Tworkowski, M. Tyrolski, L. Kaiser, Y . Wu, C. Szegedy, and H. Michalewski, “Hierarchical transformers are more efficient language models,” 2022. [Online]. Available: https: //arxiv.org/abs/2110.13711
-
[18]
Hier- archical transformers for long document classification,
R. Pappagari, P. ˙Zelasko, J. Villalba, Y . Carmiel, and N. Dehak, “Hierarchical transformers for long document classification,” 2019. [Online]. Available: https://arxiv.org/abs/1910.10781
-
[19]
Deeptralog: Trace-log combined microservice anomaly de- tection through graph-based deep learning,
C. Zhang, X. Peng, C. Sha, K. Zhang, Z. Fu, X. Wu, Q. Lin, and D. Zhang, “Deeptralog: Trace-log combined microservice anomaly de- tection through graph-based deep learning,” in2022 IEEE/ACM 44th International Conference on Software Engineering (ICSE), 2022, pp. 623–634
work page 2022
-
[20]
Bridging the gap: Llm-powered transfer learning for log anomaly detection in new software systems,
Y . Sui, X. Wang, T. Cui, T. Xiao, C. He, S. Zhang, Y . Zhang, X. Yang, Y . Sun, and D. Pei, “Bridging the gap: Llm-powered transfer learning for log anomaly detection in new software systems,” in2025 IEEE 41st International Conference on Data Engineering (ICDE), 2025, pp. 4414– 4427
work page 2025
-
[21]
Early exploration of using ChatGPT for log-based anomaly detection on parallel file systems logs
C. Egersdoerfer, D. Zhang, and D. Dai, “Early exploration of using chatgpt for log-based anomaly detection on parallel file systems logs,” inProceedings of the 32nd International Symposium on High-Performance Parallel and Distributed Computing, ser. HPDC ’23. New York, NY , USA: Association for Computing Machinery, 2023, pp. 315–316. [Online]. Available: ...
-
[22]
H. Jin, G. Papadimitriou, K. Raghavan, P. Zuk, P. Balaprakash, C. Wang, A. Mandal, and E. Deelman, “Large language models for anomaly detection in computational workflows: from supervised fine-tuning to in-context learning,” 2024. [Online]. Available: https: //arxiv.org/abs/2407.17545
-
[23]
Llm-based event log analysis techniques: A survey.arXiv preprint arXiv:2502.00677, 2025
S. Akhtar, S. Khan, and S. Parkinson, “Llm-based event log analysis techniques: A survey,” 2025. [Online]. Available: https: //arxiv.org/abs/2502.00677
-
[24]
L. Zhang, T. Jia, M. Jia, Y . Wu, H. Liu, and Y . Li, “XRAGLog: A resource-efficient and context-aware log-based anomaly detection method using retrieval-augmented generation,” inAAAI 2025 Workshop on Preventing and Detecting LLM Misinformation (PDLM), 2025. [Online]. Available: https://openreview.net/forum?id=8gv7CXuXQ3
work page 2025
-
[25]
Interpretable online log analysis using large language models with prompt strategies,
Y . Liu, S. Tao, W. Meng, J. Wang, W. Ma, Y . Zhao, Y . Chen, H. Yang, Y . Jiang, and X. Chen, “Interpretable online log analysis using large language models with prompt strategies,” 2024. [Online]. Available: https://arxiv.org/abs/2308.07610
-
[26]
Beyond the limits: A survey of techniques to extend the context length in large language models,
X. Wang, M. Salmani, P. Omidi, X. Ren, M. Rezagholizadeh, and A. Eshaghi, “Beyond the limits: A survey of techniques to extend the context length in large language models,” 2024. [Online]. Available: https://arxiv.org/abs/2402.02244
-
[27]
J. Liu, D. Zhu, Z. Bai, Y . He, H. Liao, H. Que, Z. Wang, C. Zhang, G. Zhang, J. Zhang, Y . Zhang, Z. Chen, H. Guo, S. Li, Z. Liu, Y . Shan, Y . Song, J. Tian, W. Wu, Z. Zhou, R. Zhu, J. Feng, Y . Gao, S. He, Z. Li, T. Liu, F. Meng, W. Su, Y . Tan, Z. Wang, J. Yang, W. Ye, B. Zheng, W. Zhou, W. Huang, S. Li, and Z. Zhang, “A comprehensive survey on long c...
-
[28]
A survey on deep learning for named entity recognition,
J. Li, A. Sun, J. Han, and C. Li, “A survey on deep learning for named entity recognition,”IEEE Transactions on Knowledge and Data Engineering, vol. 34, no. 1, pp. 50–70, Jan. 2022. [Online]. Available: http://dx.doi.org/10.1109/TKDE.2020.2981314
-
[29]
From Local to Global: A Graph RAG Approach to Query-Focused Summarization
D. Edge, H. Trinh, N. Cheng, J. Bradley, A. Chao, A. Mody, S. Truitt, D. Metropolitansky, R. O. Ness, and J. Larson, “From local to global: A graph rag approach to query-focused summarization,” 2025. [Online]. Available: https://arxiv.org/abs/2404.16130
work page internal anchor Pith review Pith/arXiv arXiv 2025
-
[30]
Automated system monitoring and notification with swatch,
S. E. Hansen and E. T. Atkins, “Automated system monitoring and notification with swatch,” inProceedings of the 7th USENIX Conference on System Administration, ser. LISA ’93. USA: USENIX Association, 1993, pp. 145–152
work page 1993
-
[31]
Refereed papers: Real-time log file analysis using the simple event correlator (sec),
J. P. Rouillard, “Refereed papers: Real-time log file analysis using the simple event correlator (sec),” inProceedings of the 18th USENIX Conference on System Administration, ser. LISA ’04. USA: USENIX Association, 2004, pp. 133–150
work page 2004
-
[32]
Experience report: System log analysis for anomaly detection,
S. He, J. Zhu, P. He, and M. R. Lyu, “Experience report: System log analysis for anomaly detection,” in2016 IEEE 27th international symposium on software reliability engineering (ISSRE). IEEE, 2016, pp. 207–218
work page 2016
-
[33]
A comprehensive study of machine learning techniques for log- based anomaly detection,
S. Ali, C. Boufaied, D. Bianculli, P. Branco, and L. Briand, “A comprehensive study of machine learning techniques for log- based anomaly detection,”Empirical Software Engineering, vol. 30, no. 5, Jun. 2025. [Online]. Available: http://dx.doi.org/10.1007/ s10664-025-10669-3
work page 2025
-
[34]
Log-based anomaly detection without log pars- ing,
V .-H. Le and H. Zhang, “Log-based anomaly detection without log pars- ing,” in2021 36th IEEE/ACM International Conference on Automated Software Engineering (ASE), 2021, pp. 492–504
work page 2021
-
[35]
LLMParser: An Exploratory Study on Using Large Language Models for Log Parsing,
Z. Ma, A. R. Chen, D. J. Kim, T.-H. Chen, and S. Wang, “Llmparser: An exploratory study on using large language models for log parsing,” inProceedings of the IEEE/ACM 46th International Conference on Software Engineering, ser. ICSE ’24. ACM, Apr. 2024, pp. 1–13. [Online]. Available: http://dx.doi.org/10.1145/3597503.3639150
-
[36]
Librelog: Accurate and efficient unsupervised log parsing using open-source large language models,
Z. Ma, D. J. Kim, and T.-H. Chen, “Librelog: Accurate and efficient unsupervised log parsing using open-source large language models,”
-
[37]
LibreLog: Accurate and Efficient Unsupervised Log Parsing Using Open-Source Large Language Models,
[Online]. Available: https://arxiv.org/abs/2408.01585
-
[38]
LogGPT: Log anomaly detection via GPT.arXiv preprint, 2023
X. Han, S. Yuan, and M. Trabelsi, “Loggpt: Log anomaly detection via gpt,” 2023. [Online]. Available: https://arxiv.org/abs/2309.14482
-
[39]
Raglog: Log anomaly detection using retrieval augmented generation, 2023
J. Pan, S. L. Wong, and Y . Yuan, “Raglog: Log anomaly detection using retrieval augmented generation,” 2023. [Online]. Available: https://arxiv.org/abs/2311.05261
-
[40]
Lost in the Middle: How Language Models Use Long Contexts
N. Liu, D. Schuurmans, M. Bosma, and et al., “Lost in the middle: How language models use long contexts,”arXiv preprint arXiv:2307.03172, 2023
work page internal anchor Pith review Pith/arXiv arXiv 2023
-
[41]
Deep learning for anomaly detection in log data: A survey,
M. Landauer, S. Onder, F. Skopik, and M. Wurzenberger, “Deep learning for anomaly detection in log data: A survey,”Machine Learning with Applications, vol. 12, p. 100470, 2023
work page 2023
-
[42]
Log-based anomaly detection with deep learning: How far are we?
V . H. Le and H. Zhang, “Log-based anomaly detection with deep learning: How far are we?” inProceedings of the 44th International Conference on Software Engineering, 2022, pp. 1356–1367
work page 2022
-
[44]
A data clustering algorithm for mining patterns from event logs,
SLCT, “A data clustering algorithm for mining patterns from event logs,” inProceedings of the 3rd IEEE Workshop on IP Operations and Management (IPOM 2003) (IEEE Cat. No.03EX764), 2003, pp. 119– 126
work page 2003
-
[45]
Clustering event logs using iterative partitioning,
A. A. Makanju, A. N. Zincir-Heywood, and E. E. Milios, “Clustering event logs using iterative partitioning,” inProceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ser. KDD ’09. New York, NY , USA: Association for Computing Machinery, 2009, pp. 1255–1264. [Online]. Available: https://doi.org/10.1145/1557019.1557154
-
[46]
Execution anomaly detection in distributed systems through unstructured log analysis,
Q. Fu, J.-G. Lou, Y . Wang, and J. Li, “Execution anomaly detection in distributed systems through unstructured log analysis,” in2009 Ninth IEEE International Conference on Data Mining, 2009, pp. 149–158
work page 2009
-
[47]
Spell: Streaming parsing of system event logs,
M. Du and F. Li, “Spell: Streaming parsing of system event logs,” in 2016 IEEE 16th International Conference on Data Mining (ICDM). IEEE, 2016, pp. 859–864
work page 2016
-
[48]
Drain: An online log parsing approach with fixed depth tree,
P. He, J. Zhu, Z. Zheng, and M. R. Lyu, “Drain: An online log parsing approach with fixed depth tree,” in2017 IEEE international conference on web services (ICWS). IEEE, 2017, pp. 33–40
work page 2017
-
[49]
Neural entity linking: A survey of models based on deep learning,
O. Sevgili, A. Shelmanov, M. Arkhipov, A. Panchenko, and C. Biemann, “Neural entity linking: A survey of models based on deep learning,” Semantic Web, vol. 13, no. 3, pp. 527–570, Apr. 2022. [Online]. Available: http://dx.doi.org/10.3233/SW--222986
-
[50]
Gpt-ner: Named entity recognition via large language models,
S. Wang, X. Sun, X. Li, R. Ouyang, F. Wu, T. Zhang, J. Li, and G. Wang, “Gpt-ner: Named entity recognition via large language models,” 2023. [Online]. Available: https://arxiv.org/abs/2304.10428
-
[51]
PromptNER: Prompting for named entity recognition,
D. Ashok and Z. C. Lipton, “Promptner: Prompting for named entity recognition,” 2023. [Online]. Available: https://arxiv.org/abs/2305.15444
-
[52]
llmner: (zero—few)-shot named entity recognition, exploiting the power of large language models,
F. Villena, L. Miranda, and C. Aracena, “llmner: (zero—few)-shot named entity recognition, exploiting the power of large language models,” 2024. [Online]. Available: https://arxiv.org/abs/2406.04528
-
[53]
Language models are few-shot learners,
T. B. Brown, B. Mann, N. Ryder, M. Subbiah, J. Kaplan, P. Dhariwal, A. Neelakantan, P. Shyam, G. Sastry, A. Askellet al., “Language models are few-shot learners,”Advances in Neural Information Processing Systems, vol. 33, pp. 1877–1901, 2020
work page 1901
-
[54]
Rethinking the role of demonstrations: What makes in-context learning work?
S. Min, M. Lewis, L. Zettlemoyer, and H. Hajishirzi, “Rethinking the role of demonstrations: What makes in-context learning work?” in Transactions of the Association for Computational Linguistics, 2022
work page 2022
-
[55]
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
J. Wei, X. Wang, D. Schuurmans, M. Bosma, B. Ichter, F. Xia, E. Chi, Q. Le, and D. Zhou, “Chain-of-thought prompting elicits reasoning in large language models,” 2023. [Online]. Available: https://arxiv.org/abs/2201.11903
work page internal anchor Pith review Pith/arXiv arXiv 2023
-
[56]
ReAct: Synergizing Reasoning and Acting in Language Models
S. Yao, J. Zhao, D. Yu, N. Du, I. Shafran, K. Narasimhan, and Y . Cao, “React: Synergizing reasoning and acting in language models,” 2023. [Online]. Available: https://arxiv.org/abs/2210.03629
work page internal anchor Pith review Pith/arXiv arXiv 2023
-
[57]
What supercomputers say: A study of five system logs,
A. Oliner and J. Stearley, “What supercomputers say: A study of five system logs,” in37th annual IEEE/IFIP international conference on dependable systems and networks (DSN’07). IEEE, 2007, pp. 575– 584
work page 2007
-
[58]
Detecting large-scale system problems by mining console logs,
W. Xu, L. Huang, A. Fox, D. Patterson, and M. I. Jordan, “Detecting large-scale system problems by mining console logs,” inProceedings of the ACM SIGOPS 22nd symposium on Operating systems principles, 2009, pp. 117–132
work page 2009
-
[59]
Largescale system problem detection by mining console logs,
W. Xu, L. Huang, A. Fox, D. Patterson, and M. Jordan, “Largescale system problem detection by mining console logs,”Proceedings of SOSP’09, 2009
work page 2009
-
[60]
Estimating the support of a high-dimensional distribution,
B. Sch ¨olkopf, J. C. Platt, J. Shawe-Taylor, A. J. Smola, and R. C. Williamson, “Estimating the support of a high-dimensional distribution,” Neural computation, vol. 13, no. 7, pp. 1443–1471, 2001
work page 2001
-
[61]
F. T. Liu, K. M. Ting, and Z.-H. Zhou, “Isolation forest,” in2008 eighth ieee international conference on data mining. IEEE, 2008, pp. 413–422
work page 2008
-
[62]
“The source code of loglizer,” https://github.com/logpai/loglizer, ac- cessed: 2025-10-26
work page 2025
-
[63]
The source code of deep-loglizer,
“The source code of deep-loglizer,” https://github.com/logpai/ deep-loglizer, accessed: 2025-10-26
work page 2025
-
[64]
“The source code of oc4seq,” https://github.com/KnowledgeDiscovery/ OC4Seq, accessed: 2025-10-26
work page 2025
-
[65]
“The source code of logprompt,” https://github.com/lunyiliu/LogPrompt/ tree/main, accessed: 2025-10-26
work page 2025
-
[66]
“The source code of logbert,” https://github.com/HelenGuohx/logbert, accessed: 2025-10-26
work page 2025
-
[67]
Prefixspan,: mining sequential patterns efficiently by prefix- projected pattern growth,
J. Pei, J. Han, B. Mortazavi-Asl, H. Pinto, Q. Chen, U. Dayal, and M.-C. Hsu, “Prefixspan,: mining sequential patterns efficiently by prefix- projected pattern growth,” inProceedings 17th International Conference on Data Engineering, 2001, pp. 215–224
work page 2001
-
[68]
Frequent pattern mining: current status and future directions,
J. Han, H. Cheng, D. Xin, and X. Yan, “Frequent pattern mining: current status and future directions,”Data Min. Knowl. Discov., vol. 15, no. 1, pp. 55–86, Aug. 2007. [Online]. Available: https://doi.org/10.1007/s10618-006-0059-1
-
[69]
Spade: An efficient algorithm for mining frequent se- quences,
M. J. Zaki, “Spade: An efficient algorithm for mining frequent se- quences,”Machine learning, vol. 42, no. 1, pp. 31–60, 2001
work page 2001
-
[70]
Hierarchical topic models and the nested chinese restaurant process,
T. Griffiths, M. Jordan, J. Tenenbaum, and D. Blei, “Hierarchical topic models and the nested chinese restaurant process,” in Advances in Neural Information Processing Systems, S. Thrun, L. Saul, and B. Sch ¨olkopf, Eds., vol. 16. MIT Press, 2003. [Online]. Available: https://proceedings.neurips.cc/paper files/paper/ 2003/file/7b41bfa5085806dfa24b8c9de0ce...
work page 2003
-
[71]
Raptor: Recursive abstractive processing for tree-organized retrieval,
P. Sarthi, S. Abdullah, A. Tuli, S. Khanna, A. Goldie, and C. D. Manning, “Raptor: Recursive abstractive processing for tree-organized retrieval,” inInternational Conference on Learning Representations (ICLR), 2024
work page 2024
-
[72]
Self-Consistency Improves Chain of Thought Reasoning in Language Models
X. Wang, J. Wei, D. Schuurmans, Q. Le, E. Chi, S. Narang, A. Chowdhery, and D. Zhou, “Self-consistency improves chain of thought reasoning in language models,” 2023. [Online]. Available: https://arxiv.org/abs/2203.11171
work page internal anchor Pith review Pith/arXiv arXiv 2023
-
[73]
Cost-effective online multi-llm selection with versatile reward models,
X. Dai, J. Li, X. Liu, A. Yu, and J. C. S. Lui, “Cost-effective online multi-llm selection with versatile reward models,” 2024. [Online]. Available: https://arxiv.org/abs/2405.16587
-
[74]
Optllm: Optimal assignment of queries to large language models,
Y . Liu, H. Zhang, Y . Miao, V .-H. Le, and Z. Li, “Optllm: Optimal assignment of queries to large language models,” 2024. [Online]. Available: https://arxiv.org/abs/2405.15130
-
[75]
Y . Li, “Llm bandit: Cost-efficient llm generation via preference- conditioned dynamic routing,” 2025. [Online]. Available: https: //arxiv.org/abs/2502.02743
-
[76]
Unique security and privacy threats of large language model: A comprehensive survey,
S. Wang, T. Zhu, B. Liu, M. Ding, X. Guo, D. Ye, W. Zhou, and P. S. Yu, “Unique security and privacy threats of large language model: A comprehensive survey,” 2024. [Online]. Available: https://arxiv.org/abs/2406.07973
-
[77]
B. C. Das, M. H. Amini, and Y . Wu, “Security and privacy challenges of large language models: A survey,” 2024. [Online]. Available: https://arxiv.org/abs/2402.00888
-
[78]
Privacy issues in large language models: A survey,
S. Neel and P. Chang, “Privacy issues in large language models: A survey,” 2024. [Online]. Available: https://arxiv.org/abs/2312.06717
-
[79]
Sovereign large language models: Advantages, strategy and regulations,
M. Bondarenko, S. Lushnei, Y . Paniv, O. Molchanovsky, M. Romanyshyn, Y . Filipchuk, and A. Kiulian, “Sovereign large language models: Advantages, strategy and regulations,” 2025. [Online]. Available: https://arxiv.org/abs/2503.04745
-
[80]
An empirical survey on long document summarization: Datasets, models, and metrics,
H. Y . Koh, J. Ju, M. Liu, and S. Pan, “An empirical survey on long document summarization: Datasets, models, and metrics,”ACM Computing Surveys, vol. 55, no. 8, pp. 1–35, Dec. 2022. [Online]. Available: http://dx.doi.org/10.1145/3545176
-
[81]
A divide-and-conquer approach to the summarization of long documents,
A. Gidiotis and G. Tsoumakas, “A divide-and-conquer approach to the summarization of long documents,” 2020. [Online]. Available: https://arxiv.org/abs/2004.06190
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.