Code Researcher retrieves global context via multi-step reasoning on code semantics, patterns, and commit history to fix Linux kernel crashes, reaching 48% crash-resolution rate versus 31% for baselines.
Bugs as deviant behavior: A general approach to inferring errors in systems code
2 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
verdicts
UNVERDICTED 2roles
method 1polarities
use method 1representative citing papers
Chaintrix achieves 71.7% recall on 120 high-severity vulnerabilities in the EVMbench benchmark and outperforms the strongest frontier-model baseline by 26 percentage points through LLM pipelines grounded in a Cross-Contract Interaction Model and filtered by structural checks.
citing papers explorer
-
Code Researcher: Deep Research Agent for Large Systems Code and Commit History
Code Researcher retrieves global context via multi-step reasoning on code semantics, patterns, and commit history to fix Linux kernel crashes, reaching 48% crash-resolution rate versus 31% for baselines.
-
CHAINTRIX: A multi-pipeline LLM-augmented framework for automated smart-contract security auditing
Chaintrix achieves 71.7% recall on 120 high-severity vulnerabilities in the EVMbench benchmark and outperforms the strongest frontier-model baseline by 26 percentage points through LLM pipelines grounded in a Cross-Contract Interaction Model and filtered by structural checks.