pith. sign in

KAIJU: An Executive Kernel for Intent-Gated Execution of LLM Agents

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it
abstract

Tool-calling autonomous agents based on large language models using ReAct exhibit three limitations: serial latency, quadratic context growth, and vulnerability to prompt injection and hallucination. Recent work moves towards separating planning from execution but in each case the model remains coupled to the execution mechanics. We introduce a system-level abstraction for LLM agents which decouples the execution of agent workflows from the LLM reasoning layer. We define two first-class abstractions: (1) Intent-Gated Execution (IGX), a security paradigm that enforces intent at execution, and (2) an Executive Kernel that manages scheduling, tool dispatch, dependency resolution, failures and security. In KAIJU, the LLM plans upfront, optimistically scheduling tools in parallel with dependency-aware parameter injection. Tools are authorised via IGX based on four independent variables: scope, intent, impact, and clearance (external approval). KAIJU supports three adaptive execution modes (Reflect, nReflect, and Orchestrator), providing progressively finer-grained execution control apt for complex investigation and deep analysis or research. Empirical evaluation against a ReAct baseline shows that KAIJU has a latency penalty on simple queries due to planning overhead, convergence at moderate complexity, and a structural advantage on computational queries requiring parallel data gathering. Beyond latency, the separation enforces behavioural guarantees that ReAct cannot match through prompting alone. Code available at https://github.com/compdeep/kaiju

fields

cs.AI 1

years

2026 1

verdicts

UNVERDICTED 1

representative citing papers

CAX-Agent: A Lightweight Agent Harness for Reliable APDL Automation

cs.AI · 2026-05-12 · unverdicted · novelty 5.0

CAX-Agent is a three-layer agent harness for MAPDL automation whose model-driven recovery policy reaches 0.93 task completion and 0.84 zero-intervention rate on 50 simple structural benchmarks, outperforming rule-only and no-recovery baselines.

citing papers explorer

Showing 1 of 1 citing paper.

  • CAX-Agent: A Lightweight Agent Harness for Reliable APDL Automation cs.AI · 2026-05-12 · unverdicted · none · ref 6 · internal anchor

    CAX-Agent is a three-layer agent harness for MAPDL automation whose model-driven recovery policy reaches 0.93 task completion and 0.84 zero-intervention rate on 50 simple structural benchmarks, outperforming rule-only and no-recovery baselines.