Native runs (Claude Code, Codex, OpenClaw) emit native session artifacts that are collected and ingested after the run terminates.

In-Process Framework Adapter Pipeline

This appendix complements Appendix A for framework comparisons that do not go through a native CLI harness.

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Auditing Agent Harness Safety

cs.CL · 2026-05-14 · unverdicted · novelty 7.0 · 2 refs

HarnessAudit audits full execution trajectories of LLM agents for boundary compliance and introduces HarnessAudit-Bench showing that task completion often diverges from safe execution with risks accumulating over longer trajectories.
