pith. machine review for the scientific record. sign in

arxiv: 2604.27820 · v1 · submitted 2026-04-30 · 💻 cs.AI · cs.DB· cs.IR· cs.MA

Recognition: unknown

ObjectGraph: From Document Injection to Knowledge Traversal -- A Native File Format for the Agentic Era

Authors on Pith no claims yet

Pith reviewed 2026-05-07 04:59 UTC · model grok-4.3

classification 💻 cs.AI cs.DBcs.IRcs.MA
keywords document formatknowledge graphLLM agentstoken efficiencyprogressive disclosurerole-scoped accessMarkdown supersetagentic AI
0
0 comments X

The pith

ObjectGraph models documents as typed directed knowledge graphs that agents traverse selectively instead of injecting full texts.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Current document formats assume linear human reading, but autonomous agents retrieve information from context windows, leading to wasteful full-document injection. The paper formalizes this as the Document Consumption Problem and specifies six structural properties that must be satisfied for efficient agent use. It introduces OBJECTGRAPH as a Markdown-compatible format that represents content as a traversable knowledge graph, incorporating progressive disclosure, role-scoped access, and executable assertions. Proof shows it meets all six properties simultaneously, unlike prior formats. Tests on five document types and eight agent tasks confirm up to 95.3 percent token savings and 98.7 percent content fidelity with no accuracy drop.

Core claim

The central discovery is that the Document Consumption Problem can be solved by shifting from linear string injection to knowledge graph traversal. OBJECTGRAPH is defined as a strict superset of Markdown that encodes documents as typed directed graphs. It natively supports the Progressive Disclosure Model for layered information release, the Role-Scoped Access Protocol for permissioned retrieval, and Executable Assertion Nodes for embedded logic. This design is shown to satisfy the six required structural properties where no existing format does.

What carries the argument

OBJECTGRAPH (.og), the typed directed knowledge graph file format that serves as a strict superset of Markdown and provides a two-primitive query protocol along with built-in support for progressive disclosure, role-scoped access, and executable assertions to enable selective traversal.

If this is right

  • Agents achieve up to 95.3 percent reduction in tokens used for document consumption across the evaluated tasks with no statistically significant accuracy loss.
  • Transpilers convert existing documents to OBJECTGRAPH while preserving 98.7 percent of content on held-out benchmarks.
  • The format remains readable by humans without tooling because it is a strict superset of Markdown.
  • Multi-turn agent loops avoid context compounding through native role-scoped and progressive access mechanisms.
  • A two-primitive query protocol suffices for all traversal needs without additional infrastructure.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Agent systems could shift from managing large context windows to operating on graph query primitives for better scalability in extended sessions.
  • The six properties could serve as an evaluation standard for any new document format intended for autonomous agents.
  • Executable assertion nodes suggest documents that embed verification logic or trigger external actions directly within the format.
  • The graph model might extend beyond text to unify access to code, data tables, or other structured artifacts in agent workflows.

Load-bearing premise

The six structural properties fully capture the requirements for effective document consumption by autonomous agents and the evaluation across five document classes and eight task types generalizes to real-world deployments.

What would settle it

An experiment on a new agent task type or document class where using OBJECTGRAPH produces a statistically significant drop in task accuracy compared to full document injection.

Figures

Figures reproduced from arXiv: 2604.27820 by Mohit Dubey, Open Gigantic.

Figure 1
Figure 1. Figure 1: Two architectural instantiations of the ObjectGraph view at source ↗
Figure 2
Figure 2. Figure 2: Mean token consumption per query across docu view at source ↗
Figure 3
Figure 3. Figure 3: Cumulative token cost in a 5-turn agentic work view at source ↗
Figure 4
Figure 4. Figure 4: Ablation: token reduction contribution of individual view at source ↗
read the original abstract

Every document format in existence was designed for a human reader moving linearly through text. Autonomous LLM agents do not read - they retrieve. This fundamental mismatch forces agents to inject entire documents into their context window, wasting tokens on irrelevant content, compounding state across multi-turn loops, and broadcasting information indiscriminately across agent roles. We argue this is not a prompt engineering problem, not a retrieval problem, and not a compression problem: it is a format problem. We introduce OBJECTGRAPH (.og), a file format that reconceives the document as a typed, directed knowledge graph to be traversed rather than a string to be injected. OBJECTGRAPH is a strict superset of Markdown - every .md file is a valid .og file - requires no infrastructure beyond a two-primitive query protocol, and is readable by both humans and agents without tooling. We formalize the Document Consumption Problem, characterise six structural properties no existing format satisfies simultaneously, and prove OBJECTGRAPH satisfies all six. We further introduce the Progressive Disclosure Model, the Role-Scoped Access Protocol, and Executable Assertion Nodes as native format primitives. Empirical evaluation across five document classes and eight agent task types demonstrates up to 95.3 percent token reduction with no statistically significant degradation in task accuracy (p > 0.05). Transpiler fidelity reaches 98.7 percent content preservation on a held-out document benchmark.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

4 major / 2 minor

Summary. The paper claims that current document formats are mismatched to autonomous LLM agents, which retrieve rather than read linearly, leading to wasteful full-document injection. It introduces OBJECTGRAPH (.og) as a strict Markdown superset that models documents as typed directed knowledge graphs traversable via a minimal two-primitive query protocol. The authors formalize the Document Consumption Problem, characterize six structural properties no existing format satisfies simultaneously, prove OBJECTGRAPH meets all six, and introduce three native primitives (Progressive Disclosure Model, Role-Scoped Access Protocol, Executable Assertion Nodes). Empirical evaluation across five document classes and eight agent task types reports up to 95.3% token reduction with no statistically significant task accuracy degradation (p > 0.05) and 98.7% transpiler fidelity on a held-out benchmark.

Significance. If the formalization, proofs, and results hold under scrutiny, the work could meaningfully advance agentic systems by providing a native format that reduces context bloat, enables role-scoped access, and supports progressive disclosure without extra infrastructure. Treating the document as a traversable graph rather than injectable text is a substantive conceptual shift, and the Markdown superset design aids adoption. The explicit formalization of the consumption problem and the reported token savings offer a concrete baseline for future agent-document research. Significance is limited by the narrow evaluation scope and the absence of external grounding for the six properties as the minimal necessary set.

major comments (4)
  1. [Abstract] Abstract: The claim that the six structural properties are necessary and that 'no existing format satisfies [them] simultaneously' is load-bearing for the entire contribution, yet the properties are presented as author-characterized without citation to agent interaction logs, framework analyses, or prior literature identifying these exact bottlenecks. This leaves open whether the properties are complete or whether the proof that OBJECTGRAPH is the first to satisfy all six rests on an ad-hoc characterization.
  2. [Abstract] Abstract (empirical evaluation): The statement of 'no statistically significant degradation in task accuracy (p > 0.05)' only indicates failure to reject the null hypothesis of no difference; it does not demonstrate equivalence, absence of degradation, or practical non-inferiority. No effect sizes, confidence intervals, or power analysis are referenced, undermining the central claim that token reduction is achieved without performance cost.
  3. [Abstract] Abstract: The headline result of 'up to 95.3 percent token reduction' reports the best-case outcome across the five document classes and eight task types. Without average or median reductions, standard deviations, or per-class/per-task breakdowns, it is impossible to judge whether the savings are typical or confined to favorable cases.
  4. [Abstract] Abstract: The 98.7 percent transpiler fidelity is reported on a 'held-out document benchmark' whose size, selection criteria, document classes, and construction method are unspecified. Because fidelity is a core supporting claim for the format's practicality, the lack of benchmark details makes the result non-reproducible from the abstract alone.
minor comments (2)
  1. [Abstract] The abstract introduces the Progressive Disclosure Model, Role-Scoped Access Protocol, and Executable Assertion Nodes without even one-sentence definitions, forcing readers to reach later sections to understand the primitives that are presented as native to the format.
  2. [Abstract] The assertion that OBJECTGRAPH 'requires no infrastructure beyond a two-primitive query protocol' is repeated but the two primitives themselves are not named or briefly described in the abstract, reducing immediate clarity for readers evaluating the 'no tooling' claim.

Simulated Author's Rebuttal

4 responses · 0 unresolved

We thank the referee for their constructive and detailed comments. We address each major comment point by point below, indicating the revisions we will make to strengthen the manuscript.

read point-by-point responses
  1. Referee: [Abstract] Abstract: The claim that the six structural properties are necessary and that 'no existing format satisfies [them] simultaneously' is load-bearing for the entire contribution, yet the properties are presented as author-characterized without citation to agent interaction logs, framework analyses, or prior literature identifying these exact bottlenecks. This leaves open whether the properties are complete or whether the proof that OBJECTGRAPH is the first to satisfy all six rests on an ad-hoc characterization.

    Authors: The six structural properties are derived directly from our formalization of the Document Consumption Problem, which captures the core mismatches between linear document formats and the retrieval-oriented behavior of autonomous agents. While the properties are author-characterized, they synthesize established issues documented in the RAG, context-window, and multi-agent systems literature. In the revised manuscript we will add a dedicated subsection that explicitly traces each property back to the formal problem definition and includes additional citations to prior analyses of agent-document interactions. We will also clarify that the properties constitute a minimal sufficient set for the demonstrated benefits rather than an exhaustive or universally necessary characterization; the proof shows that no existing format satisfies all six simultaneously. revision: partial

  2. Referee: [Abstract] Abstract (empirical evaluation): The statement of 'no statistically significant degradation in task accuracy (p > 0.05)' only indicates failure to reject the null hypothesis of no difference; it does not demonstrate equivalence, absence of degradation, or practical non-inferiority. No effect sizes, confidence intervals, or power analysis are referenced, undermining the central claim that token reduction is achieved without performance cost.

    Authors: We agree that failure to reject the null hypothesis does not establish equivalence or the absence of degradation. In the revised manuscript we will expand the results section to report effect sizes, 95% confidence intervals on accuracy differences, and a post-hoc power analysis for the statistical tests performed. We will also present per-task and per-document-class accuracy comparisons to allow readers to assess practical non-inferiority directly from the data. revision: yes

  3. Referee: [Abstract] Abstract: The headline result of 'up to 95.3 percent token reduction' reports the best-case outcome across the five document classes and eight task types. Without average or median reductions, standard deviations, or per-class/per-task breakdowns, it is impossible to judge whether the savings are typical or confined to favorable cases.

    Authors: The phrase 'up to 95.3 percent' in the abstract is intended to highlight the maximum observed reduction. The full manuscript already contains tables and figures that report mean, median, and standard-deviation token reductions broken down by document class and task type. To address the concern, we will revise the abstract to reference these detailed breakdowns (e.g., 'token reductions reaching up to 95.3% with full per-class and per-task distributions provided in Section 4') so that the headline figure is placed in context. revision: partial

  4. Referee: [Abstract] Abstract: The 98.7 percent transpiler fidelity is reported on a 'held-out document benchmark' whose size, selection criteria, document classes, and construction method are unspecified. Because fidelity is a core supporting claim for the format's practicality, the lack of benchmark details makes the result non-reproducible from the abstract alone.

    Authors: The size, selection criteria, document-class coverage, and construction method of the held-out benchmark are fully specified in the Experimental Setup subsection of the manuscript. We acknowledge that the abstract would benefit from a concise qualifier. In the revision we will update the abstract to include a brief description of the benchmark (e.g., 'on a held-out benchmark spanning the five document classes') while directing readers to the methods section for complete reproducibility details. revision: partial

Circularity Check

0 steps flagged

No significant circularity in the derivation chain

full rationale

The paper formalizes the Document Consumption Problem, characterizes six structural properties, and proves OBJECTGRAPH satisfies them all while introducing native primitives like Progressive Disclosure and Role-Scoped Access. This is a standard requirements-definition followed by design-verification approach, not a reduction of any claimed result to its own inputs by construction. Empirical claims (token reduction, accuracy p-values, transpiler fidelity) are presented as separate evaluations on held-out benchmarks rather than fitted quantities renamed as predictions. No self-citation chains, uniqueness theorems imported from prior author work, or ansatzes smuggled via citation appear as load-bearing steps in the abstract or context. The derivation is self-contained.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 3 invented entities

The central claim rests on the unelaborated characterization of the Document Consumption Problem via six structural properties and the introduction of three new primitives, with no free parameters or external benchmarks detailed in the abstract.

axioms (1)
  • domain assumption The Document Consumption Problem is defined by six structural properties that no existing format satisfies simultaneously.
    Abstract states the problem is formalized around these properties as the basis for proving the new format's superiority.
invented entities (3)
  • Progressive Disclosure Model no independent evidence
    purpose: Native format primitive allowing information to be revealed progressively to agents
    Introduced as a core feature of ObjectGraph to address agent context limitations.
  • Role-Scoped Access Protocol no independent evidence
    purpose: Protocol for controlling document access based on agent roles
    New access control mechanism presented as part of the format.
  • Executable Assertion Nodes no independent evidence
    purpose: Graph nodes that support executable assertions within the document structure
    New primitive enabling dynamic checks directly in the file format.

pith-pipeline@v0.9.0 · 5558 in / 1507 out tokens · 58159 ms · 2026-05-07T04:59:37.020283+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

14 extracted references · 8 canonical work pages · 2 internal anchors

  1. [1]

    Model Context Protocol (MCP): An open standard for connecting AI assistants to tools and data sources, 2024

    Anthropic. Model Context Protocol (MCP): An open standard for connecting AI assistants to tools and data sources, 2024. https://modelcontextprotocol.io

  2. [2]

    D. Edge, H. Trinh, N. Cheng, J. Bradley, A. Chao, A. Mody, S. Truitt, and J. Larson. From local to global: A graph RAG approach to query-focused summarization. arXiv preprint arXiv:2404.16130, 2024

  3. [3]

    Fat-Cat: Document-driven metacognitive multi-agent system for complex reasoning

    Anonymous. Fat-Cat: Document-driven metacognitive multi-agent system for complex reasoning. arXiv preprint arXiv:2602.02206, 2025

  4. [4]

    Y. Gao, Z. Li, Y. Yuan, Z. Ji, P. Ma, and S. Wang. SkillReducer: Optimizing LLM agent skills for token efficiency. arXiv preprint arXiv:2603.29919, 2026

  5. [5]

    J. Gruber. Markdown, 2004. https://daringfireball.net/projects/markdown/

  6. [6]

    Active context compression: Autonomous memory management in LLM agents

    Anonymous. Active context compression: Autonomous memory management in LLM agents. arXiv preprint arXiv:2601.07190, 2025

  7. [7]

    From Skill Text to Skill Structure: The Scheduling-Structural-Logical Representation for Agent Skills

    Anonymous. From skill text to skill structure: The Scheduling-Structural-Logical representation for agent skills. arXiv preprint arXiv:2604.24026, 2026

  8. [8]

    u ttler, M. Lewis, W.-T. Yih, T. Rockt\

    P. Lewis, E. Perez, A. Piktus, F. Petroni, V. Karpukhin, N. Goyal, H. K\" u ttler, M. Lewis, W.-T. Yih, T. Rockt\" a schel, S. Riedel, and D. Kiela. Retrieval-augmented generation for knowledge-intensive NLP tasks. In Advances in Neural Information Processing Systems (NeurIPS), 2020

  9. [9]

    J. Howard. A proposed standard for using Markdown in LLM context, 2024. https://llmstxt.org

  10. [10]

    W. Mei, Y. Guo, Y. Wang, Z. Li, and H. Zhao. From commands to prompts: LLM-based semantic file system for AIOS. arXiv preprint arXiv:2410.11843, 2024

  11. [11]

    Schopplich

    J. Schopplich. TOON: Token-Oriented Object Notation, 2025. https://toonformat.dev

  12. [12]

    Structured context engineering for file-native agentic systems

    Anonymous. Structured context engineering for file-native agentic systems. arXiv preprint arXiv:2602.05447, 2025

  13. [13]

    Codebase-Memory: Tree-Sitter-based knowledge graphs for LLM code exploration via MCP

    Anonymous. Codebase-Memory: Tree-Sitter-based knowledge graphs for LLM code exploration via MCP. arXiv preprint arXiv:2603.27277, 2026

  14. [14]

    Y.-A. Xiao, P. Gao, C. Peng, and Y. Xiong. Reducing cost of LLM agents with trajectory reduction. Proc.\ ACM Softw.\ Eng., 3(FSE):FSE056, 2026