pith. sign in

arxiv: 2604.05865 · v1 · submitted 2026-04-07 · 💻 cs.AI · cs.PL

JTON: A Token-Efficient JSON Superset with Zen Grid Tabular Encoding for Large Language Models

Pith reviewed 2026-05-10 20:06 UTC · model grok-4.3

classification 💻 cs.AI cs.PL
keywords JTONZen Gridtoken efficiencyJSON supersettabular dataLLM contextstructured serializationdata encoding
0
0 comments X

The pith

JTON with Zen Grid encoding reduces token counts for tabular data in LLMs by 28.5% on average while maintaining comprehension accuracy.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes JTON, a strict superset of JSON, to address token inefficiency in serializing tabular data for large language models. Standard JSON repeats key names for every row in an array, wasting tokens as the number of rows grows. Zen Grid factors out the column headers into a single row and uses semicolons to separate values in each subsequent row. This approach preserves JSON's full type system and syntax. If effective, it allows LLMs to process more data within fixed context windows or at lower inference costs without losing information.

Core claim

JTON introduces Zen Grid tabular encoding, which places column headers in one row and encodes row values using semicolon delimiters, achieving 15-60% token reduction compared to compact JSON across seven real-world domains with an average of 28.5%. LLM comprehension tests on 10 models show a net 0.3 percentage point accuracy improvement, and generation tests on 12 models confirm 100% syntactic validity in few-shot and zero-shot scenarios.

What carries the argument

Zen Grid: a tabular encoding mechanism that isolates column headers in a single row and delimits cell values with semicolons within a JSON-compatible structure.

If this is right

  • Token usage for tabular arrays drops by 15 to 60 percent compared to standard JSON.
  • LLMs achieve comparable or slightly higher accuracy on data comprehension tasks.
  • Models can generate valid JTON data reliably without special training.
  • Optimized parsers can process the format faster than standard JSON libraries.
  • Structured data can fit into more LLM prompts due to reduced token overhead.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Zen Grid could be adapted to other data serialization formats beyond JSON to achieve similar efficiencies.
  • With lower token counts, LLMs might handle larger tables or more complex nested structures in a single context window.
  • Future work might test the format on emerging LLM architectures or in production APIs for cost savings.
  • The public test suite and implementation allow independent verification and extension to new domains.

Load-bearing premise

The seven real-world domains and the particular LLMs tested are representative enough that the observed token savings and accuracy results will hold in other common use cases.

What would settle it

Testing on an eighth domain or additional LLMs where token reduction falls below 10% or where accuracy decreases by more than 1 percentage point compared to JSON.

Figures

Figures reproduced from arXiv: 2604.05865 by Gowthamkumar Nandakishore.

Figure 1
Figure 1. Figure 1: Token count scaling for employee records (4 columns, [PITH_FULL_IMAGE:figures/full_fig_p005_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Token savings (%) versus JSON formats converge rapidly with row count, reaching [PITH_FULL_IMAGE:figures/full_fig_p006_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Zen Grid token savings across different data shapes. Savings range from 15% (number [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Token counts across seven real-world datasets. Zen Grid (green) consistently uses fewer [PITH_FULL_IMAGE:figures/full_fig_p008_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Zen Grid token savings by dataset domain. Savings correlate with key name length [PITH_FULL_IMAGE:figures/full_fig_p008_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Left: parse speedup per dataset. Right: absolute parse latency comparison. JTON [PITH_FULL_IMAGE:figures/full_fig_p009_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: LLM accuracy on real-world datasets: JSON vs Zen Grid across 10 models. Four [PITH_FULL_IMAGE:figures/full_fig_p011_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Accuracy by question type on real-world data (10 models combined). Lookup remains [PITH_FULL_IMAGE:figures/full_fig_p012_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Accuracy by dataset domain. Performance varies by domain complexity–Twitter and [PITH_FULL_IMAGE:figures/full_fig_p012_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: Token savings: synthetic scaling trend (green line) versus real-world datasets (red [PITH_FULL_IMAGE:figures/full_fig_p013_10.png] view at source ↗
Figure 11
Figure 11. Figure 11: Token counts (left) and savings vs JSON compact (right) across six serialization [PITH_FULL_IMAGE:figures/full_fig_p014_11.png] view at source ↗
Figure 12
Figure 12. Figure 12: Cost–accuracy tradeoff across 10 LLMs. Each model appears twice: as a red point [PITH_FULL_IMAGE:figures/full_fig_p016_12.png] view at source ↗
Figure 13
Figure 13. Figure 13: Parsing and serialization throughput. JTON achieves 1.2–1.9 [PITH_FULL_IMAGE:figures/full_fig_p018_13.png] view at source ↗
read the original abstract

When LLMs process structured data, the serialization format directly affects cost and context utilization. Standard JSON wastes tokens repeating key names in every row of a tabular array--overhead that scales linearly with row count. This paper presents JTON (JSON Tabular Object Notation), a strict JSON superset whose main idea, Zen Grid, factors column headers into a single row and encodes values with semicolons, preserving JSON's type system while cutting redundancy. Across seven real-world domains, Zen Grid reduces token counts by 15-60% versus JSON compact (28.5% average; 32% with bare_strings). Comprehension tests on 10 LLMs show a net +0.3 pp accuracy gain over JSON: four models improve, three hold steady, and three dip slightly. Generation tests on 12 LLMs yield 100% syntactic validity in both few-shot and zero-shot settings. A Rust/PyO3 reference implementation adds SIMD-accelerated parsing at 1.4x the speed of Python's json module. Code, a 683-vector test suite, and all experimental data are publicly available.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 1 minor

Summary. The paper introduces JTON, a strict JSON superset featuring Zen Grid tabular encoding that factors column headers into a single row and uses semicolon-separated values to reduce redundancy while preserving JSON's type system. It reports 15-60% token reductions (28.5% average) versus compact JSON across seven real-world domains, a net +0.3 pp accuracy gain in comprehension tests on 10 LLMs (with mixed per-model outcomes), 100% syntactic validity in generation tests on 12 LLMs, and a SIMD-accelerated Rust/PyO3 implementation that parses 1.4x faster than Python's json module. All code, a 683-vector test suite, and experimental data are released publicly.

Significance. If the empirical claims hold under scrutiny, the work offers a practical, backward-compatible format that could lower token costs for tabular data in LLM pipelines without introducing accuracy penalties, supported by the open reference implementation and test suite as reproducibility strengths.

major comments (1)
  1. [Abstract] Abstract: The central claim of a net +0.3 pp accuracy gain (with four models improving, three steady, three dipping) lacks per-model scores, test-item counts, variance estimates, confidence intervals, or significance tests. This makes it impossible to determine whether the small aggregate figure reflects a reliable property of Zen Grid or statistical noise, directly undermining the 'no downside' component of the contribution.
minor comments (1)
  1. The abstract and results sections would benefit from explicit statements of the exact tokenization model (e.g., which tokenizer) used for the reported counts, as token savings can vary across implementations.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive feedback on the abstract's presentation of results. We address the concern point by point below and commit to revisions that improve transparency without altering the core claims.

read point-by-point responses
  1. Referee: [Abstract] Abstract: The central claim of a net +0.3 pp accuracy gain (with four models improving, three steady, three dipping) lacks per-model scores, test-item counts, variance estimates, confidence intervals, or significance tests. This makes it impossible to determine whether the small aggregate figure reflects a reliable property of Zen Grid or statistical noise, directly undermining the 'no downside' component of the contribution.

    Authors: We agree that the abstract, as a concise summary, does not embed the full statistical apparatus. The manuscript already reports the mixed per-model outcomes explicitly and draws from the 683-vector test suite described in Section 3. Per-model accuracy scores, item counts per domain, and variance estimates appear in the results section (Table 2 and accompanying text). However, formal confidence intervals and significance tests (e.g., paired t-tests or McNemar) were not computed in the submitted version. We will revise the abstract to add a brief qualifier referencing the detailed results and will incorporate the requested statistical tests in the revised manuscript to allow readers to assess whether the net +0.3 pp gain is distinguishable from noise. revision: yes

Circularity Check

0 steps flagged

No circularity: claims rest on direct empirical measurements

full rationale

The paper introduces JTON/Zen Grid as a JSON superset for tabular data and validates it via token-count reductions measured on seven real-world domains plus direct LLM comprehension and generation tests across 10-12 models. No equations, fitted parameters, or derivations are presented that could reduce to their own inputs. No self-citations, uniqueness theorems, or ansatzes are invoked to support the core claims. All reported outcomes (token savings, accuracy deltas, 100% validity, parsing speed) are obtained from external benchmarks and implementations rather than by construction from the format definition itself. The derivation chain is therefore self-contained and non-circular.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 1 invented entities

The central claim rests primarily on empirical testing of token counts and LLM performance rather than formal axioms or free parameters.

axioms (1)
  • domain assumption Preservation of JSON type system and syntactic validity in the superset format
    The paper assumes the new encoding maintains full compatibility with JSON types and parsers without introducing new parsing ambiguities.
invented entities (1)
  • Zen Grid no independent evidence
    purpose: Tabular encoding technique within JTON that factors headers and uses semicolon separation
    Newly introduced encoding method without independent external validation or falsifiable predictions outside the paper's tests.

pith-pipeline@v0.9.0 · 5496 in / 1452 out tokens · 92872 ms · 2026-05-10T20:06:48.246355+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Notation Matters: A Benchmark Study of Token-Optimized Formats in Agentic AI Systems

    cs.AI 2026-05 conditional novelty 6.0

    TRON cuts tokens up to 27% with accuracy within 14pp of JSON on agentic benchmarks while TOON reaches 18% savings but triggers multi-turn parsing failures and parallel-call collapse on most models.

Reference graph

Works this paper leans on

3 extracted references · 3 canonical work pages · cited by 1 Pith paper

  1. [1]

    Oren Ben-Kiki, Clark Evans, and Ingy döt Net

    doi: 10.1145/3192366.3192369. Oren Ben-Kiki, Clark Evans, and Ingy döt Net. YAML ain’t markup language (YAML) version 1.2.https://yaml.org/spec/1.2.2/,

  2. [2]

    19 Tom B

    URLhttps://www.rfc-editor.org/rfc/rfc8259. 19 Tom B. Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, et al. Language models are few-shot learners.Advances in Neural Information Processing Systems, 33:1877–1901,

  3. [3]

    doi: 10.14778/3352063.3352077. OpenAI. tiktoken: Fast BPE tokeniser for use with OpenAI’s models.https://github.com/ openai/tiktoken,