Pith Number

pith:Y573BBBU

pith:2026:Y573BBBUW5Q4MRIMXVJOGM3QUW
not attested · not anchored · not stored · refs resolved

RTI-Bench: A Structured Dataset for Indian Right-to-Information Decision Analysis

Joy Bose

RTI-Bench supplies the first structured collection of Indian Central Information Commission decisions with outcome labels and exemption details.

arxiv:2605.16843 v1 · 2026-05-16 · cs.CL

Add to your LaTeX paper
\usepackage{pith}
\pithnumber{Y573BBBUW5Q4MRIMXVJOGM3QUW}

Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv.

Record completeness

1 Bitcoin timestamp
2 Internet Archive
3 Author claim open
4 Citations open
5 Replications open
Portable graph bundle: live
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same current state with the deterministic merge algorithm.

Claims

C1 · strongest claim

To the best of our knowledge it is the first publicly released structured dataset for Indian RTI administrative decisions.

C2 · weakest assumption

That rule-based extraction plus manual review on a 50-case sample produces sufficiently accurate and representative labels across the full 298-PDF collection despite only 51% coverage in the first release.

C3 · one line summary

RTI-Bench is the first publicly released structured dataset of CIC administrative decisions with outcome labels, exemption citations, IRAC reasoning, and timelines, built from 1,218 corpus cases and 298 PDFs, achieving 95.3% label precision on manual review and 57.3% accuracy on a Mistral 7B zero-Sh

Receipt and verification
First computed: 2026-05-20T00:03:25.783211Z
Builder: pith-number-builder-2026-05-17-v1
Signature: Pith Ed25519 (pith-v1-2026-05)
Schema: pith-number/v1.0

Canonical hash

c77fb08434b761c6450cbd52e33370a5a6d3566c0de2a96c38ce5aaeb892c23e

Aliases

arxiv: 2605.16843 · arxiv_version: 2605.16843v1 · doi: 10.48550/arxiv.2605.16843 · pith_short_12: Y573BBBUW5Q4 · pith_short_16: Y573BBBUW5Q4MRIM · pith_short_8: Y573BBBU
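The short aliases listed above are prefixes of the full Pith Number. For this record, the full number also matches the first 26 characters of the RFC 4648 Base32 encoding of the canonical hash. The sketch below checks both relationships. It is based on what this page shows, not on a documented derivation rule, and the helper names `base32_prefix` and `short_aliases` are hypothetical.

```python
import base64

# Values copied from this record.
CANONICAL_HASH = "c77fb08434b761c6450cbd52e33370a5a6d3566c0de2a96c38ce5aaeb892c23e"
PITH_NUMBER = "Y573BBBUW5Q4MRIMXVJOGM3QUW"


def base32_prefix(hex_digest: str, length: int) -> str:
    """Return the first `length` characters of the Base32 encoding of a hex digest."""
    raw = bytes.fromhex(hex_digest)
    return base64.b32encode(raw).decode("ascii")[:length]


def short_aliases(pith_number: str) -> dict:
    """Build the 8-, 12-, and 16-character aliases as plain prefixes (assumed rule)."""
    return {f"pith_short_{n}": pith_number[:n] for n in (8, 12, 16)}


derived = base32_prefix(CANONICAL_HASH, len(PITH_NUMBER))
print(derived == PITH_NUMBER)
print(short_aliases(PITH_NUMBER))
```

If the prefix relationship holds for other records as well, a short alias can be checked against a canonical hash without contacting the server. That generalization is an assumption.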
Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/Y573BBBUW5Q4MRIMXVJOGM3QUW \
  | jq -c '.canonical_record' \
  | python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: c77fb08434b761c6450cbd52e33370a5a6d3566c0de2a96c38ce5aaeb892c23e
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "9fb4fa25a45680b08970dda9ad3d069b064baccea5b242df0cf11bc063c919d0",
    "cross_cats_sorted": [],
    "license": "http://creativecommons.org/licenses/by/4.0/",
    "primary_cat": "cs.CL",
    "submitted_at": "2026-05-16T07:02:10Z",
    "title_canon_sha256": "fca80cd2a9e8fc758d818810440e9b6ec25cf3067021cecd9e6c81e5eec9497e"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2605.16843",
    "kind": "arxiv",
    "version": 1
  }
}
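The hashing step of the verification command can also be run offline against a saved copy of the record. The following is a minimal sketch that uses the same serialization settings as the one-liner above (sorted keys, compact separators, non-ASCII kept as-is, UTF-8 bytes). The function name `canonical_sha256` is hypothetical, and the `record` literal is copied from the JSON shown on this page, which may differ from what the API currently returns.

```python
import hashlib
import json


def canonical_sha256(record: dict) -> str:
    """Hash a record using sorted keys and compact separators, encoded as UTF-8."""
    canonical = json.dumps(
        record, sort_keys=True, separators=(",", ":"), ensure_ascii=False
    )
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


# Canonical record as displayed on this page.
record = {
    "metadata": {
        "abstract_canon_sha256": "9fb4fa25a45680b08970dda9ad3d069b064baccea5b242df0cf11bc063c919d0",
        "cross_cats_sorted": [],
        "license": "http://creativecommons.org/licenses/by/4.0/",
        "primary_cat": "cs.CL",
        "submitted_at": "2026-05-16T07:02:10Z",
        "title_canon_sha256": "fca80cd2a9e8fc758d818810440e9b6ec25cf3067021cecd9e6c81e5eec9497e",
    },
    "schema_version": "1.0",
    "source": {"id": "2605.16843", "kind": "arxiv", "version": 1},
}

digest = canonical_sha256(record)
print(digest)  # compare by eye with the canonical hash listed on this page
```

Because keys are sorted before hashing, the digest does not depend on the order in which fields appear in the saved file. Any change to a value, including whitespace inside a string, produces a different digest.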