pith. sign in
Pith Number

pith:27UO5YVA

pith:2026:27UO5YVAGI5LBPUQ5VEPFP6FDP
not attested not anchored not stored refs resolved

GHGbench: A Unified Multi-Entity, Multi-Task Benchmark for Carbon Emission Prediction

Chao Xue, Flora Salim, Lihuan Li, Siyuan Zheng, Yifan Duan

GHGbench shows building carbon emissions are structurally harder to predict than company emissions, with out-of-distribution gaps dominating model differences.

arxiv:2605.13743 v1 · 2026-05-13 · cs.LG

Add to your LaTeX paper
\usepackage{pith}
\pithnumber{27UO5YVAGI5LBPUQ5VEPFP6FDP}

Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge

Record completeness

1 Bitcoin timestamp
2 Internet Archive
3 Author claim open · sign in to claim
4 Citations open
5 Replications open
Portable graph bundle live · download bundle · merged state
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same current state with the deterministic merge algorithm.

Claims

C1strongest claim

Three benchmark-level findings emerge: (i) building emissions are structurally harder than company emissions; (ii) the in-distribution to out-of-distribution gap dwarfs any within-model gap across both the company track and the building track, and a tabular foundation model is, to our knowledge, the first baseline to open a paired-bootstrap-significant gap over tuned trees on a multi-city building-emissions task; (iii) multimodal remote-sensing embeddings help precisely where tabular generalisation breaks.

C2weakest assumption

The harmonization of 13 heterogeneous building data sources into a single schema produces accurate, unbiased labels and features without introducing systematic errors that affect the reported generalization gaps.

C3one line summary

GHGbench is a new multi-entity benchmark for company- and building-level carbon emission prediction that shows building tasks are harder, out-of-distribution gaps dominate, and multimodal data aids generalization.

References

46 extracted · 46 resolved · 3 Pith anchors

[1] Maddix, Hao Wang, Michael W 2024
[2] The Claude 3 model family: Opus, Sonnet, Haiku 2024
[3] EnergyStar++: Towards more accurate and explanatory building energy benchmarking.Applied Energy, 276:115413, 2020 2020 · doi:10.1016/j.apenergy.2020.115413
[4] Greenhouse gases emissions: Estimating corporate non-reported emissions using interpretable machine learning 2023 · doi:10.3390/su15043391
[5] Addressing data gaps in sustainability reporting: A benchmark dataset for greenhouse gas emission extraction.Scientific Data, 12: 1497, 2025 2025 · doi:10.1038/s41597-025-05664-8

Formal links

2 machine-checked theorem links

Receipt and verification
First computed 2026-05-18T02:44:16.439537Z
Builder pith-number-builder-2026-05-17-v1
Signature Pith Ed25519 (pith-v1-2026-05) · public key
Schema pith-number/v1.0

Canonical hash

d7e8eee2a0323ab0be90ed48f2bfc51be4fb8136da20b6a99f8227586c075091

Aliases

arxiv: 2605.13743 · arxiv_version: 2605.13743v1 · doi: 10.48550/arxiv.2605.13743 · pith_short_12: 27UO5YVAGI5L · pith_short_16: 27UO5YVAGI5LBPUQ · pith_short_8: 27UO5YVA
Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/27UO5YVAGI5LBPUQ5VEPFP6FDP \
  | jq -c '.canonical_record' \
  | python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: d7e8eee2a0323ab0be90ed48f2bfc51be4fb8136da20b6a99f8227586c075091
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "3b95f73a0584fcb4c5eea5a84d09b7723eadb4d9987d99d9fb78fb67029af875",
    "cross_cats_sorted": [],
    "license": "http://creativecommons.org/licenses/by/4.0/",
    "primary_cat": "cs.LG",
    "submitted_at": "2026-05-13T16:20:49Z",
    "title_canon_sha256": "a0081fb0206bd3dfde8246955b5f65aaac5ecdc8b6740c2b46ee2e6adfcc5ecb"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2605.13743",
    "kind": "arxiv",
    "version": 1
  }
}