pith. sign in
Pith Number

pith:HLB5V2EM

pith:2026:HLB5V2EMI6GL3WMLZYGGYVCSRT
not attested not anchored not stored refs resolved

Assessing the Creativity of Large Language Models: Testing, Limits, and New Frontiers

Alexi Gladstone, Heng Ji, Jonah Black, Samuel Schapiro

The Divergent Remote Association Test predicts large language models' scientific ideation ability where other creativity tests fail.

arxiv:2605.13450 v1 · 2026-05-13 · cs.AI · cs.CL · cs.HC

Add to your LaTeX paper
\usepackage{pith}
\pithnumber{HLB5V2EMI6GL3WMLZYGGYVCSRT}

Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge

Record completeness

1 Bitcoin timestamp
2 Internet Archive
3 Author claim open · sign in to claim
4 Citations open
5 Replications open
Portable graph bundle live · download bundle · merged state
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same current state with the deterministic merge algorithm.

Claims

C1strongest claim

The DRAT is the first and only creativity test for LLMs that is a significant predictor of scientific ideation ability, demonstrating robustness across major design choices. Furthermore, the performance gain of the DRAT is not recoverable from any linear combination of the Divergent Association Task and the Remote Associates Test.

C2weakest assumption

That the human creativity tests and the new DRAT validly measure the target constructs of creative writing, divergent thinking, and scientific ideation in LLMs, despite the abstract noting that these tests have limited validity even for humans.

C3one line summary

The Divergent Remote Association Test (DRAT) is the first creativity test that significantly predicts LLMs' scientific ideation ability, unlike prior tests such as DAT or RAT.

References

29 extracted · 29 resolved · 5 Pith anchors

[1] doi: 10.1016/j.tsc.2021.100859 2021 · doi:10.1016/j.tsc.2021.100859
[2] 20 Chiang, W.-L., Zheng, L., Sheng, Y., Angelopoulos, A
[3] On the Measure of Intelligence 1911 · arXiv:1911.01547
[4] doi: 10.3758/s13423-018-1517-7 · doi:10.3758/s13423-018-1517-7
[5] CresOWLve: Benchmarking Creative Problem-Solving Over Real-World Knowledge · arXiv:2604.03374

Formal links

1 machine-checked theorem link

Receipt and verification
First computed 2026-05-18T02:44:41.906529Z
Builder pith-number-builder-2026-05-17-v1
Signature Pith Ed25519 (pith-v1-2026-05) · public key
Schema pith-number/v1.0

Canonical hash

3ac3dae88c478cbdd98bce0c6c54528cc037fe7b12cd0811d6bfad2ace532773

Aliases

arxiv: 2605.13450 · arxiv_version: 2605.13450v1 · doi: 10.48550/arxiv.2605.13450 · pith_short_12: HLB5V2EMI6GL · pith_short_16: HLB5V2EMI6GL3WML · pith_short_8: HLB5V2EM
Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/HLB5V2EMI6GL3WMLZYGGYVCSRT \
  | jq -c '.canonical_record' \
  | python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 3ac3dae88c478cbdd98bce0c6c54528cc037fe7b12cd0811d6bfad2ace532773
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "efe89762f839231939802c28a6b3aa2575fe6709ea712e4e09e6301d4e378b4b",
    "cross_cats_sorted": [
      "cs.CL",
      "cs.HC"
    ],
    "license": "http://creativecommons.org/licenses/by/4.0/",
    "primary_cat": "cs.AI",
    "submitted_at": "2026-05-13T12:46:05Z",
    "title_canon_sha256": "f89e3dc824ca299a0e7bf4fa92d54273981e405c905e82b1f2891db6a56888cd"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2605.13450",
    "kind": "arxiv",
    "version": 1
  }
}