Pith Number

pith:4PH2U7EJ

pith:2026:4PH2U7EJ5TS5WAGLAZ4SZDX4KG
not attested · not anchored · not stored · refs pending

ScriptHOI: Learning Scripted State Transitions for Open-Vocabulary Human-Object Interaction Detection

Bao Ngoc Le, Linh Chi Vo, Minh Anh Nguyen, Quang Huy Tran, Suiyang Guang, Tuan Kiet Pham

Decomposing interaction phrases into state slots lets the detector verify multiple visual cues, which improves rare and unseen human-object interaction detection.

arxiv:2605.05057 v3 · 2026-05-06 · cs.CV

Add to your LaTeX paper
\usepackage{pith}
\pithnumber{4PH2U7EJ5TS5WAGLAZ4SZDX4KG}

Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv.

Record completeness

1 Bitcoin timestamp
2 Internet Archive
3 Author claim open
4 Citations open
5 Replications open
Portable graph bundle live · download bundle · merged state
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same current state with the deterministic merge algorithm.

Claims

C1 strongest claim

Experiments on HICO-DET, V-COCO, and open-vocabulary HOI splits show that ScriptHOI improves rare and unseen interaction recognition while substantially reducing affordance-conflict false positives.

C2 weakest assumption

That the visual state tokenizer can reliably parse human-object pairs into accurate state tokens for the six slots and that script coverage and conflict estimates provide valid calibration without introducing new biases or missing critical visual cues.

C3 one line summary

ScriptHOI improves rare and unseen HOI recognition by decomposing interaction phrases into state slots. Visual tokenization and slot-wise matching yield script coverage and conflict estimates, which calibrate predictions and constrain training on incomplete labels.

Receipt and verification
First computed 2026-05-27T01:04:58.799764Z
Builder pith-number-builder-2026-05-17-v1
Signature Pith Ed25519 (pith-v1-2026-05) · public key
Schema pith-number/v1.0

Canonical hash

e3cfaa7c89ece5db00cb06792c8efc5180aded54535774509e9f54a89bf8998a

Aliases

arxiv: 2605.05057 · arxiv_version: 2605.05057v3 · doi: 10.48550/arxiv.2605.05057 · pith_short_12: 4PH2U7EJ5TS5 · pith_short_16: 4PH2U7EJ5TS5WAGL · pith_short_8: 4PH2U7EJ
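In this record, the Pith Number and its short aliases appear to be derivable from the canonical hash. Base32-encoding the hash bytes (RFC 4648 alphabet) and keeping the first 26 characters reproduces the number shown on this page, and each pith_short_N alias is its first N characters. This relationship is inferred from the values above, not taken from a published specification, so the sketch below is an assumption. The function names are hypothetical.

```python
import base64

# Canonical hash copied from this record.
CANONICAL_HASH = "e3cfaa7c89ece5db00cb06792c8efc5180aded54535774509e9f54a89bf8998a"


def pith_number_from_hash(hex_digest: str, length: int = 26) -> str:
    """Base32-encode the digest bytes and keep the leading characters.

    Assumption: this rule is inferred from the record on this page;
    it is not a documented part of the pith-number schema.
    """
    encoded = base64.b32encode(bytes.fromhex(hex_digest)).decode("ascii")
    return encoded[:length]


def short_aliases(pith_number: str, sizes=(8, 12, 16)) -> dict:
    """Build the pith_short_N aliases as prefixes of the full number."""
    return {f"pith_short_{n}": pith_number[:n] for n in sizes}


number = pith_number_from_hash(CANONICAL_HASH)
print(number)  # 4PH2U7EJ5TS5WAGLAZ4SZDX4KG
print(short_aliases(number))
```

If the inferred rule holds for other records, a mirror could recompute the identifier from the hash alone. A mismatch would mean either that the rule is wrong or that the record has changed.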
Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/4PH2U7EJ5TS5WAGLAZ4SZDX4KG \
  | jq -c '.canonical_record' \
  | python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: e3cfaa7c89ece5db00cb06792c8efc5180aded54535774509e9f54a89bf8998a
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "ae304ecf7f00f4789847681c0eddaf2c8bfae8916d17565ba5f3950676dc8644",
    "cross_cats_sorted": [],
    "license": "http://creativecommons.org/licenses/by-sa/4.0/",
    "primary_cat": "cs.CV",
    "submitted_at": "2026-05-06T15:52:35Z",
    "title_canon_sha256": "5ba83a70b6539342e3fa653a8d3464a206d40a03beaee8e71d5a75b31d7aef91"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2605.05057",
    "kind": "arxiv",
    "version": 3
  }
}
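The shell pipeline in the verification step can also be run offline against a saved copy of the record. The sketch below mirrors its canonicalization (sorted keys, compact separators, no ASCII escaping, UTF-8 bytes) followed by SHA-256. The function name canonical_sha256 is hypothetical. Whether the digest equals the canonical hash depends on the saved record being exactly what the service hashed, so compare the printed value yourself.

```python
import hashlib
import json

# Canonical record copied from this page.
RECORD_JSON = """
{
  "metadata": {
    "abstract_canon_sha256": "ae304ecf7f00f4789847681c0eddaf2c8bfae8916d17565ba5f3950676dc8644",
    "cross_cats_sorted": [],
    "license": "http://creativecommons.org/licenses/by-sa/4.0/",
    "primary_cat": "cs.CV",
    "submitted_at": "2026-05-06T15:52:35Z",
    "title_canon_sha256": "5ba83a70b6539342e3fa653a8d3464a206d40a03beaee8e71d5a75b31d7aef91"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2605.05057",
    "kind": "arxiv",
    "version": 3
  }
}
"""


def canonical_sha256(record: dict) -> str:
    """Serialize with the same options as the page's one-liner, then hash."""
    canon = json.dumps(
        record,
        sort_keys=True,          # key order must not affect the digest
        separators=(",", ":"),   # no whitespace between tokens
        ensure_ascii=False,      # keep non-ASCII characters as UTF-8
    )
    return hashlib.sha256(canon.encode("utf-8")).hexdigest()


record = json.loads(RECORD_JSON)
digest = canonical_sha256(record)
print(digest)  # compare against the canonical hash listed in this record
```

Because the keys are sorted before hashing, reordering fields in the saved file does not change the digest. Any change to a value does.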