Pith Number
pith:CVVQ5U6Z
pith:2024:CVVQ5U6ZAPZQOEDEYH7QLSO6PQ
not attested
not anchored
not stored
refs resolved
SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents
Advancements in GUI grounding directly improve the performance of visual agents that automate tasks from screenshots alone.
arxiv:2401.10935 v2 · 2024-01-17 · cs.HC · cs.AI
Record completeness
1
Bitcoin timestamp
2
Internet Archive
3
Author claim
· sign in to
claim
4
Citations
5
Replications
✓
Portable graph bundle live · download bundle · merged
state
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same
current state with the deterministic merge algorithm.
Claims
C1strongest claim
advancements in GUI grounding directly correlate with enhanced performance in downstream GUI agent tasks
C2weakest assumption
That the automatically curated GUI grounding data is sufficiently high-quality and representative to enable effective transfer to real agent tasks across environments.
C3one line summary
SeeClick improves visual GUI agents via GUI grounding pre-training on automatically curated data and introduces the ScreenSpot benchmark, with results indicating that stronger grounding boosts downstream task performance.
References
[1] Aho and Jeffrey D
[2] Publications Manual , year = "1983", publisher =
[3] Chandra and Dexter C
[4] Scalable training of
[5] Dan Gusfield , title =. 1997
Formal links
Cited by
Receipt and verification
| First computed | 2026-05-17T23:38:14.418669Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
156b0ed3d903f3071064c1ff05c9de7c107098706c7beb3b249c632e0ef6faf4
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/CVVQ5U6ZAPZQOEDEYH7QLSO6PQ \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 156b0ed3d903f3071064c1ff05c9de7c107098706c7beb3b249c632e0ef6faf4
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "c715e54ca2fc5df30d79d57a56510b383f51991a97577d6e6757f967c307f952",
"cross_cats_sorted": [
"cs.AI"
],
"license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
"primary_cat": "cs.HC",
"submitted_at": "2024-01-17T08:10:35Z",
"title_canon_sha256": "9c130e118c2a1c05f74f4892c3b481834ad6af8d966941736349a41f6527fb8a"
},
"schema_version": "1.0",
"source": {
"id": "2401.10935",
"kind": "arxiv",
"version": 2
}
}