pith:ERDK2YZA
DiffVAS: Diffusion-Guided Visual Active Search in Partially Observable Environments
A diffusion model that reconstructs full geospatial maps from partial aerial glimpses enables a target-conditioned reinforcement learning planner to search for multiple object types at once.
arxiv:2605.15519 v1 · 2026-05-15 · cs.CV · cs.AI
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{ERDK2YZATZGPT4XDIYVR5CLI4D}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
DiffVAS leverages a diffusion model to reconstruct the entire geospatial area from sequentially observed partial glimpses, which enables a target-conditioned reinforcement learning-based planning module to effectively reason and guide subsequent search steps.
The diffusion model produces reconstructions of unobserved regions that are sufficiently accurate and useful for the downstream RL planner to improve search performance over baselines in partially observable settings.
DiffVAS combines diffusion-based reconstruction of unobserved geospatial regions with target-conditioned RL planning to enable multi-object visual active search in partially observable environments.
References
Formal links
Receipt and verification
| First computed | 2026-05-20T00:01:02.933310Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
2446ad63209e4cf9f2e3462b1e8968e0ec89d37e5bd912e562c7e458dd0a4a4d
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/ERDK2YZATZGPT4XDIYVR5CLI4D \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 2446ad63209e4cf9f2e3462b1e8968e0ec89d37e5bd912e562c7e458dd0a4a4d
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "db0329acb72c5ad1eb33d780d915e3208f368927781a9105547b35aa7d42d22d",
"cross_cats_sorted": [
"cs.AI"
],
"license": "http://creativecommons.org/licenses/by-nc-nd/4.0/",
"primary_cat": "cs.CV",
"submitted_at": "2026-05-15T01:30:07Z",
"title_canon_sha256": "144f4e7cf25d1c7dfd9b825a81d215c5a9cb6e0dc7ba38b9789c06acf95f7830"
},
"schema_version": "1.0",
"source": {
"id": "2605.15519",
"kind": "arxiv",
"version": 1
}
}