{"paper":{"title":"DiffVAS: Diffusion-Guided Visual Active Search in Partially Observable Environments","license":"http://creativecommons.org/licenses/by-nc-nd/4.0/","headline":"A diffusion model that reconstructs full geospatial maps from partial aerial glimpses enables a target-conditioned reinforcement learning planner to search for multiple object types at once.","cross_cats":["cs.AI"],"primary_cat":"cs.CV","authors_text":"Aleksis Pirinen, Anindya Sarkar, Nathan Jacobs, Srikumar Sastry, Yevgeniy Vorobeychik","submitted_at":"2026-05-15T01:30:07Z","abstract_excerpt":"Visual active search (VAS) has been introduced as a modeling framework that leverages visual cues to direct aerial (e.g., UAV-based) exploration and pinpoint areas of interest within extensive geospatial regions. Potential applications of VAS include detecting hotspots for rare wildlife poaching, aiding search-and-rescue missions, and uncovering illegal trafficking of weapons, among other uses. Previous VAS approaches assume that the entire search space is known upfront, which is often unrealistic due to constraints such as a restricted field of view and high acquisition costs, and they typica"},"claims":{"count":4,"items":[{"kind":"strongest_claim","text":"DiffVAS leverages a diffusion model to reconstruct the entire geospatial area from sequentially observed partial glimpses, which enables a target-conditioned reinforcement learning-based planning module to effectively reason and guide subsequent search steps.","source":"verdict.strongest_claim","status":"machine_extracted","claim_id":"C1","attestation":"unclaimed"},{"kind":"weakest_assumption","text":"The diffusion model produces reconstructions of unobserved regions that are sufficiently accurate and useful for the downstream RL planner to improve search performance over baselines in partially observable settings.","source":"verdict.weakest_assumption","status":"machine_extracted","claim_id":"C2","attestation":"unclaimed"},{"kind":"one_line_summary","text":"DiffVAS combines diffusion-based reconstruction of unobserved geospatial regions with target-conditioned RL planning to enable multi-object visual active search in partially observable environments.","source":"verdict.one_line_summary","status":"machine_extracted","claim_id":"C3","attestation":"unclaimed"},{"kind":"headline","text":"A diffusion model that reconstructs full geospatial maps from partial aerial glimpses enables a target-conditioned reinforcement learning planner to search for multiple object types at once.","source":"verdict.pith_extraction.headline","status":"machine_extracted","claim_id":"C4","attestation":"unclaimed"}],"snapshot_sha256":"e190d0d164282f46252e436d45e3f90f736556ce39d32421428c1aed3a7ba0b3"},"source":{"id":"2605.15519","kind":"arxiv","version":1},"verdict":{"id":"7f3a8f70-3ae7-4390-8a55-8c56446ab74a","model_set":{"reader":"grok-4.3"},"created_at":"2026-05-19T15:18:23.211141Z","strongest_claim":"DiffVAS leverages a diffusion model to reconstruct the entire geospatial area from sequentially observed partial glimpses, which enables a target-conditioned reinforcement learning-based planning module to effectively reason and guide subsequent search steps.","one_line_summary":"DiffVAS combines diffusion-based reconstruction of unobserved geospatial regions with target-conditioned RL planning to enable multi-object visual active search in partially observable environments.","pipeline_version":"pith-pipeline@v0.9.0","weakest_assumption":"The diffusion model produces reconstructions of unobserved regions that are sufficiently accurate and useful for the downstream RL planner to improve search performance over baselines in partially observable settings.","pith_extraction_headline":"A diffusion model that reconstructs full geospatial maps from partial aerial glimpses enables a target-conditioned reinforcement learning planner to search for multiple object types at once."},"integrity":{"clean":true,"summary":{"advisory":0,"critical":0,"by_detector":{},"informational":0},"endpoint":"/pith/2605.15519/integrity.json","findings":[],"available":true,"detectors_run":[{"name":"doi_title_agreement","ran_at":"2026-05-19T15:31:17.700996Z","status":"completed","version":"1.0.0","findings_count":0},{"name":"doi_compliance","ran_at":"2026-05-19T15:30:34.772873Z","status":"completed","version":"1.0.0","findings_count":0},{"name":"cited_work_retraction","ran_at":"2026-05-19T14:22:03.347595Z","status":"completed","version":"1.0.0","findings_count":0},{"name":"claim_evidence","ran_at":"2026-05-19T14:21:54.048273Z","status":"completed","version":"1.0.0","findings_count":0},{"name":"shingle_duplication","ran_at":"2026-05-19T13:49:41.843760Z","status":"skipped","version":"0.1.0","findings_count":0},{"name":"citation_quote_validity","ran_at":"2026-05-19T13:49:41.381434Z","status":"skipped","version":"0.1.0","findings_count":0},{"name":"ai_meta_artifact","ran_at":"2026-05-19T13:33:22.629573Z","status":"skipped","version":"1.0.0","findings_count":0}],"snapshot_sha256":"56c1e5ab221b8540289ae0b05615df35682831fdb2a540fe56ffc7527423acd5"},"references":{"count":36,"sample":[{"doi":"","year":2020,"title":"Luca Bartolomei, Lucas Teixeira, and Margarita Chli. 2020. Perception-aware path planning for uavs using semantic segmentation. In2020 IEEE/RSJ International Conference on Intelligent Robots and Syste","work_id":"37e3c8e0-6e7d-4480-8c16-9ef4f029f767","ref_index":1,"cited_arxiv_id":"","is_internal_anchor":false},{"doi":"","year":null,"title":"Elizabeth Bondi, Debadeepta Dey, Ashish Kapoor, Jim Piavis, Shital Shah, Fei Fang, Bistra Dilkina, Robert Hannaford, Arvind Iyer, Lucas Joppa, et al","work_id":"5ca874c9-79c0-48ae-b593-a4c7b9ec6335","ref_index":2,"cited_arxiv_id":"","is_internal_anchor":false},{"doi":"","year":null,"title":"InProceedings of the 1st ACM SIGCAS Conference on Computing and Sustainable Societies","work_id":"a831ac29-17ef-498d-8215-3f443ab60b2c","ref_index":3,"cited_arxiv_id":"","is_internal_anchor":false},{"doi":"","year":2018,"title":"Tung Dang, Christos Papachristos, and Kostas Alexis. 2018. Autonomous exploration and simultaneous object search using aerial robots. In2018 IEEE Aerospace Conference. IEEE, 1–7","work_id":"bb044a56-5bf7-411c-90a3-994a84cfa234","ref_index":4,"cited_arxiv_id":"","is_internal_anchor":false},{"doi":"","year":2016,"title":"Fei Fang, Thanh Nguyen, Rob Pickles, Wai Lam, Gopalasamy Clements, Bo An, Amandeep Singh, Milind Tambe, and Andrew Lemieux. 2016. Deploying paws: Field optimization of the protection assistant for wil","work_id":"46537736-8479-4c1f-8964-bacf69a516db","ref_index":5,"cited_arxiv_id":"","is_internal_anchor":false}],"resolved_work":36,"snapshot_sha256":"a88b18e126277f1993e8066be1d8feb18d4a1cb676bacce48365409dca4472e9","internal_anchors":3},"formal_canon":{"evidence_count":2,"snapshot_sha256":"1b985a1841ec4503cf51f2c499159672235f1fa209f41933b925cf1ce6323360"},"author_claims":{"count":0,"strong_count":0,"snapshot_sha256":"258153158e38e3291e3d48162225fcdb2d5a3ed65a07baac614ab91432fd4f57"},"builder_version":"pith-number-builder-2026-05-17-v1"}