3 Pith papers cite this work.
citing papers explorer
- Geo2Sound: A Scalable Geo-Aligned Framework for Soundscape Generation from Satellite Imagery
  Geo2Sound generates geographically realistic soundscapes from satellite imagery via geospatial attribute modeling, semantic hypothesis expansion, and geo-acoustic alignment, achieving SOTA FAD of 1.765 on a new 20k-pair benchmark.
- SafeScreen: A Safety-First Screening Framework for Personalized Video Retrieval for Vulnerable Users
  SafeScreen enforces individualized safety constraints as a prerequisite for video retrieval by using profile extraction, adaptive VideoRAG analysis, and LLM decision-making to approve content for vulnerable users.
- DetailCLIP: Injecting Image Details into CLIP's Feature Space
  A patch-based fusion method extends CLIP to high-resolution images by retaining multi-scale details for improved class-prompted retrieval.