We train a model on this unified data for 200 epochs at a learning rate decay rate of 0.97
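To give a sense of what a decay rate of 0.97 over 200 epochs implies, here is a minimal sketch. It assumes the decay is exponential and applied once per epoch, which the excerpt does not state. The initial learning rate `INITIAL_LR` and the helper `decayed_learning_rate` are hypothetical and chosen only for illustration; only the values 0.97 and 200 come from the excerpt.

```python
def decayed_learning_rate(initial_lr: float, decay_rate: float, epoch: int) -> float:
    """Exponential decay: multiply the rate by decay_rate once per completed epoch."""
    return initial_lr * decay_rate ** epoch


INITIAL_LR = 1e-3   # hypothetical starting value, not given in the excerpt
DECAY_RATE = 0.97   # from the excerpt
EPOCHS = 200        # from the excerpt

# Learning rate used during each epoch (epoch 0 uses the undecayed rate).
schedule = [decayed_learning_rate(INITIAL_LR, DECAY_RATE, e) for e in range(EPOCHS)]

# Fraction of the starting rate that remains in the final epoch.
final_fraction = schedule[-1] / schedule[0]
print(f"final / initial learning rate: {final_fraction:.5f}")
```

Under these assumptions the learning rate in the last epoch is well under 1% of its starting value, so most of the effective optimization would happen in the earlier epochs.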

For GaMa, we randomly sample videos so that the corpus size is equivalent to the other two

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

VidTAG: Temporally Aligned Video to GPS Geolocalization with Denoising Sequence Prediction at a Global Scale

cs.CV · 2026-04-14 · unverdicted · novelty 7.0

VidTAG achieves fine-grained global video-to-GPS geolocalization via temporal frame alignment and denoising sequence refinement, reporting 20% gains at 1 km over GeoCLIP and 25% on CityGuessr68k.
