Grounding DINO fuses language and vision via feature enhancer, language-guided query selection, and cross-modality decoder in a DINO backbone, achieving 52.5 AP zero-shot on COCO and a new record of 26.1 AP mean on ODinW.
In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
2 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
roles
baseline 1polarities
baseline 1representative citing papers
A wavelet diffusion GAN for SISR reduces diffusion timesteps via the diffusion GAN paradigm and applies DWT for dimensionality reduction, claiming faster training/inference and higher fidelity than prior methods on CelebA-HQ.
citing papers explorer
-
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
Grounding DINO fuses language and vision via feature enhancer, language-guided query selection, and cross-modality decoder in a DINO backbone, achieving 52.5 AP zero-shot on COCO and a new record of 26.1 AP mean on ODinW.
-
A Wavelet Diffusion GAN for Image Super-Resolution
A wavelet diffusion GAN for SISR reduces diffusion timesteps via the diffusion GAN paradigm and applies DWT for dimensionality reduction, claiming faster training/inference and higher fidelity than prior methods on CelebA-HQ.