Ov-deim: Real-time detr-style open-vocabulary object detection with gridsynthetic augmentation.arXiv preprint arXiv:2603.07022, 2026

Leilei Wang, Longfei Liu, Xi Shen, Xuanlong Yu, Ying Tiffany He, Fei Richard Yu, Yingyi Chen · 2026 · arXiv 2603.07022

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

VL-DINO: Leveraging CLIP Vision-Language Knowledge for Open-Vocabulary Object Detectio

cs.CV · 2026-06-10 · unverdicted · novelty 5.0

VL-DINO improves open-vocabulary object detection by adding QPSC, VSE, and ORSA modules that inject CLIP knowledge into DINO, reaching 36.3 and 38.1 AP zero-shot on LVIS.

citing papers explorer

Showing 1 of 1 citing paper after filters.

VL-DINO: Leveraging CLIP Vision-Language Knowledge for Open-Vocabulary Object Detectio cs.CV · 2026-06-10 · unverdicted · none · ref 36
VL-DINO improves open-vocabulary object detection by adding QPSC, VSE, and ORSA modules that inject CLIP knowledge into DINO, reaching 36.3 and 38.1 AP zero-shot on LVIS.

Ov-deim: Real-time detr-style open-vocabulary object detection with gridsynthetic augmentation.arXiv preprint arXiv:2603.07022, 2026

fields

years

verdicts

representative citing papers

citing papers explorer