Toward open vocabulary aerial object detection with clip-activated student-teacher learning

Li, Y · arXiv 2509.17562

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

SLIP-RS: Structured-Attribute Language-Image Pre-Training for Remote Sensing Object Detection

cs.CV · 2026-05-22 · unverdicted · novelty 6.0

SLIP-RS introduces a Structured-Attribute Decoupling Paradigm with contrastive learning and a conformal reliability engine to create a 15M-attribute dataset for remote sensing pre-training.

WOW-Seg: A Word-free Open World Segmentation Model

cs.CV · 2026-05-16 · conditional · novelty 6.0

WOW-Seg proposes a word-free open-world segmentation model using Mask2Token and Cascade Attention Mask modules, reporting 89.7 semantic similarity and 82.4 semantic IoU on LVIS with one-eighth the parameters of prior SOTA plus a new 7,662-class benchmark.

citing papers explorer

Showing 2 of 2 citing papers.

SLIP-RS: Structured-Attribute Language-Image Pre-Training for Remote Sensing Object Detection cs.CV · 2026-05-22 · unverdicted · none · ref 6
SLIP-RS introduces a Structured-Attribute Decoupling Paradigm with contrastive learning and a conformal reliability engine to create a 15M-attribute dataset for remote sensing pre-training.
WOW-Seg: A Word-free Open World Segmentation Model cs.CV · 2026-05-16 · conditional · none · ref 7
WOW-Seg proposes a word-free open-world segmentation model using Mask2Token and Cascade Attention Mask modules, reporting 89.7 semantic similarity and 82.4 semantic IoU on LVIS with one-eighth the parameters of prior SOTA plus a new 7,662-class benchmark.

Toward open vocabulary aerial object detection with clip-activated student-teacher learning

fields

years

verdicts

representative citing papers

citing papers explorer