arxiv: 2606.20140 · 2 revisions
SA-VIS: Sparse frame Annotations for training Video Instance Segmentation