Mask2Former for Video Instance Segmentation

2021 · arXiv 2112.10764

5 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

method: 1

citation-polarity summary

use method: 1

representative citing papers

Mind the Gap: Disentangling Performance Bottlenecks in Video Instance Segmentation

cs.CV · 2026-06-05 · unverdicted · novelty 7.0

An ILP-based oracle applied to seven VIS methods on YouTube-VIS and OVIS shows tracking instability as the dominant bottleneck, producing gaps exceeding 20 AP under occlusion while classification impact is secondary.

SA-VIS: Sparse frame Annotations for training Video Instance Segmentation

cs.CV · 2026-06-18 · unverdicted · novelty 6.0

SA-VIS trains video instance segmentation models on sparse frame annotations via a Past-frames Feature Propagation module and frame-specific instance queries, showing only a 0.4% AP drop versus dense training on YouTube-VIS and OVIS benchmarks.

GOLD-BEV: GrOund and aeriaL Data for Dense Semantic BEV Mapping of Dynamic Scenes

cs.CV · 2026-04-21 · unverdicted · novelty 6.0

GOLD-BEV learns dense BEV semantic maps including dynamic agents from ego-centric sensors by using synchronized aerial imagery for training supervision and pseudo-label generation.

Primus: Enforcing Attention Usage for 3D Medical Image Segmentation

cs.CV · 2025-03-03 · unverdicted · novelty 6.0

Primus and PrimusV2 are Transformer-centric models that match or exceed nnU-Net and top CNNs on nine 3D medical segmentation datasets by enforcing attention usage.

PAT-VCM: Plug-and-Play Auxiliary Tokens for Video Coding for Machines

cs.CV · 2026-04-14 · unverdicted · novelty 5.0

PAT-VCM adds lightweight auxiliary tokens to a shared baseline video stream to support multiple downstream machine tasks without task-specific codecs.

citing papers explorer

Showing 4 of 4 citing papers after filters.

Mind the Gap: Disentangling Performance Bottlenecks in Video Instance Segmentation
cs.CV · 2026-06-05 · unverdicted · none · ref 2

SA-VIS: Sparse frame Annotations for training Video Instance Segmentation
cs.CV · 2026-06-18 · unverdicted · none · ref 3

GOLD-BEV: GrOund and aeriaL Data for Dense Semantic BEV Mapping of Dynamic Scenes
cs.CV · 2026-04-21 · unverdicted · none · ref 7

PAT-VCM: Plug-and-Play Auxiliary Tokens for Video Coding for Machines
cs.CV · 2026-04-14 · unverdicted · none · ref 9
