pith. sign in

Audio-visual instance segmenta- tion

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

cs.CV 1

years

2026 1

verdicts

UNVERDICTED 1

representative citing papers

2nd of the 5th PVUW MeViS-Audio Track: ASR-SaSaSa2VA

cs.CV · 2026-04-27 · unverdicted · novelty 3.0

ASR-SaSaSa2VA turns audio into text via ASR then feeds it to pre-trained referring video segmentation models, achieving 80.7 and second place in the 5th PVUW MeViS-v2-Audio track.

citing papers explorer

Showing 1 of 1 citing paper.

  • 2nd of the 5th PVUW MeViS-Audio Track: ASR-SaSaSa2VA cs.CV · 2026-04-27 · unverdicted · none · ref 5

    ASR-SaSaSa2VA turns audio into text via ASR then feeds it to pre-trained referring video segmentation models, achieving 80.7 and second place in the 5th PVUW MeViS-v2-Audio track.