Hiera: A hi- erarchical vision transformer without the bells-and-whistles

Chaitanya Ryali, Yuan-Ting Hu, Daniel Bolya, Chen Wei, Haoqi Fan, Po-Yao Huang, Vaibhav Aggarwal, Arkabandhu Chowdhury, Omid Poursaeed, Judy Hoffman, Jitendra Malik, Yanghao Li, Christoph Feichtenhofer · 2023

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

TinySAM 2: Extreme Memory Compression for Efficient Track Anything Model

cs.CV · 2026-05-18 · conditional · novelty 5.0

TinySAM 2 reaches 90% of SAM 2.1 performance on DAVIS and SA-V using 7% of the memory tokens and 3% of the training data via frame selection, spatial average pooling, temporal similarity-based token pruning, and a RepViT image encoder.

citing papers explorer

Showing 1 of 1 citing paper.

TinySAM 2: Extreme Memory Compression for Efficient Track Anything Model cs.CV · 2026-05-18 · conditional · none · ref 21
TinySAM 2 reaches 90% of SAM 2.1 performance on DAVIS and SA-V using 7% of the memory tokens and 3% of the training data via frame selection, spatial average pooling, temporal similarity-based token pruning, and a RepViT image encoder.

Hiera: A hi- erarchical vision transformer without the bells-and-whistles

fields

years

verdicts

representative citing papers

citing papers explorer