Convnext v2: Co-designing and scaling convnets with masked autoencoders

Sanghyun Woo, Shoubhik Debnath, Ronghang Hu, Xinlei Chen, Zhuang Liu, In So Kweon, Saining Xie · 2023 · arXiv 2301.00808

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

read on arXiv browse 3 citing papers

citation-role summary

method 1

citation-polarity summary

use method 1

representative citing papers

Transcoda: End-to-End Zero-Shot Optical Music Recognition via Data-Centric Synthetic Training

cs.CV · 2026-05-11 · unverdicted · novelty 6.0

Transcoda achieves state-of-the-art zero-shot OMR with an 18.46% OMR-NED error rate on synthetic scores and 63.97% on historical Polish scans using a 59M model trained in 6 hours via synthetic data, kern normalization, and grammar decoding.

RaPA: Enhancing Transferable Targeted Attacks via Random Parameter Pruning

cs.LG · 2025-04-24 · conditional · novelty 6.0

Random parameter pruning during targeted attack optimization on surrogate models yields up to 11.7% higher average attack success rates when transferring to Transformer targets.

When Does Sparse MoE Help in Vision? The Role of Backbone Compute Leverage in Sparse Routing

cs.CV · 2026-05-15 · unverdicted · novelty 5.0

Sparse MoE vision models show positive accuracy gaps only when routing a substantial compute fraction ρ and using k≥2 experts at large scale; batch-axis dispatch is identified as a key failure mode.

citing papers explorer

Showing 3 of 3 citing papers.

Transcoda: End-to-End Zero-Shot Optical Music Recognition via Data-Centric Synthetic Training cs.CV · 2026-05-11 · unverdicted · none · ref 30
Transcoda achieves state-of-the-art zero-shot OMR with an 18.46% OMR-NED error rate on synthetic scores and 63.97% on historical Polish scans using a 59M model trained in 6 hours via synthetic data, kern normalization, and grammar decoding.
RaPA: Enhancing Transferable Targeted Attacks via Random Parameter Pruning cs.LG · 2025-04-24 · conditional · none · ref 53
Random parameter pruning during targeted attack optimization on surrogate models yields up to 11.7% higher average attack success rates when transferring to Transformer targets.
When Does Sparse MoE Help in Vision? The Role of Backbone Compute Leverage in Sparse Routing cs.CV · 2026-05-15 · unverdicted · none · ref 49
Sparse MoE vision models show positive accuracy gaps only when routing a substantial compute fraction ρ and using k≥2 experts at large scale; batch-axis dispatch is identified as a key failure mode.

Convnext v2: Co-designing and scaling convnets with masked autoencoders

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer