GFNet : Global filter networks for visual recognition

doi: 10 · 2023 · arXiv 2023.326382

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

From Spatial to Spectral: An Efficient, Frequency-Guided Feature Representation Learner for Small Object Detection

cs.CV · 2026-06-22 · unverdicted · novelty 6.0

Proposes DERNet with Decompose-Enhance-Reconstruct operator and three plug-and-play modules to shift small object detection from spatial to spectral feature processing, claiming better performance than YOLOv11 with 1/6 the parameters.

Deep Psychovisual Image Representations

cs.CV · 2026-05-28 · unverdicted · novelty 6.0

Proposes a psychovisual-inspired deep learning method that encodes images in learned frequency sub-bands for interpretable semantic structures and reduced depth dependence.

Where Do Tokens Go? Understanding Pruning Behaviors in STEP at High Resolutions

cs.CV · 2025-09-17 · unverdicted · novelty 5.0

STEP uses dynamic superpatch merging via dCTS and early token exits to cut token count by 2.5x and computational complexity by up to 4x on ViT-Large for high-res segmentation, with at most 2% accuracy drop and 40% tokens halted early.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Where Do Tokens Go? Understanding Pruning Behaviors in STEP at High Resolutions cs.CV · 2025-09-17 · unverdicted · none · ref 60
STEP uses dynamic superpatch merging via dCTS and early token exits to cut token count by 2.5x and computational complexity by up to 4x on ViT-Large for high-res segmentation, with at most 2% accuracy drop and 40% tokens halted early.

GFNet : Global filter networks for visual recognition

fields

years

verdicts

representative citing papers

citing papers explorer