arXiv preprint arXiv:2110.09408

Hrformer: High-resolution transformer for dense prediction · 2019 · arXiv 2110.09408

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Accelerating Vision Transformers with Adaptive Patch Sizes

cs.CV · 2025-10-20 · conditional · novelty 6.0

APT adaptively varies patch sizes within a single image to reduce ViT token count, delivering 40-50% throughput gains on large models with no downstream performance loss.

Dual-Prompt CLIP with Hybrid Visual Encoders for Occluded Person Re-Identification

cs.CV · 2026-05-19 · unverdicted · novelty 5.0

DPL-ReID adds dual prompt learning, real-world occlusion augmentation, and weighted gated fusion to CLIP for state-of-the-art occluded person re-identification on benchmark datasets.

citing papers explorer

Showing 2 of 2 citing papers.

Accelerating Vision Transformers with Adaptive Patch Sizes cs.CV · 2025-10-20 · conditional · none · ref 17
APT adaptively varies patch sizes within a single image to reduce ViT token count, delivering 40-50% throughput gains on large models with no downstream performance loss.
Dual-Prompt CLIP with Hybrid Visual Encoders for Occluded Person Re-Identification cs.CV · 2026-05-19 · unverdicted · none · ref 16
DPL-ReID adds dual prompt learning, real-world occlusion augmentation, and weighted gated fusion to CLIP for state-of-the-art occluded person re-identification on benchmark datasets.

arXiv preprint arXiv:2110.09408

fields

years

verdicts

representative citing papers

citing papers explorer