TULIP: Token-length upgraded CLIP. arXiv preprint arXiv:2410.10034, 2024

2024 · arXiv:2410.10034

2 Pith papers cite this work.

representative citing papers

Learning from Semantic Dictionaries: Discriminative Codebook Contrastive Learning for Unified Visual Representation and Generation

cs.CV · 2026-05-24 · unverdicted · novelty 6.0

LEASE achieves state-of-the-art unified performance on ImageNet-1K by combining masked token reconstruction and codebook contrast losses in a one-time precomputed discrete token space.

TuringViT: Making SOTA Vision Transformers Accessible to All

cs.CV · 2026-06-23 · unverdicted · novelty 5.0

TuringViT claims a new ViT design with linear attention and curated data that matches SOTA performance using 10% of typical pretraining data while supporting dynamic resolutions and improving VLM integration.

citing papers explorer

Learning from Semantic Dictionaries: Discriminative Codebook Contrastive Learning for Unified Visual Representation and Generation

cs.CV · 2026-05-24 · unverdicted · none · ref 40

TuringViT: Making SOTA Vision Transformers Accessible to All

cs.CV · 2026-06-23 · unverdicted · none · ref 23
