pith. sign in

Imagenet: A large-scale hierarchical image database

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

fields

cs.CV 3 cs.AI 1

clear filters

representative citing papers

LPT: Less-overfitting Prompt Tuning for Vision-Language Model

cs.CV · 2024-10-14 · unverdicted · novelty 5.0

LPT reduces overfitting during prompt tuning of VLMs by CLIP-based foreground filtering, a structural preservation constraint aligning features to frozen CLIP, and a hierarchical logit constraint at the output, improving generalization on base-to-novel, cross-dataset, and domain-generalization tasks

citing papers explorer

Showing 2 of 2 citing papers after filters.

  • DualToken: Towards Unifying Visual Understanding and Generation with Dual Visual Vocabularies cs.CV · 2025-03-18 · unverdicted · none · ref 8

    DualToken disentangles semantics and appearance via separate codebooks in one tokenizer, reporting 0.25 rFID, 82% ImageNet zero-shot accuracy, and gains over VILA-U on understanding and generation benchmarks.

  • LPT: Less-overfitting Prompt Tuning for Vision-Language Model cs.CV · 2024-10-14 · unverdicted · none · ref 5

    LPT reduces overfitting during prompt tuning of VLMs by CLIP-based foreground filtering, a structural preservation constraint aligning features to frozen CLIP, and a hierarchical logit constraint at the output, improving generalization on base-to-novel, cross-dataset, and domain-generalization tasks