LoR-VP: Low-Rank Visual Prompting for Efficient Vision Model Adaptation

Can Jin; Dimitris N. Metaxas; Ligong Han; Mingyu Zhao; Shiyu Zhao; Tong Che; Xiaoxiao He; Ying Li; Zhenting Wang

arxiv: 2502.00896 · v3 · pith:JMZ6HDOTnew · submitted 2025-02-02 · 💻 cs.CV

LoR-VP: Low-Rank Visual Prompting for Efficient Vision Model Adaptation

Can Jin , Ying Li , Mingyu Zhao , Shiyu Zhao , Zhenting Wang , Xiaoxiao He , Ligong Han , Tong Che

show 1 more author

Dimitris N. Metaxas

This is my paper

classification 💻 cs.CV

keywords visualpromptingacrossimagelor-vpprompttimesinformation

0 comments

read the original abstract

Visual prompting has gained popularity as a method for adapting pre-trained models to specific tasks, particularly in the realm of parameter-efficient tuning. However, existing visual prompting techniques often pad the prompt parameters around the image, limiting the interaction between the visual prompts and the original image to a small set of patches while neglecting the inductive bias present in shared information across different patches. In this study, we conduct a thorough preliminary investigation to identify and address these limitations. We propose a novel visual prompt design, introducing Low-Rank matrix multiplication for Visual Prompting (LoR-VP), which enables shared and patch-specific information across rows and columns of image pixels. Extensive experiments across seven network architectures and four datasets demonstrate significant improvements in both performance and efficiency compared to state-of-the-art visual prompting methods, achieving up to 6 times faster training times, utilizing 18 times fewer visual prompt parameters, and delivering a 3.1% improvement in performance. The code is available as https://github.com/jincan333/LoR-VP.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Beyond Low-Rank: Low-Rank Sparse Prompting via Spiking Neural Network and Prompt Factorization
cs.CV 2026-06 unverdicted novelty 6.0

LoRSP integrates spiking neurons with low-rank factorization to produce sparse visual prompts for efficient adaptation of vision models across tasks.