Visual instruction tuning

Haotian Liu, Chunyuan Li, Qingyang Wu, Yong Jae Lee, “Visual instruction tuning,” · 2023

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

SmoGVLM: A Small, Graph-enhanced Vision-Language Model

cs.CV · 2026-04-15 · unverdicted · novelty 4.0

A graph-enhanced 1.3B-parameter VLM achieves up to 16.24% gains and outperforms larger VLMs by integrating structured knowledge via GNNs.

Improving MLLM Training Efficiency via Stage-Aware Sparsity

cs.LG · 2025-09-16 · unverdicted · novelty 4.0

Introduces stage-aware sparsity via Visual Token Compressor for modality alignment and Layer Dynamic Skipper for instruction tuning to improve MLLM training efficiency.

citing papers explorer

Showing 2 of 2 citing papers.

SmoGVLM: A Small, Graph-enhanced Vision-Language Model cs.CV · 2026-04-15 · unverdicted · none · ref 12
A graph-enhanced 1.3B-parameter VLM achieves up to 16.24% gains and outperforms larger VLMs by integrating structured knowledge via GNNs.
Improving MLLM Training Efficiency via Stage-Aware Sparsity cs.LG · 2025-09-16 · unverdicted · none · ref 14
Introduces stage-aware sparsity via Visual Token Compressor for modality alignment and Layer Dynamic Skipper for instruction tuning to improve MLLM training efficiency.

Visual instruction tuning

fields

years

verdicts

representative citing papers

citing papers explorer