When large vision-language models meet person re-identification

2024 · arXiv 2411.18111

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Beyond Visual Cues: Semantic-Driven Token Filtering and Expert Routing for Anytime Person ReID

cs.CV · 2026-04-16 · unverdicted · novelty 7.0

STFER uses LVLM-generated identity-consistent semantic text to drive visual token filtering and expert routing for improved any-time person re-identification under clothing changes and modality shifts.

Towards Robust Text-to-Image Person Retrieval: Multi-View Reformulation for Semantic Compensation

cs.CV · 2026-04-20 · unverdicted · novelty 5.0

A multi-view semantic reformulation and feature compensation method using LLMs and VLMs improves text-to-image person retrieval accuracy without training and reaches SOTA on three datasets.

citing papers explorer


Beyond Visual Cues: Semantic-Driven Token Filtering and Expert Routing for Anytime Person ReID
cs.CV · 2026-04-16 · unverdicted · none · ref 52 · internal anchor
Towards Robust Text-to-Image Person Retrieval: Multi-View Reformulation for Semantic Compensation
cs.CV · 2026-04-20 · unverdicted · none · ref 41 · internal anchor
