A video is worth three views: Trigeminal transformers for video-based person re-identification. IEEE Transactions on Intelligent Transportation Systems, 25(9):12818–12828

Xuehu Liu, Pingping Zhang, Chenyang Yu, Xuesheng Qian, Xiaoyun Yang, Huchuan Lu · 2024

1 Pith paper cites this work. Polarity classification is still being indexed.


representative citing papers

Beyond Pedestrians: Caption-Guided CLIP Framework for High-Difficulty Video-based Person Re-Identification

cs.CV · 2026-04-09 · unverdicted · novelty 5.0

CG-CLIP adds caption-guided memory refinement and token-based spatiotemporal aggregation to CLIP for video person ReID, outperforming SOTA on MARS, iLIDS-VID, SportsVReID and DanceVReID.

citing papers explorer

Showing 1 of 1 citing paper.

Beyond Pedestrians: Caption-Guided CLIP Framework for High-Difficulty Video-based Person Re-Identification

cs.CV · 2026-04-09 · unverdicted · none · ref 37
