Cider: Consensus-based image description evalua- tion

Ramakrishna Vedantam, C Lawrence Zitnick, Devi Parikh · 2015

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

citation-role summary

method 1

citation-polarity summary

use method 1

representative citing papers

SARVLM: A Vision Language Foundation Model for Semantic Understanding in SAR Imagery

cs.CV · 2025-10-26 · unverdicted · novelty 7.0

SARVLM is the first vision-language foundation model for SAR, trained via domain transfer on a 1M image-text dataset and outperforming prior models on 13 benchmarks for retrieval, recognition, detection, and captioning.

Curvature-Aware Captioning:Leveraging Geodesic Attention for 3D Scene Understanding

cs.CV · 2026-05-09 · unverdicted · novelty 6.0

A new framework combines self-attention on the Oblique manifold with bidirectional geodesic cross-attention on the Lorentz hyperboloid to improve both localization accuracy and descriptive coherence in 3D dense captioning.

citing papers explorer

Showing 2 of 2 citing papers.

SARVLM: A Vision Language Foundation Model for Semantic Understanding in SAR Imagery cs.CV · 2025-10-26 · unverdicted · none · ref 38
SARVLM is the first vision-language foundation model for SAR, trained via domain transfer on a 1M image-text dataset and outperforming prior models on 13 benchmarks for retrieval, recognition, detection, and captioning.
Curvature-Aware Captioning:Leveraging Geodesic Attention for 3D Scene Understanding cs.CV · 2026-05-09 · unverdicted · none · ref 44
A new framework combines self-attention on the Oblique manifold with bidirectional geodesic cross-attention on the Lorentz hyperboloid to improve both localization accuracy and descriptive coherence in 3D dense captioning.

Cider: Consensus-based image description evalua- tion

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer