Global meets local: Dual activation hashing network for large-scale fine-grained image retrieval
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.CV (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers:
Composed Vision-Language Retrieval for Skin Cancer Case Search via Joint Alignment of Global and Local Representations
A transformer framework for composed vision-language retrieval in skin cancer uses hierarchical query representations and global-local alignment to improve performance over prior methods on the Derm7pt dataset.