Egtr: Extracting graph from transformer for scene graph generation // Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

Im Jinbae, Nam JeongYeon, Park Nokyung, Lee Hyungmin, Park Seunghyun · 2024

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

mKG-RAG: Leveraging Multimodal Knowledge Graphs in Retrieval-Augmented Generation for Knowledge-intensive VQA

cs.CV · 2025-08-07 · unverdicted · novelty 7.0

mKG-RAG constructs multimodal KGs via MLLM-driven extraction and vision-text matching then applies dual-stage query-aware retrieval to achieve new state-of-the-art results on knowledge-based VQA.

citing papers explorer

Showing 1 of 1 citing paper.

mKG-RAG: Leveraging Multimodal Knowledge Graphs in Retrieval-Augmented Generation for Knowledge-intensive VQA cs.CV · 2025-08-07 · unverdicted · none · ref 25
mKG-RAG constructs multimodal KGs via MLLM-driven extraction and vision-text matching then applies dual-stage query-aware retrieval to achieve new state-of-the-art results on knowledge-based VQA.

Egtr: Extracting graph from transformer for scene graph generation // Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

fields

years

verdicts

representative citing papers

citing papers explorer