SurgMLLM unifies high-level reasoning and low-level visual grounding in one MLLM-based model for surgical videos, raising triplet recognition AP from 40.7% to 46.0% on the new CholecT45-Scene dataset with 64,299 annotated frames.
In: Proceedings of the IEEE/CVF International Conference on Computer Vision (2023)
Towards Unified Surgical Scene Understanding: Bridging Reasoning and Grounding via MLLMs