Trans-SVNet: Accurate phase recognition from surgical videos via hybrid embedding aggregation transformer. CoRR, abs/2103.09712

Xiaojie Gao, Yueming Jin, Yonghao Long, Qi Dou, Pheng-Ann Heng · 2021 · arXiv 2103.09712

1 Pith paper cites this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Event-Level Detection of Surgical Instrument Handovers in Videos with Interpretable Vision Models

cs.CV · 2026-04-08 · unverdicted · novelty 5.0

A ViT-LSTM spatiotemporal model detects surgical instrument handovers and classifies direction in videos, achieving F1 of 0.84 for detection and 0.72 mean F1 for direction on kidney transplant data.

citing papers explorer

Showing 1 of 1 citing paper.

Event-Level Detection of Surgical Instrument Handovers in Videos with Interpretable Vision Models

cs.CV · 2026-04-08 · unverdicted · none · ref 9

A ViT-LSTM spatiotemporal model detects surgical instrument handovers and classifies direction in videos, achieving F1 of 0.84 for detection and 0.72 mean F1 for direction on kidney transplant data.

