Differentiable Rendering-based Pose Estimation for Surgical Robotic Instruments

Florian Richter; Michael C. Yip; Zekai Liang; Zih-Yun Chiu

arxiv: 2503.05953 · v1 · pith:XKZCPEHPnew · submitted 2025-03-07 · 💻 cs.RO

Differentiable Rendering-based Pose Estimation for Surgical Robotic Instruments

Zekai Liang , Zih-Yun Chiu , Florian Richter , Michael C. Yip This is my paper

classification 💻 cs.RO

keywords surgicalposecalibrationrobotictrackingangleautomationdifferentiable

0 comments

read the original abstract

Robot pose estimation is a challenging and crucial task for vision-based surgical robotic automation. Typical robotic calibration approaches, however, are not applicable to surgical robots, such as the da Vinci Research Kit (dVRK), due to joint angle measurement errors from cable-drives and the partially visible kinematic chain. Hence, previous works in surgical robotic automation used tracking algorithms to estimate the pose of the surgical tool in real-time and compensate for the joint angle errors. However, a big limitation of these previous tracking works is the initialization step which relied on only keypoints and SolvePnP. In this work, we fully explore the potential of geometric primitives beyond just keypoints with differentiable rendering, cylinders, and construct a versatile pose matching pipeline in a novel pose hypothesis space. We demonstrate the state-of-the-art performance of our single-shot calibration method with both calibration consistency and real surgical tasks. As a result, this marker-less calibration approach proves to be a robust and generalizable initialization step for surgical tool tracking.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

SurfSurg6D: Geometry Consistent Dense Correspondence for Textureless Surgical Instrument Pose Estimation
cs.CV 2026-05 unverdicted novelty 5.0

A new synthetic dataset and geometry-consistent dense correspondence framework improve RGB-only pose estimation accuracy for surgical instruments on three evaluation datasets.