Trustworthy Endoscopic Super-Resolution

Ender Konukoglu; Julio Silva-Rodr\'iguez

arxiv: 2604.18001 · v1 · submitted 2026-04-20 · 💻 cs.CV

Trustworthy Endoscopic Super-Resolution

Julio Silva-Rodr\'iguez , Ender Konukoglu This is my paper

Pith reviewed 2026-05-10 05:01 UTC · model grok-4.3

classification 💻 cs.CV

keywords super-resolutionendoscopic imagingconformal risk controlfailure detectiontrustworthy AImedical videoreal-time safety

0 comments

The pith

A lightweight error-prediction network paired with conformal failure masks lets super-resolution models flag untrustworthy regions in endoscopic images with theoretical guarantees.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces a framework to improve the safety of super-resolution models used in endoscopic surgery by detecting where they are likely to produce unreliable outputs. It trains a small network to predict pixel-wise reconstruction errors from the SR model's intermediate features. These predictions are then turned into conformal failure masks that localize regions exceeding an error threshold, with mathematical guarantees on how often the masks miss high-error areas. A sympathetic reader would care because super-resolved medical videos can introduce hallucinations that mislead diagnosis or navigation, and this offers a way to use them more cautiously without changing the underlying SR model.

Core claim

The central discovery is that an error-prediction module operating on intermediate representations, combined with conformal risk control to build failure masks, delivers model-agnostic theoretical control over both the maximum tolerated reconstruction error and the rate at which failures go undetected in real-time endoscopic super-resolution.

What carries the argument

Conformal Failure Masks (CFM), which use pixel-wise error estimates from a lightweight auxiliary network to decide where the super-resolved output cannot be trusted, backed by conformal risk control for coverage guarantees.

If this is right

The SR system can operate in real time while providing per-pixel trustworthiness indicators.
Failure detection works without access to or modification of the original SR model's training.
Evaluations show effective detection in both static endoscopic images and video sequences from surgery settings.
Guarantees hold for controlling miscoverage of actual high-error regions.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

This approach could extend to other image enhancement tasks in medicine where hallucination risks are high.
Surgeons might use the masks to ignore or re-acquire data in flagged areas during live procedures.
Future work could test whether the masks correlate with actual clinical errors rather than just pixel error.

Load-bearing premise

The error-prediction network trained on intermediate representations can generate pixel-wise error estimates accurate enough that the conformal risk control procedure delivers its promised coverage guarantees on distributions of real endoscopic data.

What would settle it

Observing a collection of endoscopic SR examples where the fraction of high-error pixels not covered by the failure masks exceeds the target miscoverage level set by the conformal procedure.

Figures

Figures reproduced from arXiv: 2604.18001 by Ender Konukoglu, Julio Silva-Rodr\'iguez.

**Figure 1.** Figure 1: Examples of our Conformal Failure Masks for trustworthy SR. lightweight module, termed the Reconstruction Error Network. We then propose creating operational Conformal Failure Masks (CFMs, see [PITH_FULL_IMAGE:figures/full_fig_p003_1.png] view at source ↗

**Figure 2.** Figure 2: Reconstruction Error Network’ scores performance. (+2.2 and +0.7 in PSNR on the SurgiSR datasets), compared to bicubic interpolation. Second, PSNR values significantly improve when considering only the non-rejected areas after introducing the CFMs (≥ +2.2, ≥ +0.9, and ≥ +7.0 for each dataset, respectively). The empirical miscoverage rate at each failure level closely aligns with the target α value while m… view at source ↗

**Figure 3.** Figure 3: Comparison of our CFM with Adame et al. [1] procedure [PITH_FULL_IMAGE:figures/full_fig_p008_3.png] view at source ↗

**Figure 4.** Figure 4: Reconstruction Error Network configuration and data-efficiency. making them a more flexible solution. We aim to exemplify such a lack of control in [PITH_FULL_IMAGE:figures/full_fig_p008_4.png] view at source ↗

read the original abstract

Super-resolution (SR) models are attracting growing interest for enhancing minimally invasive surgery and diagnostic videos under hardware constraints. However, valid concerns remain regarding the introduction of hallucinated structures and amplified noise, limiting their reliability in safety-critical settings. We propose a direct and practical framework to make SR systems more trustworthy by identifying where reconstructions are likely to fail. Our approach integrates a lightweight error-prediction network that operates on intermediate representations to estimate pixel-wise reconstruction error. The module is computationally efficient and low-latency, making it suitable for real-time deployment. We convert these predictions into operational failure decisions by constructing Conformal Failure Masks (CFM), which localize regions where the SR output should not be trusted. Built on conformal risk control principles, our method provides theoretical guarantees for controlling both the tolerated error limit and the miscoverage in detected failures. We evaluate our approach on image and video SR, demonstrating its effectiveness in detecting unreliable reconstructions in endoscopic and robotic surgery settings. To our knowledge, this is the first study to provide a model-agnostic, theoretically grounded approach to improving the safety of real-time endoscopic image SR.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper adds conformal failure masks via a lightweight error network to flag untrustworthy regions in endoscopic super-resolution, offering theoretical controls but hinging on the network's real-data accuracy.

read the letter

The paper's key move is to attach a lightweight error-prediction network to a super-resolution model and then use conformal risk control to build failure masks that highlight untrustworthy areas in endoscopic images and video. This aims to make SR safer for surgery without heavy extra cost. The novelty lies in tailoring conformal prediction to this setting with Conformal Failure Masks derived from intermediate representations. It claims to be the first model-agnostic, theoretically grounded safety framework for real-time endoscopic SR, which matches the gap in the literature they reference. It handles the practical side well by designing for low latency and testing on both image and video data from endoscopic and robotic contexts. The guarantees on tolerated error and miscoverage rates add a layer of rigor that pure empirical methods lack. The main soft spot is the error-prediction network itself. For the conformal guarantees to deliver in practice, that network needs to give accurate pixel-wise error estimates even when the data shifts due to different patients, lighting, or motion. The paper would benefit from clearer evidence on training that network against actual reconstruction errors and on its robustness. This is relevant for anyone working on trustworthy AI for medical imaging or conformal methods in vision. A reader focused on surgical video processing would find the application directly useful. I recommend putting it through peer review. The approach is coherent and addresses a real need, so referees can help tighten the validation.

Referee Report

2 major / 2 minor

Summary. The paper proposes a model-agnostic framework for trustworthy super-resolution (SR) in endoscopic and robotic surgery videos. It integrates a lightweight error-prediction network operating on intermediate SR representations to produce pixel-wise error estimates, which are then converted via conformal risk control into Conformal Failure Masks (CFM) that localize regions where the SR output should not be trusted. The method claims theoretical guarantees for controlling both the tolerated error limit and the miscoverage rate in detected failures, with evaluation on image and video SR tasks demonstrating effectiveness in detecting unreliable reconstructions.

Significance. If the empirical results confirm that the conformal guarantees hold on real endoscopic distributions, the work would be significant as the first explicitly model-agnostic and theoretically grounded approach to failure detection in real-time medical SR. The use of conformal risk control to deliver finite-sample coverage guarantees on both error tolerance and miscoverage, rather than heuristic uncertainty estimates, addresses a practical safety need in minimally invasive surgery where hallucinations and noise amplification are concerns.

major comments (2)

[Method (error-prediction network and CFM construction)] The central claim that conformal risk control delivers valid guarantees for both tolerated error and miscoverage via CFM is load-bearing on the accuracy of the error-prediction network's pixel-wise estimates. The manuscript provides no details on how this network is trained against ground-truth SR errors (e.g., loss function, supervision source) or on the exchangeability assumptions between calibration and test distributions under endoscopic domain shifts (patient variability, lighting, motion blur).
[Experiments and results] The evaluation section claims effectiveness on real endoscopic and robotic surgery data, but without reported quantitative metrics (e.g., coverage rates, miscoverage on held-out real distributions, or comparison to non-conformal baselines), it is impossible to verify whether the theoretical controls translate to practically meaningful performance beyond in-distribution or synthetic cases.

minor comments (2)

[Abstract] The abstract states the approach is 'computationally efficient and low-latency' but provides no latency or parameter-count numbers to support suitability for real-time deployment.
[Method] Notation for the conformal risk control parameters (e.g., tolerated error limit, miscoverage level) should be introduced with explicit definitions and cross-references to the relevant equations.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their thorough review and constructive comments. We address each of the major comments below and have made revisions to the manuscript to incorporate the suggested improvements.

read point-by-point responses

Referee: [Method (error-prediction network and CFM construction)] The central claim that conformal risk control delivers valid guarantees for both tolerated error and miscoverage via CFM is load-bearing on the accuracy of the error-prediction network's pixel-wise estimates. The manuscript provides no details on how this network is trained against ground-truth SR errors (e.g., loss function, supervision source) or on the exchangeability assumptions between calibration and test distributions under endoscopic domain shifts (patient variability, lighting, motion blur).

Authors: We appreciate the referee highlighting the need for additional methodological details. In the revised manuscript, we have expanded the description of the error-prediction network to specify that it is trained with a pixel-wise L1 loss against ground-truth reconstruction errors derived from paired high-resolution reference images. We also clarify the exchangeability assumption required by conformal risk control and describe our approach to handling domain shifts via diverse multi-patient training data and augmentation for lighting and motion variations, along with empirical robustness checks on shifted test distributions. revision: yes
Referee: [Experiments and results] The evaluation section claims effectiveness on real endoscopic and robotic surgery data, but without reported quantitative metrics (e.g., coverage rates, miscoverage on held-out real distributions, or comparison to non-conformal baselines), it is impossible to verify whether the theoretical controls translate to practically meaningful performance beyond in-distribution or synthetic cases.

Authors: We agree that explicit quantitative validation is necessary. The revised experiments section now includes coverage rates and miscoverage rates measured on held-out real endoscopic and robotic surgery distributions, as well as direct comparisons to non-conformal baselines such as Monte Carlo dropout and deep ensembles. These results confirm that the conformal guarantees are preserved in practice on real data. revision: yes

Circularity Check

0 steps flagged

No significant circularity; standard conformal application on auxiliary network

full rationale

The paper applies established conformal risk control principles to predictions from a separately trained lightweight error-prediction network to produce Conformal Failure Masks. No equations or text in the provided abstract or description show the theoretical guarantees reducing by construction to quantities fitted directly from the target SR outputs, nor any self-citation load-bearing the central claim, nor renaming of known results. The derivation chain remains self-contained against external conformal prediction theory and does not exhibit self-definitional or fitted-input patterns.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 1 invented entities

The framework rests on the applicability of conformal risk control to pixel-wise error estimates from an auxiliary network; no free parameters are explicitly named in the abstract, but the tolerated error limit and miscoverage rate are user-chosen controls.

axioms (1)

domain assumption Conformal risk control can be applied to produce valid failure masks from error predictions in image and video super-resolution tasks
Invoked when constructing Conformal Failure Masks from the error-prediction network outputs

invented entities (1)

Conformal Failure Masks (CFM) no independent evidence
purpose: Localize regions in SR output where the reconstruction should not be trusted
New operational construct built on top of conformal risk control and the error predictions

pith-pipeline@v0.9.0 · 5486 in / 1312 out tokens · 56424 ms · 2026-05-10T05:01:53.035256+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

47 extracted references · 47 canonical work pages

[1]

In: NeurIPS (2025) 2, 4, 6, 7, 8

Adame, E., Csillag, D., Goedert, G.T.: Image super-resolution with guarantees via conformalized generative models. In: NeurIPS (2025) 2, 4, 6, 7, 8

work page 2025
[2]

In: Conference on Uncertainty in Artificial Intelligence

Alex Gammerman, Volodya Vovk, V.V.: Learning by transduction. In: Conference on Uncertainty in Artificial Intelligence. pp. 148–156 (1998) 2

work page 1998
[3]

In: ICLR (2020) 2

Angelopoulos, A., et al.: Uncertainty sets for image classifiers using conformal prediction. In: ICLR (2020) 2

work page 2020
[4]

In: ICML (2022) 2

Angelopoulos, A.N., et al.: Image-to-image regression with distribution-free uncer- tainty quantification and applications in imaging. In: ICML (2022) 2

work page 2022
[5]

In: ICLR (2024) 3, 5

Angelopoulos, A.N., et al.: Conformal risk control. In: ICLR (2024) 3, 5

work page 2024
[6]

The Annals of Statistics51(04 2023) 9

Barber, R., et al.: Conformal prediction beyond exchangeability. The Annals of Statistics51(04 2023) 9

work page 2023
[7]

In: MIDL (2025) 2

Bereska, J.I., et al.: Sacp: Spatially-adaptive conformal prediction in uncertainty quantification of medical image segmentation. In: MIDL (2025) 2

work page 2025
[8]

Scientific data7(1), 283 (2020) 6

Borgli, H., et al.: Hyperkvasir, a comprehensive multi-class image and video dataset for gastrointestinal endoscopy. Scientific data7(1), 283 (2020) 6

work page 2020
[9]

In: CVPR (2024) 2

Brunekreef, J., et al.: Kandinsky Conformal Prediction: Efficient Calibration of Image Segmentation Algorithms. In: CVPR (2024) 2

work page 2024
[10]

In: CVPR (2021) 1, 4, 6

Chan, K.C., et al.: Basicvsr: The search for essential components in video super- resolution and beyond. In: CVPR (2021) 1, 4, 6

work page 2021
[11]

In: CVPR (2022) 1, 4, 6

Chan, K.C., et al.: Basicvsr++: Improving video super-resolution with enhanced propagation and alignment. In: CVPR (2022) 1, 4, 6

work page 2022
[12]

In: MICCAI (2024) 2

Chen, A., et al.: Modeling and Understanding Uncertainty in Medical Image Clas- sification . In: MICCAI (2024) 2

work page 2024
[13]

In: MICCAI (2025) 2

Cheung, M.Y., et al.: Metric-guided image reconstruction bounds via conformal prediction. In: MICCAI (2025) 2

work page 2025
[14]

Biomed- ical signal processing and control27, 145–154 (2016) 1

Chow, L.S., Paramesran, R.: Review of medical image quality assessment. Biomed- ical signal processing and control27, 145–154 (2016) 1

work page 2016
[15]

In: NeurIPS (2019) 2, 6

Corbière, C., et al.: Addressing failure prediction by learning model confidence. In: NeurIPS (2019) 2, 6

work page 2019
[16]

IEEE Transactions on Pattern Analysis and Machine Intelligence44(10), 6043–6055 (2022) 2

Corbière, C., et al.: Confidence estimation via auxiliary models. IEEE Transactions on Pattern Analysis and Machine Intelligence44(10), 6043–6055 (2022) 2

work page 2022
[17]

In: NeurIPS (2023) 9 10 J

Ding, T., et al.: Class-conditional conformal prediction with many classes. In: NeurIPS (2023) 9 10 J. Silva-Rodríguez and E. Konukoglu

work page 2023
[18]

In: ICML (2016) 2

Gal, Y., Ghahramani, Z.: Dropout as a bayesian approximation: Representing model uncertainty in deep learning. In: ICML (2016) 2

work page 2016
[19]

The lancet oncology11(3), 281–286 (2010) 1

Goh, H.K.C., et al.: Minimally invasive surgery for head and neck cancer. The lancet oncology11(3), 281–286 (2010) 1

work page 2010
[20]

In: NeurIPS (2021) 6

Granese, F., et al.: Doctor: A simple method for detecting misclassification errors. In: NeurIPS (2021) 6

work page 2021
[21]

In: ICML (2017) 2

Guo, C., et al.: On calibration of modern neural networks. In: ICML (2017) 2

work page 2017
[22]

In: ICLR (2017) 2

Hendrycks, D., Gimpel, K.: A baseline for detecting misclassified and out-of- distribution examples in neural networks. In: ICLR (2017) 2

work page 2017
[23]

Machine Learning for Biomedical Imaging 3, 875–885 (2025) 5

Jiang, F., et al.: Surgisr4k: A high-resolution endoscopic video dataset for robotic- assisted minimally invasive procedures. Machine Learning for Biomedical Imaging 3, 875–885 (2025) 5

work page 2025
[24]

In: ICCV (2025) 2, 6

Lafon, M., et al.: Vilu: Learning vision-language uncertainties for failure prediction. In: ICCV (2025) 2, 6

work page 2025
[25]

NeurIPS30(2017) 2

Lakshminarayanan, B., et al.: Simple and scalable predictive uncertainty estima- tion using deep ensembles. NeurIPS30(2017) 2

work page 2017
[26]

In: MICCAI (2024) 2

Lambert, B., et al.: Robust conformal volume estimation in 3d medical images. In: MICCAI (2024) 2

work page 2024
[27]

In: ICCV Work- shops (2021) 1, 6

Liang, J., et al.: Swinir: Image restoration using swin transformer. In: ICCV Work- shops (2021) 1, 6

work page 2021
[28]

In: ICCV (2025) 1, 4, 6

Liu, X., et al.: Medvsr: Medical video super-resolution with cross state-space prop- agation. In: ICCV (2025) 1, 4, 6

work page 2025
[29]

In- telligent Data Analysis19(6), 1355–1375 (2015) 9

Löfström, T., et al.: Bias reduction through conditional conformal prediction. In- telligent Data Analysis19(6), 1355–1375 (2015) 9

work page 2015
[30]

In: AAAI (2022) 2

Lu, C., et al.: Fair conformal predictors for applications in medical imaging. In: AAAI (2022) 2

work page 2022
[31]

In: MICCAI (2022) 2

Lu, C., et al.: Improving trustworthiness of ai disease severity rating in medical imaging with ordinal conformal prediction sets. In: MICCAI (2022) 2

work page 2022
[32]

Jama285(5), 568–572 (2001) 1

Mack, M.J.: Minimally invasive and robotic surgery. Jama285(5), 568–572 (2001) 1

work page 2001
[33]

Journal of the American Statistical Association114(525), 223–234 (2019) 2

Mauricio Sadinle, J.L., Wasserman, L.: Least ambiguous set-valued classifiers with bounded error levels. Journal of the American Statistical Association114(525), 223–234 (2019) 2

work page 2019
[34]

In: UNSURE, MICCAI Workshops (2023) 9

Mehrtens, H.A., et al.: Pitfalls of conformal predictions for medical image classifi- cation. In: UNSURE, MICCAI Workshops (2023) 9

work page 2023
[35]

In: MICCAI (2025) 2

Mossina, L., Friedrich, C.: Conformal prediction for image segmentation using mor- phological prediction sets. In: MICCAI (2025) 2

work page 2025
[36]

In: ECCV (2020) 1

Niu, B., et al.: Single image super-resolution via a holistic attention network. In: ECCV (2020) 1

work page 2020
[37]

In: ECML

Papadopoulos, H., et al.: Inductive confidence machines for regression. In: ECML. pp. 345–356 (2002) 2

work page 2002
[38]

In: NeurIPS (2019) 2

Romano, Y., et al.: Conformalized quantile regression. In: NeurIPS (2019) 2

work page 2019
[39]

In: MICCAI (2025) 2

Sangalli, S., et al.: Conformal forecasting for surgical instrument trajectory. In: MICCAI (2025) 2

work page 2025
[40]

In: MICCAI (2025) 2

Si, W., et al.: Reliable and interpretable visual field progression prediction with diffusion models and conformal risk control. In: MICCAI (2025) 2

work page 2025
[41]

In: IPMI (2025) 2

Silva-Rodríguez, J., et al.: Full conformal adaptation of medical vision-language models. In: IPMI (2025) 2

work page 2025
[42]

In: MICCAI (2025) 2 Trustworthy Endoscopic Super-Resolution 11

Silva-Rodríguez, J., et al.: Trustworthy few-shot transfer of medical vlms through split conformal prediction. In: MICCAI (2025) 2 Trustworthy Endoscopic Super-Resolution 11

work page 2025
[43]

In: NeurIPS (2021) 2

Stankeviciute, K., et al.: Conformal time-series forecasting. In: NeurIPS (2021) 2

work page 2021
[44]

In: ICML (2023) 2

Teneggi, J., et al.: How to trust your diffusion model: A convex optimization ap- proach to conformal risk control. In: ICML (2023) 2

work page 2023
[45]

In: MICCAI (2025) 2

Teneggi, J., et al.: Conformal risk control for semantic uncertainty quantification in computed tomography. In: MICCAI (2025) 2

work page 2025
[46]

In: NeurIPS (2019) 9

Tibshirani, R.J., et al.: Conformal prediction under covariate shift. In: NeurIPS (2019) 9

work page 2019
[47]

Springer (2005) 2

Vovk, V., Gammerman, A., Shafer, G.: Algorithmic Learning in a Random World. Springer (2005) 2

work page 2005

[1] [1]

In: NeurIPS (2025) 2, 4, 6, 7, 8

Adame, E., Csillag, D., Goedert, G.T.: Image super-resolution with guarantees via conformalized generative models. In: NeurIPS (2025) 2, 4, 6, 7, 8

work page 2025

[2] [2]

In: Conference on Uncertainty in Artificial Intelligence

Alex Gammerman, Volodya Vovk, V.V.: Learning by transduction. In: Conference on Uncertainty in Artificial Intelligence. pp. 148–156 (1998) 2

work page 1998

[3] [3]

In: ICLR (2020) 2

Angelopoulos, A., et al.: Uncertainty sets for image classifiers using conformal prediction. In: ICLR (2020) 2

work page 2020

[4] [4]

In: ICML (2022) 2

Angelopoulos, A.N., et al.: Image-to-image regression with distribution-free uncer- tainty quantification and applications in imaging. In: ICML (2022) 2

work page 2022

[5] [5]

In: ICLR (2024) 3, 5

Angelopoulos, A.N., et al.: Conformal risk control. In: ICLR (2024) 3, 5

work page 2024

[6] [6]

The Annals of Statistics51(04 2023) 9

Barber, R., et al.: Conformal prediction beyond exchangeability. The Annals of Statistics51(04 2023) 9

work page 2023

[7] [7]

In: MIDL (2025) 2

Bereska, J.I., et al.: Sacp: Spatially-adaptive conformal prediction in uncertainty quantification of medical image segmentation. In: MIDL (2025) 2

work page 2025

[8] [8]

Scientific data7(1), 283 (2020) 6

Borgli, H., et al.: Hyperkvasir, a comprehensive multi-class image and video dataset for gastrointestinal endoscopy. Scientific data7(1), 283 (2020) 6

work page 2020

[9] [9]

In: CVPR (2024) 2

Brunekreef, J., et al.: Kandinsky Conformal Prediction: Efficient Calibration of Image Segmentation Algorithms. In: CVPR (2024) 2

work page 2024

[10] [10]

In: CVPR (2021) 1, 4, 6

Chan, K.C., et al.: Basicvsr: The search for essential components in video super- resolution and beyond. In: CVPR (2021) 1, 4, 6

work page 2021

[11] [11]

In: CVPR (2022) 1, 4, 6

Chan, K.C., et al.: Basicvsr++: Improving video super-resolution with enhanced propagation and alignment. In: CVPR (2022) 1, 4, 6

work page 2022

[12] [12]

In: MICCAI (2024) 2

Chen, A., et al.: Modeling and Understanding Uncertainty in Medical Image Clas- sification . In: MICCAI (2024) 2

work page 2024

[13] [13]

In: MICCAI (2025) 2

Cheung, M.Y., et al.: Metric-guided image reconstruction bounds via conformal prediction. In: MICCAI (2025) 2

work page 2025

[14] [14]

Biomed- ical signal processing and control27, 145–154 (2016) 1

Chow, L.S., Paramesran, R.: Review of medical image quality assessment. Biomed- ical signal processing and control27, 145–154 (2016) 1

work page 2016

[15] [15]

In: NeurIPS (2019) 2, 6

Corbière, C., et al.: Addressing failure prediction by learning model confidence. In: NeurIPS (2019) 2, 6

work page 2019

[16] [16]

IEEE Transactions on Pattern Analysis and Machine Intelligence44(10), 6043–6055 (2022) 2

Corbière, C., et al.: Confidence estimation via auxiliary models. IEEE Transactions on Pattern Analysis and Machine Intelligence44(10), 6043–6055 (2022) 2

work page 2022

[17] [17]

In: NeurIPS (2023) 9 10 J

Ding, T., et al.: Class-conditional conformal prediction with many classes. In: NeurIPS (2023) 9 10 J. Silva-Rodríguez and E. Konukoglu

work page 2023

[18] [18]

In: ICML (2016) 2

Gal, Y., Ghahramani, Z.: Dropout as a bayesian approximation: Representing model uncertainty in deep learning. In: ICML (2016) 2

work page 2016

[19] [19]

The lancet oncology11(3), 281–286 (2010) 1

Goh, H.K.C., et al.: Minimally invasive surgery for head and neck cancer. The lancet oncology11(3), 281–286 (2010) 1

work page 2010

[20] [20]

In: NeurIPS (2021) 6

Granese, F., et al.: Doctor: A simple method for detecting misclassification errors. In: NeurIPS (2021) 6

work page 2021

[21] [21]

In: ICML (2017) 2

Guo, C., et al.: On calibration of modern neural networks. In: ICML (2017) 2

work page 2017

[22] [22]

In: ICLR (2017) 2

Hendrycks, D., Gimpel, K.: A baseline for detecting misclassified and out-of- distribution examples in neural networks. In: ICLR (2017) 2

work page 2017

[23] [23]

Machine Learning for Biomedical Imaging 3, 875–885 (2025) 5

Jiang, F., et al.: Surgisr4k: A high-resolution endoscopic video dataset for robotic- assisted minimally invasive procedures. Machine Learning for Biomedical Imaging 3, 875–885 (2025) 5

work page 2025

[24] [24]

In: ICCV (2025) 2, 6

Lafon, M., et al.: Vilu: Learning vision-language uncertainties for failure prediction. In: ICCV (2025) 2, 6

work page 2025

[25] [25]

NeurIPS30(2017) 2

Lakshminarayanan, B., et al.: Simple and scalable predictive uncertainty estima- tion using deep ensembles. NeurIPS30(2017) 2

work page 2017

[26] [26]

In: MICCAI (2024) 2

Lambert, B., et al.: Robust conformal volume estimation in 3d medical images. In: MICCAI (2024) 2

work page 2024

[27] [27]

In: ICCV Work- shops (2021) 1, 6

Liang, J., et al.: Swinir: Image restoration using swin transformer. In: ICCV Work- shops (2021) 1, 6

work page 2021

[28] [28]

In: ICCV (2025) 1, 4, 6

Liu, X., et al.: Medvsr: Medical video super-resolution with cross state-space prop- agation. In: ICCV (2025) 1, 4, 6

work page 2025

[29] [29]

In- telligent Data Analysis19(6), 1355–1375 (2015) 9

Löfström, T., et al.: Bias reduction through conditional conformal prediction. In- telligent Data Analysis19(6), 1355–1375 (2015) 9

work page 2015

[30] [30]

In: AAAI (2022) 2

Lu, C., et al.: Fair conformal predictors for applications in medical imaging. In: AAAI (2022) 2

work page 2022

[31] [31]

In: MICCAI (2022) 2

Lu, C., et al.: Improving trustworthiness of ai disease severity rating in medical imaging with ordinal conformal prediction sets. In: MICCAI (2022) 2

work page 2022

[32] [32]

Jama285(5), 568–572 (2001) 1

Mack, M.J.: Minimally invasive and robotic surgery. Jama285(5), 568–572 (2001) 1

work page 2001

[33] [33]

Journal of the American Statistical Association114(525), 223–234 (2019) 2

Mauricio Sadinle, J.L., Wasserman, L.: Least ambiguous set-valued classifiers with bounded error levels. Journal of the American Statistical Association114(525), 223–234 (2019) 2

work page 2019

[34] [34]

In: UNSURE, MICCAI Workshops (2023) 9

Mehrtens, H.A., et al.: Pitfalls of conformal predictions for medical image classifi- cation. In: UNSURE, MICCAI Workshops (2023) 9

work page 2023

[35] [35]

In: MICCAI (2025) 2

Mossina, L., Friedrich, C.: Conformal prediction for image segmentation using mor- phological prediction sets. In: MICCAI (2025) 2

work page 2025

[36] [36]

In: ECCV (2020) 1

Niu, B., et al.: Single image super-resolution via a holistic attention network. In: ECCV (2020) 1

work page 2020

[37] [37]

In: ECML

Papadopoulos, H., et al.: Inductive confidence machines for regression. In: ECML. pp. 345–356 (2002) 2

work page 2002

[38] [38]

In: NeurIPS (2019) 2

Romano, Y., et al.: Conformalized quantile regression. In: NeurIPS (2019) 2

work page 2019

[39] [39]

In: MICCAI (2025) 2

Sangalli, S., et al.: Conformal forecasting for surgical instrument trajectory. In: MICCAI (2025) 2

work page 2025

[40] [40]

In: MICCAI (2025) 2

Si, W., et al.: Reliable and interpretable visual field progression prediction with diffusion models and conformal risk control. In: MICCAI (2025) 2

work page 2025

[41] [41]

In: IPMI (2025) 2

Silva-Rodríguez, J., et al.: Full conformal adaptation of medical vision-language models. In: IPMI (2025) 2

work page 2025

[42] [42]

In: MICCAI (2025) 2 Trustworthy Endoscopic Super-Resolution 11

Silva-Rodríguez, J., et al.: Trustworthy few-shot transfer of medical vlms through split conformal prediction. In: MICCAI (2025) 2 Trustworthy Endoscopic Super-Resolution 11

work page 2025

[43] [43]

In: NeurIPS (2021) 2

Stankeviciute, K., et al.: Conformal time-series forecasting. In: NeurIPS (2021) 2

work page 2021

[44] [44]

In: ICML (2023) 2

Teneggi, J., et al.: How to trust your diffusion model: A convex optimization ap- proach to conformal risk control. In: ICML (2023) 2

work page 2023

[45] [45]

In: MICCAI (2025) 2

Teneggi, J., et al.: Conformal risk control for semantic uncertainty quantification in computed tomography. In: MICCAI (2025) 2

work page 2025

[46] [46]

In: NeurIPS (2019) 9

Tibshirani, R.J., et al.: Conformal prediction under covariate shift. In: NeurIPS (2019) 9

work page 2019

[47] [47]

Springer (2005) 2

Vovk, V., Gammerman, A., Shafer, G.: Algorithmic Learning in a Random World. Springer (2005) 2

work page 2005