Training-Free Inference for High-Resolution Sinogram Completion

Bin Ren; Guannan Wang; Jiaze E; Srutarshi Banerjee; Tekin Bicer; Yanfu Zhang

arxiv: 2506.08809 · v5 · pith:OBJ6U752new · submitted 2025-06-10 · 💻 cs.CV · eess.IV

Training-Free Inference for High-Resolution Sinogram Completion

Jiaze E , Srutarshi Banerjee , Tekin Bicer , Guannan Wang , Yanfu Zhang , Bin Ren This is my paper

Pith reviewed 2026-05-19 10:35 UTC · model grok-4.3

classification 💻 cs.CV eess.IV

keywords sinogram completiondiffusion modelshigh resolutiontraining-free inferencecomputed tomographyadaptive allocationspatial heterogeneity

0 comments

The pith

HRSino uses spatial heterogeneity to adaptively allocate diffusion inference for efficient high-resolution sinogram completion.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces HRSino, a training-free method that makes diffusion-based completion of high-resolution sinograms more efficient. It does this by recognizing that different parts of the sinogram have different signal properties, like how sparse their frequencies are or how detailed they are locally. Instead of applying the same expensive process everywhere, HRSino handles broad structure at lower resolutions and only refines tricky spots at full resolution. This results in lower memory use and quicker processing while keeping the accuracy of the completed sinograms high across various data sets. Readers interested in medical imaging or efficient AI for scientific data would care because it could make high-quality CT scans more accessible without needing massive computing resources.

Core claim

By explicitly accounting for spatial heterogeneity in signal characteristics such as spectral sparsity and local complexity, HRSino allocates inference effort adaptively across spatial regions and resolutions. This captures global consistency at coarse scales while refining local details only where necessary, reducing peak memory usage by up to 30.81% and inference time by up to 17.58% compared to state-of-the-art frameworks without loss of completion accuracy.

What carries the argument

Adaptive inference allocation across regions and resolutions based on spatial heterogeneity of spectral sparsity and local complexity

If this is right

Peak memory usage for high-resolution sinogram completion is reduced by up to 30.81%.
Inference time is reduced by up to 17.58%.
Completion accuracy is maintained across different datasets and resolutions.
The method remains training-free, avoiding the need for task-specific fine-tuning.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

This could apply to other generative tasks in imaging where computation can be focused on complex areas.
Uniform diffusion steps may be wasteful when signal complexity varies spatially in projection data.
Testing on even higher resolutions or 3D volumes could reveal further scalability benefits.

Load-bearing premise

The method assumes that explicitly accounting for spatial heterogeneity in signal characteristics such as spectral sparsity and local complexity enables adaptive allocation of inference effort across regions and resolutions without loss of global consistency or local accuracy.

What would settle it

A benchmark experiment on a standard high-resolution CT sinogram dataset that shows no reduction in peak memory or inference time, or a drop in accuracy metrics such as PSNR or SSIM compared to uniform diffusion inference.

Figures

Figures reproduced from arXiv: 2506.08809 by Bin Ren, Guannan Wang, Jiaze E, Srutarshi Banerjee, Tekin Bicer, Yanfu Zhang.

**Figure 2.** Figure 2: Qualitative inpainting results on the Real-world dataset (column 1 to 2), Shape dataset (column 3 to 4), and Shepp2d dataset (column 5 to 6) with 0.8 mask ratio at 1024 × 1024 resolution. Odd columns and even columns show the sinogram and reconstructed images, respectively. 5 Conclusion & Limitations We present HiSin, a novel framework for efficient high-resolution sinogram inpainting. To address the GPU m… view at source ↗

read the original abstract

High-resolution sinogram completion is critical for computed tomography reconstruction, as missing projections can introduce severe artifacts. While diffusion models provide strong generative priors for this task, their inference cost grows prohibitively with resolution. We propose HRSino, a training-free and efficient diffusion inference approach for high-resolution sinogram completion. By explicitly accounting for spatial heterogeneity in signal characteristics, such as spectral sparsity and local complexity, HRSino allocates inference effort adaptively across spatial regions and resolutions, rather than applying uniform high-resolution diffusion steps. This enables global consistency to be captured at coarse scales while refining local details only where necessary. Experimental results show that HRSino reduces peak memory usage by up to 30.81% and inference time by up to 17.58% compared to the state-of-the-art framework, and maintains completion accuracy across datasets and resolutions.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

HRSino adapts diffusion effort to local sinogram heterogeneity for memory and time savings, but the cross-scale consistency mechanisms look underdeveloped.

read the letter

The main thing to know is that this paper introduces HRSino, a training-free diffusion inference method that allocates steps adaptively across regions and resolutions based on spatial heterogeneity like spectral sparsity and local complexity, rather than running uniform high-resolution diffusion everywhere. This targets the scaling problem in sinogram completion for CT reconstruction. It claims to capture global consistency at coarse scales while refining details only where needed, with reported gains of up to 30.81% lower peak memory and 17.58% less inference time versus the prior state-of-the-art, while holding accuracy across datasets and resolutions. That efficiency angle is the practical hook. The adaptive allocation itself is the clearest new piece here, as it applies existing diffusion priors in a non-uniform way without retraining. The focus on heterogeneity as a signal for where to spend compute is a direct and reasonable response to the cost issue. If the numbers hold up in the full experiments, this could make generative reconstruction more usable in settings where full high-res diffusion is too heavy. The soft spot is consistency. The approach assumes that coarse-scale structure plus local refinement will not create boundary artifacts or scale-transition problems, but the abstract gives little on how coarse latents or noise schedules feed into fine regions or whether overlap handling is explicit. If those links are loose, the generative prior could be violated locally even if average accuracy metrics look fine. The lack of dataset specifics, baselines, or error bars in the provided summary also leaves the soundness feeling preliminary. This is for people working on CT inverse problems or efficient diffusion inference in imaging. A reader focused on practical deployment in resource-limited medical workflows might find the allocation strategy worth testing. I would send it for peer review. The efficiency claims address a real constraint and the adaptive idea is distinct enough to merit referee time, though the consistency details will need tightening.

Referee Report

2 major / 1 minor

Summary. The manuscript proposes HRSino, a training-free diffusion inference method for high-resolution sinogram completion in computed tomography. It explicitly models spatial heterogeneity (spectral sparsity and local complexity) to allocate diffusion steps adaptively across regions and resolutions, capturing global consistency at coarse scales while refining local details only where needed. The central experimental claim is that this yields peak memory reductions of up to 30.81% and inference time reductions of up to 17.58% relative to the state-of-the-art framework, while preserving completion accuracy across datasets and resolutions.

Significance. If the accuracy preservation and efficiency gains are robustly demonstrated with proper controls, the work would be significant for practical deployment of diffusion priors in high-resolution medical imaging, where memory and latency constraints often limit applicability. The training-free design and explicit use of signal heterogeneity are strengths that could generalize beyond sinograms.

major comments (2)

[Abstract and §4] Abstract and §4 (Experiments): The reported memory and time reductions (30.81% and 17.58%) and accuracy maintenance are stated without specifying the exact datasets, number of test volumes or sinograms, baseline implementations, statistical tests, or error bars. This absence directly limits verification of whether the central efficiency-accuracy tradeoff claim holds under the reported conditions.
[§3] §3 (Method): The adaptive allocation mechanism is described as capturing consistency at coarse scales and refining locally, but no explicit description is given for propagating coarse-scale latents or noise schedules into fine-scale regions, nor for boundary consistency or cross-resolution conditioning. Without these interfaces, the generative prior may be violated locally even if aggregate metrics appear acceptable.

minor comments (1)

[§3] Notation for spectral sparsity and local complexity measures should be defined with explicit formulas or pseudocode in the method section to allow reproducibility.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed and constructive comments on our manuscript. We address each of the major comments below, indicating where revisions will be made to improve the paper.

read point-by-point responses

Referee: [Abstract and §4] Abstract and §4 (Experiments): The reported memory and time reductions (30.81% and 17.58%) and accuracy maintenance are stated without specifying the exact datasets, number of test volumes or sinograms, baseline implementations, statistical tests, or error bars. This absence directly limits verification of whether the central efficiency-accuracy tradeoff claim holds under the reported conditions.

Authors: We agree that providing more specific details on the experimental setup would enhance the reproducibility and verifiability of our results. In the revised version, we will expand the description in the abstract and §4 to include the exact datasets used, the number of test volumes and sinograms, details on how baselines were implemented, and any statistical tests or error bars associated with the reported metrics. This will allow readers to better assess the robustness of the efficiency gains while maintaining accuracy. revision: yes
Referee: [§3] §3 (Method): The adaptive allocation mechanism is described as capturing consistency at coarse scales and refining locally, but no explicit description is given for propagating coarse-scale latents or noise schedules into fine-scale regions, nor for boundary consistency or cross-resolution conditioning. Without these interfaces, the generative prior may be violated locally even if aggregate metrics appear acceptable.

Authors: We thank the referee for highlighting this aspect of the method description. While the core idea of adaptive allocation based on spatial heterogeneity is outlined in §3, we recognize that explicit details on the propagation of coarse-scale latents and noise schedules, as well as mechanisms for boundary consistency and cross-resolution conditioning, are important for ensuring the integrity of the generative process. We will revise §3 to include a more detailed explanation of these interfaces and how they preserve the diffusion prior across resolutions and regions. revision: yes

Circularity Check

0 steps flagged

No significant circularity; method is self-contained description of adaptive inference on external diffusion priors

full rationale

The paper presents HRSino as a training-free inference procedure that applies existing diffusion models with adaptive allocation based on spatial heterogeneity. No equations or claims reduce the reported memory/time savings or accuracy maintenance to a fitted parameter, self-definition, or self-citation chain. The central claims rest on experimental comparisons to prior frameworks rather than internal re-derivation of the priors themselves. The derivation chain is therefore independent of the target results.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 1 invented entities

The central claim depends on the effectiveness of diffusion models as generative priors for sinograms and the validity of adaptive effort allocation based on signal heterogeneity without accuracy loss.

axioms (1)

domain assumption Diffusion models provide strong generative priors for sinogram completion tasks.
This is invoked in the abstract as the foundation for applying diffusion models to the completion problem.

invented entities (1)

HRSino no independent evidence
purpose: Training-free efficient diffusion inference for high-resolution sinogram completion via adaptive spatial allocation.
This is the novel method introduced to address the inference cost issue.

pith-pipeline@v0.9.0 · 5683 in / 1317 out tokens · 45010 ms · 2026-05-19T10:35:22.666138+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

43 extracted references · 43 canonical work pages · 3 internal anchors

[1]

Computed tomography: fundamentals, system technology, image quality, applications

Willi A Kalender. Computed tomography: fundamentals, system technology, image quality, applications. John Wiley & Sons, 2011

work page 2011
[2]

Suppressing strain propagation in ultrahigh-ni cathodes during fast charging via epitaxial entropy-assisted coating

Chen Zhao, Chuanwei Wang, Xiang Liu, Inhui Hwang, Tianyi Li, Xinwei Zhou, Jiecheng Diao, Junjing Deng, Yan Qin, Zhenzhen Yang, et al. Suppressing strain propagation in ultrahigh-ni cathodes during fast charging via epitaxial entropy-assisted coating. Nature Energy, 9(3):345–356, 2024

work page 2024
[3]

Distributed optimization for nonrigid nano-tomography

Viktor Nikitin, Vincent De Andrade, Azat Slyamov, Benjamin J Gould, Yuepeng Zhang, Vandana Sam- pathkumar, Narayanan Kasthuri, Do ˘ga Gürsoy, and Francesco De Carlo. Distributed optimization for nonrigid nano-tomography. IEEE Transactions on Computational Imaging, 7:272–287, 2021

work page 2021
[4]

Quantifying mesoscale neuroanatomy using x-ray microtomography

Eva L Dyer, William Gray Roncal, Judy A Prasad, Hugo L Fernandes, Doga Gürsoy, Vincent De Andrade, Kamel Fezzaa, Xianghui Xiao, Joshua T V ogelstein, Chris Jacobsen, et al. Quantifying mesoscale neuroanatomy using x-ray microtomography. eneuro, 4(5), 2017

work page 2017
[5]

Petascale xct: 3d image reconstruction with hierarchical communications on multi-gpu nodes

Mert Hidayeto˘glu, Tekin Bicer, Simon Garcia De Gonzalo, Bin Ren, Vincent De Andrade, Doga Gursoy, Raj Kettimuthu, Ian T Foster, and Wen-mei W Hwu. Petascale xct: 3d image reconstruction with hierarchical communications on multi-gpu nodes. In SC20: International Conference for High Performance Computing, Networking, Storage and Analysis, pages 1–13. IEEE, 2020

work page 2020
[6]

Radiogenomics: what it is and why it is important

Maciej A Mazurowski. Radiogenomics: what it is and why it is important. Journal of the American College of Radiology, 12(8):862–866, 2015

work page 2015
[7]

Denoising diffusion probabilistic models

Jonathan Ho, Ajay Jain, and Pieter Abbeel. Denoising diffusion probabilistic models. Advances in neural information processing systems, 33:6840–6851, 2020

work page 2020
[8]

Deep unsupervised learning using nonequilibrium thermodynamics

Jascha Sohl-Dickstein, Eric Weiss, Niru Maheswaranathan, and Surya Ganguli. Deep unsupervised learning using nonequilibrium thermodynamics. In International conference on machine learning, pages 2256–2265. pmlr, 2015

work page 2015
[9]

Repaint: Inpainting using denoising diffusion probabilistic models

Andreas Lugmayr, Martin Danelljan, Andres Romero, Fisher Yu, Radu Timofte, and Luc Van Gool. Repaint: Inpainting using denoising diffusion probabilistic models. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 11461–11471, 2022

work page 2022
[10]

Palette: Image-to-image diffusion models

Chitwan Saharia, William Chan, Huiwen Chang, Chris Lee, Jonathan Ho, Tim Salimans, David Fleet, and Mohammad Norouzi. Palette: Image-to-image diffusion models. In ACM SIGGRAPH 2022 conference proceedings, pages 1–10, 2022

work page 2022
[11]

Progressive Distillation for Fast Sampling of Diffusion Models

Tim Salimans and Jonathan Ho. Progressive distillation for fast sampling of diffusion models. arXiv preprint arXiv:2202.00512, 2022

work page internal anchor Pith review Pith/arXiv arXiv 2022
[12]

On distillation of guided diffusion models

Chenlin Meng, Robin Rombach, Ruiqi Gao, Diederik Kingma, Stefano Ermon, Jonathan Ho, and Tim Salimans. On distillation of guided diffusion models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 14297–14306, 2023

work page 2023
[13]

Snapfusion: Text-to-image diffusion model on mobile devices within two seconds

Yanyu Li, Huan Wang, Qing Jin, Ju Hu, Pavlo Chemerys, Yun Fu, Yanzhi Wang, Sergey Tulyakov, and Jian Ren. Snapfusion: Text-to-image diffusion model on mobile devices within two seconds. Advances in Neural Information Processing Systems, 36:20662–20678, 2023

work page 2023
[14]

Effortless efficiency: Low-cost pruning of diffusion models

Yang Zhang, Er Jin, Yanfei Dong, Ashkan Khakzar, Philip Torr, Johannes Stegmaier, and Kenji Kawaguchi. Effortless efficiency: Low-cost pruning of diffusion models. arXiv preprint arXiv:2412.02852, 2024

work page arXiv 2024
[15]

Dip-go: A diffusion pruner via few-step gradient optimization

Haowei Zhu, Dehua Tang, Ji Liu, Mingjie Lu, Jintu Zheng, Jinzhang Peng, Dong Li, Yu Wang, Fan Jiang, Lu Tian, et al. Dip-go: A diffusion pruner via few-step gradient optimization. Advances in Neural Information Processing Systems, 37:92581–92604, 2024

work page 2024
[16]

Dpm-solver: A fast ode solver for diffusion probabilistic model sampling in around 10 steps

Cheng Lu, Yuhao Zhou, Fan Bao, Jianfei Chen, Chongxuan Li, and Jun Zhu. Dpm-solver: A fast ode solver for diffusion probabilistic model sampling in around 10 steps. Advances in Neural Information Processing Systems, 35:5775–5787, 2022

work page 2022
[17]

Principles of computerized tomographic imaging

Malcolm Slaney and AC Kak. Principles of computerized tomographic imaging. IEEE press, 1988

work page 1988
[18]

Towards coherent image inpainting using denoising diffusion implicit models

Guanhua Zhang, Jiabao Ji, Yang Zhang, Mo Yu, Tommi S Jaakkola, and Shiyu Chang. Towards coherent image inpainting using denoising diffusion implicit models. 2023

work page 2023
[19]

Blended diffusion for text-driven editing of natural images

Omri Avrahami, Dani Lischinski, and Ohad Fried. Blended diffusion for text-driven editing of natural images. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 18208–18218, 2022. 11

work page 2022
[20]

Plug-and-play diffusion features for text- driven image-to-image translation

Narek Tumanyan, Michal Geyer, Shai Bagon, and Tali Dekel. Plug-and-play diffusion features for text- driven image-to-image translation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1921–1930, 2023

work page 1921
[21]

Adding conditional control to text-to-image diffusion models

Lvmin Zhang, Anyi Rao, and Maneesh Agrawala. Adding conditional control to text-to-image diffusion models. In Proceedings of the IEEE/CVF international conference on computer vision, pages 3836–3847, 2023

work page 2023
[22]

A mathematical theory of communication

Claude E Shannon. A mathematical theory of communication. The Bell system technical journal , 27(3):379–423, 1948

work page 1948
[23]

Deep-neural-network- based sinogram synthesis for sparse-view ct image reconstruction

Hoyeon Lee, Jongha Lee, Hyeongseok Kim, Byungchul Cho, and Seungryong Cho. Deep-neural-network- based sinogram synthesis for sparse-view ct image reconstruction. IEEE Transactions on Radiation and Plasma Medical Sciences, 3(2):109–119, 2018

work page 2018
[24]

Deep convolutional neural network for inverse problems in imaging

Kyong Hwan Jin, Michael T McCann, Emmanuel Froustey, and Michael Unser. Deep convolutional neural network for inverse problems in imaging. IEEE transactions on image processing, 26(9):4509–4522, 2017

work page 2017
[25]

Fastcomposer: Tuning- free multi-subject image generation with localized attention

Guangxuan Xiao, Tianwei Yin, William T Freeman, Frédo Durand, and Song Han. Fastcomposer: Tuning- free multi-subject image generation with localized attention. International Journal of Computer Vision, pages 1–20, 2024

work page 2024
[26]

Hidiffusion: Unlock- ing higher-resolution creativity and efficiency in pretrained diffusion models

Shen Zhang, Zhaowei Chen, Zhenyu Zhao, Yuhao Chen, Yao Tang, and Jiajun Liang. Hidiffusion: Unlock- ing higher-resolution creativity and efficiency in pretrained diffusion models. In European Conference on Computer Vision, pages 145–161. Springer, 2024

work page 2024
[27]

Diffir: Efficient diffusion model for image restoration

Bin Xia, Yulun Zhang, Shiyin Wang, Yitong Wang, Xinglong Wu, Yapeng Tian, Wenming Yang, and Luc Van Gool. Diffir: Efficient diffusion model for image restoration. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 13095–13105, 2023

work page 2023
[28]

Blended latent diffusion

Omri Avrahami, Ohad Fried, and Dani Lischinski. Blended latent diffusion. ACM transactions on graphics (TOG), 42(4):1–11, 2023

work page 2023
[29]

Training Deep Nets with Sublinear Memory Cost

Tianqi Chen, Bing Xu, Chiyuan Zhang, and Carlos Guestrin. Training deep nets with sublinear memory cost. arXiv preprint arXiv:1604.06174, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016
[30]

Checkmate: Breaking the memory wall with optimal tensor rematerialization

Paras Jain, Ajay Jain, Aniruddha Nrusimha, Amir Gholami, Pieter Abbeel, Joseph Gonzalez, Kurt Keutzer, and Ion Stoica. Checkmate: Breaking the memory wall with optimal tensor rematerialization. Proceedings of Machine Learning and Systems, 2:497–511, 2020

work page 2020
[31]

Fully dynamic inference with deep neural networks

Wenhan Xia, Hongxu Yin, Xiaoliang Dai, and Niraj K Jha. Fully dynamic inference with deep neural networks. IEEE Transactions on Emerging Topics in Computing, 10(2):962–972, 2021

work page 2021
[32]

Mest: Accurate and fast memory-economic sparse training framework on the edge

Geng Yuan, Xiaolong Ma, Wei Niu, Zhengang Li, Zhenglun Kong, Ning Liu, Yifan Gong, Zheng Zhan, Chaoyang He, Qing Jin, et al. Mest: Accurate and fast memory-economic sparse training framework on the edge. Advances in Neural Information Processing Systems, 34:20838–20850, 2021

work page 2021
[33]

Fcdm: A physics- guided bidirectional frequency aware convolution and diffusion-based model for sinogram inpainting, 2025

Jiaze E, Srutarshi Banerjee, Tekin Bicer, Guannan Wang, Yanfu Zhang, and Bin Ren. Fcdm: A physics- guided bidirectional frequency aware convolution and diffusion-based model for sinogram inpainting, 2025

work page 2025
[34]

Denoising Diffusion Implicit Models

Jiaming Song, Chenlin Meng, and Stefano Ermon. Denoising diffusion implicit models. arXiv preprint arXiv:2010.02502, 2020

work page internal anchor Pith review Pith/arXiv arXiv 2010
[35]

Masked autoencoders are scalable vision learners

Kaiming He, Xinlei Chen, Saining Xie, Yanghao Li, Piotr Dollár, and Ross Girshick. Masked autoencoders are scalable vision learners. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 16000–16009, 2022

work page 2022
[36]

Tomobank: a tomographic data repository for computational x-ray science

Francesco De Carlo, Do˘ga Gürsoy, Daniel J Ching, K Joost Batenburg, Wolfgang Ludwig, Lucia Mancini, Federica Marone, Rajmund Mokso, Daniël M Pelt, Jan Sijbers, et al. Tomobank: a tomographic data repository for computational x-ray science. Measurement Science and Technology, 29(3):034004, 2018

work page 2018
[37]

Timbir: A method for time-space reconstruction from interlaced views

K Aditya Mohan, SV Venkatakrishnan, John W Gibbs, Emine Begum Gulsoy, Xianghui Xiao, Marc De Graef, Peter W V oorhees, and Charles A Bouman. Timbir: A method for time-space reconstruction from interlaced views. IEEE Transactions on Computational Imaging, 1(2):96–111, 2015

work page 2015
[38]

Fast tomographic reconstruction from limited data using artificial neural networks

Daniel Maria Pelt and Kees Joost Batenburg. Fast tomographic reconstruction from limited data using artificial neural networks. IEEE Transactions on Image Processing, 22(12):5238–5251, 2013. 12

work page 2013
[39]

scikit-image: image processing in python

Stefan Van der Walt, Johannes L Schönberger, Juan Nunez-Iglesias, François Boulogne, Joshua D Warner, Neil Yager, Emmanuelle Gouillart, and Tony Yu. scikit-image: image processing in python. PeerJ, 2:e453, 2014

work page 2014
[40]

Tomopy: a framework for the analysis of synchrotron tomographic data

Doga Gürsoy, Francesco De Carlo, Xianghui Xiao, and Chris Jacobsen. Tomopy: a framework for the analysis of synchrotron tomographic data. Journal of synchrotron radiation, 21(5):1188–1193, 2014

work page 2014
[41]

Image quality assessment: from error visibility to structural similarity

Zhou Wang, Alan C Bovik, Hamid R Sheikh, and Eero P Simoncelli. Image quality assessment: from error visibility to structural similarity. IEEE transactions on image processing, 13(4):600–612, 2004

work page 2004
[42]

High-resolution image synthesis with latent diffusion models

Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, and Björn Ommer. High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 10684–10695, 2022

work page 2022
[43]

Regridding reconstruction algorithm for real-time tomographic imaging

F Marone and M Stampanoni. Regridding reconstruction algorithm for real-time tomographic imaging. Synchrotron Radiation, 19(6):1029–1037, 2012. 13 A Generalization to Other Diffusion-based Inpainting Models Unless stressed, all experimental settings in Appendix—-including hardware, inference configurations, PyTorch optimizations, sampling steps, evaluatio...

work page 2012

[1] [1]

Computed tomography: fundamentals, system technology, image quality, applications

Willi A Kalender. Computed tomography: fundamentals, system technology, image quality, applications. John Wiley & Sons, 2011

work page 2011

[2] [2]

Suppressing strain propagation in ultrahigh-ni cathodes during fast charging via epitaxial entropy-assisted coating

Chen Zhao, Chuanwei Wang, Xiang Liu, Inhui Hwang, Tianyi Li, Xinwei Zhou, Jiecheng Diao, Junjing Deng, Yan Qin, Zhenzhen Yang, et al. Suppressing strain propagation in ultrahigh-ni cathodes during fast charging via epitaxial entropy-assisted coating. Nature Energy, 9(3):345–356, 2024

work page 2024

[3] [3]

Distributed optimization for nonrigid nano-tomography

Viktor Nikitin, Vincent De Andrade, Azat Slyamov, Benjamin J Gould, Yuepeng Zhang, Vandana Sam- pathkumar, Narayanan Kasthuri, Do ˘ga Gürsoy, and Francesco De Carlo. Distributed optimization for nonrigid nano-tomography. IEEE Transactions on Computational Imaging, 7:272–287, 2021

work page 2021

[4] [4]

Quantifying mesoscale neuroanatomy using x-ray microtomography

Eva L Dyer, William Gray Roncal, Judy A Prasad, Hugo L Fernandes, Doga Gürsoy, Vincent De Andrade, Kamel Fezzaa, Xianghui Xiao, Joshua T V ogelstein, Chris Jacobsen, et al. Quantifying mesoscale neuroanatomy using x-ray microtomography. eneuro, 4(5), 2017

work page 2017

[5] [5]

Petascale xct: 3d image reconstruction with hierarchical communications on multi-gpu nodes

Mert Hidayeto˘glu, Tekin Bicer, Simon Garcia De Gonzalo, Bin Ren, Vincent De Andrade, Doga Gursoy, Raj Kettimuthu, Ian T Foster, and Wen-mei W Hwu. Petascale xct: 3d image reconstruction with hierarchical communications on multi-gpu nodes. In SC20: International Conference for High Performance Computing, Networking, Storage and Analysis, pages 1–13. IEEE, 2020

work page 2020

[6] [6]

Radiogenomics: what it is and why it is important

Maciej A Mazurowski. Radiogenomics: what it is and why it is important. Journal of the American College of Radiology, 12(8):862–866, 2015

work page 2015

[7] [7]

Denoising diffusion probabilistic models

Jonathan Ho, Ajay Jain, and Pieter Abbeel. Denoising diffusion probabilistic models. Advances in neural information processing systems, 33:6840–6851, 2020

work page 2020

[8] [8]

Deep unsupervised learning using nonequilibrium thermodynamics

Jascha Sohl-Dickstein, Eric Weiss, Niru Maheswaranathan, and Surya Ganguli. Deep unsupervised learning using nonequilibrium thermodynamics. In International conference on machine learning, pages 2256–2265. pmlr, 2015

work page 2015

[9] [9]

Repaint: Inpainting using denoising diffusion probabilistic models

Andreas Lugmayr, Martin Danelljan, Andres Romero, Fisher Yu, Radu Timofte, and Luc Van Gool. Repaint: Inpainting using denoising diffusion probabilistic models. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 11461–11471, 2022

work page 2022

[10] [10]

Palette: Image-to-image diffusion models

Chitwan Saharia, William Chan, Huiwen Chang, Chris Lee, Jonathan Ho, Tim Salimans, David Fleet, and Mohammad Norouzi. Palette: Image-to-image diffusion models. In ACM SIGGRAPH 2022 conference proceedings, pages 1–10, 2022

work page 2022

[11] [11]

Progressive Distillation for Fast Sampling of Diffusion Models

Tim Salimans and Jonathan Ho. Progressive distillation for fast sampling of diffusion models. arXiv preprint arXiv:2202.00512, 2022

work page internal anchor Pith review Pith/arXiv arXiv 2022

[12] [12]

On distillation of guided diffusion models

Chenlin Meng, Robin Rombach, Ruiqi Gao, Diederik Kingma, Stefano Ermon, Jonathan Ho, and Tim Salimans. On distillation of guided diffusion models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 14297–14306, 2023

work page 2023

[13] [13]

Snapfusion: Text-to-image diffusion model on mobile devices within two seconds

Yanyu Li, Huan Wang, Qing Jin, Ju Hu, Pavlo Chemerys, Yun Fu, Yanzhi Wang, Sergey Tulyakov, and Jian Ren. Snapfusion: Text-to-image diffusion model on mobile devices within two seconds. Advances in Neural Information Processing Systems, 36:20662–20678, 2023

work page 2023

[14] [14]

Effortless efficiency: Low-cost pruning of diffusion models

Yang Zhang, Er Jin, Yanfei Dong, Ashkan Khakzar, Philip Torr, Johannes Stegmaier, and Kenji Kawaguchi. Effortless efficiency: Low-cost pruning of diffusion models. arXiv preprint arXiv:2412.02852, 2024

work page arXiv 2024

[15] [15]

Dip-go: A diffusion pruner via few-step gradient optimization

Haowei Zhu, Dehua Tang, Ji Liu, Mingjie Lu, Jintu Zheng, Jinzhang Peng, Dong Li, Yu Wang, Fan Jiang, Lu Tian, et al. Dip-go: A diffusion pruner via few-step gradient optimization. Advances in Neural Information Processing Systems, 37:92581–92604, 2024

work page 2024

[16] [16]

Dpm-solver: A fast ode solver for diffusion probabilistic model sampling in around 10 steps

Cheng Lu, Yuhao Zhou, Fan Bao, Jianfei Chen, Chongxuan Li, and Jun Zhu. Dpm-solver: A fast ode solver for diffusion probabilistic model sampling in around 10 steps. Advances in Neural Information Processing Systems, 35:5775–5787, 2022

work page 2022

[17] [17]

Principles of computerized tomographic imaging

Malcolm Slaney and AC Kak. Principles of computerized tomographic imaging. IEEE press, 1988

work page 1988

[18] [18]

Towards coherent image inpainting using denoising diffusion implicit models

Guanhua Zhang, Jiabao Ji, Yang Zhang, Mo Yu, Tommi S Jaakkola, and Shiyu Chang. Towards coherent image inpainting using denoising diffusion implicit models. 2023

work page 2023

[19] [19]

Blended diffusion for text-driven editing of natural images

Omri Avrahami, Dani Lischinski, and Ohad Fried. Blended diffusion for text-driven editing of natural images. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 18208–18218, 2022. 11

work page 2022

[20] [20]

Plug-and-play diffusion features for text- driven image-to-image translation

Narek Tumanyan, Michal Geyer, Shai Bagon, and Tali Dekel. Plug-and-play diffusion features for text- driven image-to-image translation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1921–1930, 2023

work page 1921

[21] [21]

Adding conditional control to text-to-image diffusion models

Lvmin Zhang, Anyi Rao, and Maneesh Agrawala. Adding conditional control to text-to-image diffusion models. In Proceedings of the IEEE/CVF international conference on computer vision, pages 3836–3847, 2023

work page 2023

[22] [22]

A mathematical theory of communication

Claude E Shannon. A mathematical theory of communication. The Bell system technical journal , 27(3):379–423, 1948

work page 1948

[23] [23]

Deep-neural-network- based sinogram synthesis for sparse-view ct image reconstruction

Hoyeon Lee, Jongha Lee, Hyeongseok Kim, Byungchul Cho, and Seungryong Cho. Deep-neural-network- based sinogram synthesis for sparse-view ct image reconstruction. IEEE Transactions on Radiation and Plasma Medical Sciences, 3(2):109–119, 2018

work page 2018

[24] [24]

Deep convolutional neural network for inverse problems in imaging

Kyong Hwan Jin, Michael T McCann, Emmanuel Froustey, and Michael Unser. Deep convolutional neural network for inverse problems in imaging. IEEE transactions on image processing, 26(9):4509–4522, 2017

work page 2017

[25] [25]

Fastcomposer: Tuning- free multi-subject image generation with localized attention

Guangxuan Xiao, Tianwei Yin, William T Freeman, Frédo Durand, and Song Han. Fastcomposer: Tuning- free multi-subject image generation with localized attention. International Journal of Computer Vision, pages 1–20, 2024

work page 2024

[26] [26]

Hidiffusion: Unlock- ing higher-resolution creativity and efficiency in pretrained diffusion models

Shen Zhang, Zhaowei Chen, Zhenyu Zhao, Yuhao Chen, Yao Tang, and Jiajun Liang. Hidiffusion: Unlock- ing higher-resolution creativity and efficiency in pretrained diffusion models. In European Conference on Computer Vision, pages 145–161. Springer, 2024

work page 2024

[27] [27]

Diffir: Efficient diffusion model for image restoration

Bin Xia, Yulun Zhang, Shiyin Wang, Yitong Wang, Xinglong Wu, Yapeng Tian, Wenming Yang, and Luc Van Gool. Diffir: Efficient diffusion model for image restoration. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 13095–13105, 2023

work page 2023

[28] [28]

Blended latent diffusion

Omri Avrahami, Ohad Fried, and Dani Lischinski. Blended latent diffusion. ACM transactions on graphics (TOG), 42(4):1–11, 2023

work page 2023

[29] [29]

Training Deep Nets with Sublinear Memory Cost

Tianqi Chen, Bing Xu, Chiyuan Zhang, and Carlos Guestrin. Training deep nets with sublinear memory cost. arXiv preprint arXiv:1604.06174, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016

[30] [30]

Checkmate: Breaking the memory wall with optimal tensor rematerialization

Paras Jain, Ajay Jain, Aniruddha Nrusimha, Amir Gholami, Pieter Abbeel, Joseph Gonzalez, Kurt Keutzer, and Ion Stoica. Checkmate: Breaking the memory wall with optimal tensor rematerialization. Proceedings of Machine Learning and Systems, 2:497–511, 2020

work page 2020

[31] [31]

Fully dynamic inference with deep neural networks

Wenhan Xia, Hongxu Yin, Xiaoliang Dai, and Niraj K Jha. Fully dynamic inference with deep neural networks. IEEE Transactions on Emerging Topics in Computing, 10(2):962–972, 2021

work page 2021

[32] [32]

Mest: Accurate and fast memory-economic sparse training framework on the edge

Geng Yuan, Xiaolong Ma, Wei Niu, Zhengang Li, Zhenglun Kong, Ning Liu, Yifan Gong, Zheng Zhan, Chaoyang He, Qing Jin, et al. Mest: Accurate and fast memory-economic sparse training framework on the edge. Advances in Neural Information Processing Systems, 34:20838–20850, 2021

work page 2021

[33] [33]

Fcdm: A physics- guided bidirectional frequency aware convolution and diffusion-based model for sinogram inpainting, 2025

Jiaze E, Srutarshi Banerjee, Tekin Bicer, Guannan Wang, Yanfu Zhang, and Bin Ren. Fcdm: A physics- guided bidirectional frequency aware convolution and diffusion-based model for sinogram inpainting, 2025

work page 2025

[34] [34]

Denoising Diffusion Implicit Models

Jiaming Song, Chenlin Meng, and Stefano Ermon. Denoising diffusion implicit models. arXiv preprint arXiv:2010.02502, 2020

work page internal anchor Pith review Pith/arXiv arXiv 2010

[35] [35]

Masked autoencoders are scalable vision learners

Kaiming He, Xinlei Chen, Saining Xie, Yanghao Li, Piotr Dollár, and Ross Girshick. Masked autoencoders are scalable vision learners. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 16000–16009, 2022

work page 2022

[36] [36]

Tomobank: a tomographic data repository for computational x-ray science

Francesco De Carlo, Do˘ga Gürsoy, Daniel J Ching, K Joost Batenburg, Wolfgang Ludwig, Lucia Mancini, Federica Marone, Rajmund Mokso, Daniël M Pelt, Jan Sijbers, et al. Tomobank: a tomographic data repository for computational x-ray science. Measurement Science and Technology, 29(3):034004, 2018

work page 2018

[37] [37]

Timbir: A method for time-space reconstruction from interlaced views

K Aditya Mohan, SV Venkatakrishnan, John W Gibbs, Emine Begum Gulsoy, Xianghui Xiao, Marc De Graef, Peter W V oorhees, and Charles A Bouman. Timbir: A method for time-space reconstruction from interlaced views. IEEE Transactions on Computational Imaging, 1(2):96–111, 2015

work page 2015

[38] [38]

Fast tomographic reconstruction from limited data using artificial neural networks

Daniel Maria Pelt and Kees Joost Batenburg. Fast tomographic reconstruction from limited data using artificial neural networks. IEEE Transactions on Image Processing, 22(12):5238–5251, 2013. 12

work page 2013

[39] [39]

scikit-image: image processing in python

Stefan Van der Walt, Johannes L Schönberger, Juan Nunez-Iglesias, François Boulogne, Joshua D Warner, Neil Yager, Emmanuelle Gouillart, and Tony Yu. scikit-image: image processing in python. PeerJ, 2:e453, 2014

work page 2014

[40] [40]

Tomopy: a framework for the analysis of synchrotron tomographic data

Doga Gürsoy, Francesco De Carlo, Xianghui Xiao, and Chris Jacobsen. Tomopy: a framework for the analysis of synchrotron tomographic data. Journal of synchrotron radiation, 21(5):1188–1193, 2014

work page 2014

[41] [41]

Image quality assessment: from error visibility to structural similarity

Zhou Wang, Alan C Bovik, Hamid R Sheikh, and Eero P Simoncelli. Image quality assessment: from error visibility to structural similarity. IEEE transactions on image processing, 13(4):600–612, 2004

work page 2004

[42] [42]

High-resolution image synthesis with latent diffusion models

Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, and Björn Ommer. High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 10684–10695, 2022

work page 2022

[43] [43]

Regridding reconstruction algorithm for real-time tomographic imaging

F Marone and M Stampanoni. Regridding reconstruction algorithm for real-time tomographic imaging. Synchrotron Radiation, 19(6):1029–1037, 2012. 13 A Generalization to Other Diffusion-based Inpainting Models Unless stressed, all experimental settings in Appendix—-including hardware, inference configurations, PyTorch optimizations, sampling steps, evaluatio...

work page 2012