Unrolling Graph-based Douglas-Rachford Algorithm for Image Interpolation with Informed Initialization

Bingshuo Hu; Gene Cheung; Xue Zhang

arxiv: 2509.11926 · v4 · submitted 2025-09-15 · 💻 cs.CV

Unrolling Graph-based Douglas-Rachford Algorithm for Image Interpolation with Informed Initialization

Xue Zhang , Bingshuo Hu , Gene Cheung This is my paper

Pith reviewed 2026-05-18 16:53 UTC · model grok-4.3

classification 💻 cs.CV

keywords image interpolationDouglas-Rachford algorithmgraph filtersunrolled neural networksinformed initializationgraph shift variationimage restoration

0 comments

The pith

A known linear interpolator initializes a directed graph adjacency matrix that, after learned perturbations and unrolled Douglas-Rachford steps, produces state-of-the-art image interpolation with far fewer parameters.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper shows how to build a neural network for image interpolation by starting from a mathematical mapping that turns any linear interpolator into a graph filter solving a maximum a posteriori problem. This provides a strong baseline by initializing the graph's adjacency matrix directly from the interpolator. The network then learns small perturbations to this matrix and applies them through unrolled Douglas-Rachford iterations. Experiments across different interpolation tasks show this approach matches or exceeds current best methods while using drastically fewer parameters and less computation during inference.

Core claim

By leveraging a theorem that equates a pseudo-linear interpolator to a directed graph filter for a graph shift variation prior in a MAP estimation framework, the method initializes a directed graph adjacency matrix A from a known interpolator Θ. It then learns data-driven perturbation matrices P and P² to augment A, with the restoration effects realized through unrolled Douglas-Rachford iterations, resulting in a lightweight interpretable neural network that achieves state-of-the-art performance on image interpolation tasks.

What carries the argument

Unrolled Douglas-Rachford iterations applied to a perturbed directed graph adjacency matrix, where the initial matrix is obtained by mapping a known interpolator to a graph filter via the graph shift variation prior.

If this is right

The informed initialization from a known interpolator lowers the risk of poor local minima during training.
The resulting network reaches state-of-the-art interpolation quality on multiple scenarios.
Both the total number of network parameters and the computational cost at inference time decrease sharply.
The graph-based formulation keeps a degree of interpretability in how the restoration is performed.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same informed-initialization pattern could be tested on other inverse problems such as denoising or inpainting once suitable baseline interpolators are identified.
Viewing the learned network through the lens of graph signal processing may reveal new ways to verify stability or frequency response of the restoration.
The drastic parameter reduction suggests the approach could support real-time interpolation on mobile or embedded hardware.

Load-bearing premise

The theorem correctly maps any pseudo-linear interpolator to a directed graph filter that solves the MAP problem with graph shift variation prior, so that learned perturbations to the adjacency matrix deliver measurable restoration gains.

What would settle it

An experiment that removes the learned perturbation matrices, runs the remaining unrolled iterations on the same image interpolation benchmarks, and finds no drop or even higher performance would falsify the claimed benefit of the data-driven perturbations.

read the original abstract

Conventional deep neural nets (DNNs) initialize network parameters at random and then optimize each one via stochastic gradient descent (SGD), resulting in substantial risk of poor-performing local minima. Focusing on image interpolation and leveraging a recent theorem that maps a (pseudo-)linear interpolator {\Theta} to a directed graph filter that is a solution to a corresponding MAP problem with a graph shift variation (GSV) prior, we first initialize a directed graph adjacency matrix A given a known interpolator {\Theta}, establishing a baseline performance. Then, towards further gain, we learn perturbation matrices P and P(2) from data to augment A, whose restoration effects are implemented progressively via Douglas-Rachford (DR) iterations, which we unroll into a lightweight and interpretable neural net. Experiments on different image interpolation scenarios demonstrate state-of-the-art performance, while drastically reducing network parameters and inference complexity.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper maps a known interpolator to a graph adjacency matrix via a cited theorem, adds learned perturbations, and unrolls Douglas-Rachford iterations into a small network that claims SOTA image interpolation with far fewer parameters.

read the letter

The main point is that they take an existing interpolator, use a recent theorem to turn it into a directed graph filter that solves the MAP problem with a graph shift variation prior, initialize the adjacency matrix from that, learn two small perturbation matrices on top, and unroll the Douglas-Rachford steps into a lightweight network. This gives a model that starts from a reasonable baseline instead of random weights and reportedly delivers better restoration with much lower parameter count and inference cost across several interpolation scenarios.

Referee Report

1 major / 3 minor

Summary. The manuscript proposes an image interpolation method that initializes a directed graph adjacency matrix A from a known (pseudo-)linear interpolator Θ via a recent theorem mapping it to a graph filter solving the MAP problem under a graph shift variation (GSV) prior. Perturbation matrices P and P(2) are then learned from data to augment A, with their effects realized through unrolled Douglas-Rachford iterations forming a lightweight neural network. Experiments across image interpolation scenarios claim state-of-the-art performance alongside substantial reductions in network parameters and inference complexity.

Significance. If the theorem holds for the employed interpolators and the unrolled iterations faithfully implement the perturbed graph filter, the work offers a principled, interpretable alternative to random initialization in DNNs for restoration tasks. The informed initialization and parameter efficiency constitute a clear strength, potentially advancing lightweight, domain-informed unrolling methods in graph signal processing and computer vision.

major comments (1)

[§3 (Proposed Method, theorem application and initialization of A)] The central claim that informed initialization from the theorem yields interpretable gains beyond generic unrolling rests on the theorem producing an exact MAP solution under the GSV prior for the specific Θ and discrete image operators. The manuscript invokes this mapping to establish baseline performance before learning P and P(2), but provides no explicit verification that the theorem's assumptions (linearity, shift-invariance, exact equivalence) hold on the image grid; if they do not, measured improvements cannot be attributed to the graph construction rather than added degrees of freedom in the DR unrolling.

minor comments (3)

[Abstract] The abstract refers to 'a recent theorem' without a citation; add the reference and a brief statement of its scope.
[§3.3 (Unrolling and network architecture)] Clarify the precise roles of the two perturbation matrices P and P(2) in the unrolled DR iterations and how they are optimized jointly.
[§4 (Experiments)] In the experimental section, report the number of parameters and FLOPs for all compared methods on the same hardware to substantiate the 'drastically reducing' claim.

Simulated Author's Rebuttal

1 responses · 0 unresolved

Thank you for your thorough review and positive assessment of our work. We are pleased that you recognize the potential of our informed initialization approach. Below, we address your major comment point by point.

read point-by-point responses

Referee: [§3 (Proposed Method, theorem application and initialization of A)] The central claim that informed initialization from the theorem yields interpretable gains beyond generic unrolling rests on the theorem producing an exact MAP solution under the GSV prior for the specific Θ and discrete image operators. The manuscript invokes this mapping to establish baseline performance before learning P and P(2), but provides no explicit verification that the theorem's assumptions (linearity, shift-invariance, exact equivalence) hold on the image grid; if they do not, measured improvements cannot be attributed to the graph construction rather than added degrees of freedom in the DR unrolling.

Authors: We thank the referee for highlighting this important point. The theorem we invoke is designed for pseudo-linear interpolators Θ, and the ones used in our experiments (e.g., nearest-neighbor, bilinear) satisfy linearity on the discrete pixel grid. Regarding shift-invariance, our graph construction assumes a regular grid structure with consistent neighborhood definitions, which aligns with the theorem's requirements for the GSV prior. Exact equivalence follows from the mapping provided in the referenced theorem. However, to make this explicit and address the concern about attribution of gains, we will revise the manuscript to include a dedicated paragraph or short subsection in §3 verifying these assumptions hold for our setup, perhaps with a small-scale example or reference to the theorem's proof applicability. This revision will ensure readers can confidently attribute the baseline to the informed graph initialization. revision: yes

Circularity Check

1 steps flagged

Central claim depends on unverified applicability of the cited theorem mapping any (pseudo-)linear interpolator Θ exactly to a directed graph filter solving the MAP problem with GSV prior.

specific steps

self citation load bearing [Abstract]
"leveraging a recent theorem that maps a (pseudo-)linear interpolator {Θ} to a directed graph filter that is a solution to a corresponding MAP problem with a graph shift variation (GSV) prior, we first initialize a directed graph adjacency matrix A given a known interpolator {Θ}, establishing a baseline performance."

The theorem directly supplies the mapping from Θ to graph filter A that solves the MAP problem, which is then used as the informed baseline before learning perturbations. If the cited theorem originates from overlapping authors and its assumptions (linearity, exact equivalence on discrete grids) are not re-verified here, the attribution of restoration gains to the 'informed' construction reduces to the self-cited mapping rather than independent derivation.

full rationale

The derivation initializes adjacency matrix A from known interpolator Θ via the recent theorem to establish baseline MAP solution under GSV prior, then augments with learned perturbations P/P(2) realized in unrolled DR iterations. This informed-initialization premise is load-bearing for interpretability and novelty claims, as experiments attribute gains to the graph construction. The theorem citation lacks explicit independent verification details in the provided text, creating moderate risk that baseline and gains reduce to the mapping itself (pattern 3). However, the unrolling and data-driven learning steps retain independent content, and SOTA experimental results on multiple scenarios provide external falsifiability, so the overall derivation is not fully forced by self-reference. Score kept at 4 rather than higher per proportionality rule.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The central claim rests on an external theorem for graph filter construction and on the empirical effectiveness of learned perturbations; no explicit free parameters are named beyond the learned matrices P and P(2), and no new physical entities are introduced.

free parameters (1)

Perturbation matrices P and P(2)
Learned from data to augment the initialized adjacency matrix A; their values are fitted during training to improve restoration.

axioms (1)

domain assumption A (pseudo-)linear interpolator Θ maps to a directed graph filter solving the MAP problem with graph shift variation prior.
Invoked in the first step to establish baseline performance from known interpolator.

pith-pipeline@v0.9.0 · 5684 in / 1336 out tokens · 42073 ms · 2026-05-18T16:53:37.233075+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Theorem 1: Interpolator [I_M; Θ] is the solution filter to the MAP problem (4) if M=N, Θ is invertible, and A_{M,N}=Θ^{-1}.

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

31 extracted references · 31 canonical work pages · 2 internal anchors

[1]

black box

INTRODUCTION Image interpolation—a well-studied image restoration task— computes missing pixel values at targeted 2D grid locations given observed neighboring image pixels. While early methods include model-based ones such as linear and Bicubic interpolations [1], recent methods are dominated by deep-learning-based models such as SwinIR [2] and Restormer ...

work page
[2]

PRELIMINARIES 2.1. Graph Shift Variation A smooth (consistent) signal x with respect to (w.r.t.) a graph G can be mathematically described in many ways,e.g.,graph Laplacian regularizer(GLR) [11],graph total variation(GTV) [12]. Here, we focus ongraph shift variation(GSV) [16]: R(x) =∥x−Ax∥ 2 2.(1) Using A as agraph shift operator(GSO) [9]—e.g., row-stocha...

work page internal anchor Pith review Pith/arXiv arXiv 2025
[3]

PROBLEM FORMULATION 3.1. Feed-Forward Network Construction Theorem 1 states that, under mild conditions, there is a one-to-one mapping from an interpolator [IM;Θ] to a directed graph filter that is a solution to MAP problem (4). Leveraging Theorem 1, our goal is to first unroll a graph algorithm minimizing (4) to a neural net initialized to Θ, then furthe...

work page
[4]

Ours (Bicubic+)

EXPERIMENTS 4.1. Experimental Setup Experiments were conducted in a Python 3.12 environment, where all models were implemented using PyTorch and trained on NVIDIA GeForce RTX 3090 hardware. To train each learning model, we employed the DIV2K [21] dataset, containing 800 high-res training images and 100 validation images. Due to the high resolution, each t...

work page
[5]

CONCLUSION Leveraging a recent interpolator theorem [15] that maps a pseudo- linear interpolator Θ to a directed graph filter that is a solution to a MAP problem using the graph shift variation (GSV) [16] as signal prior, we build a lightweight and interpretable neural net by unrolling an iterative algorithm solving the MAP problem. The key novelty is tha...

work page
[6]

An introduction to the conjugate gradient method without the agonizing pain,

J. R Shewchuk, “An introduction to the conjugate gradient method without the agonizing pain,” Tech. Rep., USA, 1994

work page 1994
[7]

SwinIR: Image restoration using Swin transformer,

Jingyun Liang, Jiezhang Cao, Guolei Sun, Kai Zhang, Luc Van Gool, and Radu Timofte, “SwinIR: Image restoration using Swin transformer,” inProceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, October 2021, pp. 1833–1844

work page 2021
[8]

Restormer: Efficient transformer for high-resolution image restoration,

Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, and Ming-Hsuan Yang, “Restormer: Efficient transformer for high-resolution image restoration,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 5728– 5739

work page 2022
[9]

ImageNet classification with deep convolutional neural networks,

Alex Krizhevsky, I. Sutskever, and G. Hinton, “ImageNet classification with deep convolutional neural networks,” in NIPS, 2012

work page 2012
[10]

Attention is all you need,

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Lukasz Kaiser, and Illia Polosukhin, “Attention is all you need,”Advances in neural information processing systems, vol. 30, 2017

work page 2017
[11]

Why are adaptive methods good for attention models?,

Jingzhao Zhang, Sai Praneeth Karimireddy, Andreas Veit, Seungyeon Kim, Sashank Reddi, Sanjiv Kumar, and Suvrit Sra, “Why are adaptive methods good for attention models?,” inAdvances in Neural Information Processing Systems, H. Larochelle, M. Ranzato, R. Hadsell, M.F. Balcan, and H. Lin, Eds. 2020, vol. 33, pp. 15383–15393, Curran Associates, Inc

work page 2020
[12]

Algorithm unrolling: Interpretable, efficient deep learning for signal and image processing,

Vishal Monga, Yuelong Li, and Yonina C. Eldar, “Algorithm unrolling: Interpretable, efficient deep learning for signal and image processing,”IEEE Signal Processing Magazine, vol. 38, no. 2, pp. 18–44, 2021

work page 2021
[13]

White- box transformers via sparse rate reduction,

Yaodong Yu, Sam Buchanan, Druv Pai, Tianzhe Chu, Ziyang Wu, Shengbang Tong, Benjamin Haeffele, and Yi Ma, “White- box transformers via sparse rate reduction,” inAdvances in Neural Information Processing Systems, A. Oh, T. Neumann, A. Globerson, K. Saenko, M. Hardt, and S. Levine, Eds. 2023, vol. 36, pp. 9422–9457, Curran Associates, Inc

work page 2023
[14]

Graph signal processing: Overview, challenges, and applications,

A. Ortega, P. Frossard, J. Kovacevic, J. M. F. Moura, and P. Vandergheynst, “Graph signal processing: Overview, challenges, and applications,” inProceedings of the IEEE, May 2018, vol. 106, no.5, pp. 808–828

work page 2018
[15]

Graph spectral image processing,

G. Cheung, E. Magli, Y . Tanaka, and M. Ng, “Graph spectral image processing,” inProceedings of the IEEE, May 2018, vol. 106, no.5, pp. 907–930

work page 2018
[16]

Graph Laplacian regularization for image denoising: Analysis in the continuous domain,

Jiahao Pang and Gene Cheung, “Graph Laplacian regularization for image denoising: Analysis in the continuous domain,”IEEE Transactions on Image Processing, vol. 26, no. 4, pp. 1770–1785, 2017

work page 2017
[17]

Graph-based blind image deblurring from a single photograph,

Y . Bai, G. Cheung, X. Liu, and W. Gao, “Graph-based blind image deblurring from a single photograph,”IEEE Transactions on Image Processing, vol. 28, no.3, pp. 1404– 1418, 2019

work page 2019
[18]

Interpretable lightweight transformer via unrolling of learned graph smoothness priors,

VIET HO TAM THUC DO, Parham Eftekhar, Seyed Alireza Hosseini, Gene Cheung, and Philip Chou, “Interpretable lightweight transformer via unrolling of learned graph smoothness priors,” inAdvances in Neural Information Processing Systems, A. Globerson, L. Mackey, D. Belgrave, A. Fan, U. Paquet, J. Tomczak, and C. Zhang, Eds. 2024, vol. 37, pp. 6393–6416, Curr...

work page 2024
[19]

Neural Machine Translation by Jointly Learning to Align and Translate

Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio, “Neural machine translation by jointly learning to align and translate,”CoRR, vol. abs/1409.0473, 2014

work page internal anchor Pith review Pith/arXiv arXiv 2014
[20]

Mixed graph signal analysis of joint image denoising / interpolation,

Niruhan Viswarupan, Gene Cheung, Fengbo Lan, and Michael S. Brown, “Mixed graph signal analysis of joint image denoising / interpolation,” inICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2024, pp. 9431–9435

work page 2024
[21]

The little engine that could: Regularization by denoising (RED),

Michael Elad Yaniv Romano and Peyman Milanfar, “The little engine that could: Regularization by denoising (RED),” in SIAM Journal on Imaging Sciences, 2017, vol. 10, no.4, pp. 1804–1844

work page 2017
[22]

Linear convergence and metric selection for Douglas-Rachford splitting and ADMM,

Pontus Giselsson and Stephen Boyd, “Linear convergence and metric selection for Douglas-Rachford splitting and ADMM,” IEEE Transactions on Automatic Control, vol. 62, no. 2, pp. 532–544, 2017

work page 2017
[23]

Bi-cgstab: A fast and smoothly converging variant of bi-cg for the solution of nonsymmetric linear systems,

Henk A. Van Der V orst, “Bi-cgstab: A fast and smoothly converging variant of bi-cg for the solution of nonsymmetric linear systems,”SIAM Journal on Scientific and Statistical Computing, vol. 13, pp. 631, 1992

work page 1992
[24]

Methods of conjugate gradients for solving linear systems,

Magnus R Hestenes and E. L. Stiefel, “Methods of conjugate gradients for solving linear systems,”Journal of Research of the National Bureau of Standards (United States), vol. 49, 1952

work page 1952
[25]

Graph algorithm unrolling with douglas-rachford iterations for image interpolation with guaranteed initialization,

Xue Zhang, Bingshuo Hu, and Gene Cheung, “Graph algorithm unrolling with douglas-rachford iterations for image interpolation with guaranteed initialization,”arXiv preprint URL https://arxiv.org/abs/2509.1192, 2025

work page arXiv 2025
[26]

Ntire 2017 challenge on single image super-resolution: Dataset and study,

Eirikur Agustsson and Radu Timofte, “Ntire 2017 challenge on single image super-resolution: Dataset and study,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2017, pp. 126–135

work page 2017
[27]

Kodak lossless true color image suite (photocd pcd0992),

Eastman Kodak, “Kodak lossless true color image suite (photocd pcd0992),”URL http://r0k. us/graphics/kodak, vol. 6, pp. 2, 1993

work page 1993
[28]

Single image super-resolution from transformed self-exemplars,

Jia-Bin Huang, Abhishek Singh, and Narendra Ahuja, “Single image super-resolution from transformed self-exemplars,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2015, pp. 5197–5206

work page 2015
[29]

Color demosaicking by local directional interpolation and nonlocal adaptive thresholding,

Lei Zhang, Xiaolin Wu, Antoni Buades, and Xin Li, “Color demosaicking by local directional interpolation and nonlocal adaptive thresholding,”Journal of Electronic imaging, vol. 20, no. 2, pp. 023016–023016, 2011

work page 2011
[30]

Image super-resolution using deep convolutional networks,

Chao Dong, Chen Change Loy, Kaiming He, and Xiaoou Tang, “Image super-resolution using deep convolutional networks,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 38, no. 2, pp. 295–307, 2016

work page 2016
[31]

Image quality assessment: from error visibility to structural similarity,

Zhou Wang, A.C. Bovik, H.R. Sheikh, and E.P. Simoncelli, “Image quality assessment: from error visibility to structural similarity,”IEEE Transactions on Image Processing, vol. 13, no. 4, pp. 600–612, 2004

work page 2004

[1] [1]

black box

INTRODUCTION Image interpolation—a well-studied image restoration task— computes missing pixel values at targeted 2D grid locations given observed neighboring image pixels. While early methods include model-based ones such as linear and Bicubic interpolations [1], recent methods are dominated by deep-learning-based models such as SwinIR [2] and Restormer ...

work page

[2] [2]

PRELIMINARIES 2.1. Graph Shift Variation A smooth (consistent) signal x with respect to (w.r.t.) a graph G can be mathematically described in many ways,e.g.,graph Laplacian regularizer(GLR) [11],graph total variation(GTV) [12]. Here, we focus ongraph shift variation(GSV) [16]: R(x) =∥x−Ax∥ 2 2.(1) Using A as agraph shift operator(GSO) [9]—e.g., row-stocha...

work page internal anchor Pith review Pith/arXiv arXiv 2025

[3] [3]

PROBLEM FORMULATION 3.1. Feed-Forward Network Construction Theorem 1 states that, under mild conditions, there is a one-to-one mapping from an interpolator [IM;Θ] to a directed graph filter that is a solution to MAP problem (4). Leveraging Theorem 1, our goal is to first unroll a graph algorithm minimizing (4) to a neural net initialized to Θ, then furthe...

work page

[4] [4]

Ours (Bicubic+)

EXPERIMENTS 4.1. Experimental Setup Experiments were conducted in a Python 3.12 environment, where all models were implemented using PyTorch and trained on NVIDIA GeForce RTX 3090 hardware. To train each learning model, we employed the DIV2K [21] dataset, containing 800 high-res training images and 100 validation images. Due to the high resolution, each t...

work page

[5] [5]

CONCLUSION Leveraging a recent interpolator theorem [15] that maps a pseudo- linear interpolator Θ to a directed graph filter that is a solution to a MAP problem using the graph shift variation (GSV) [16] as signal prior, we build a lightweight and interpretable neural net by unrolling an iterative algorithm solving the MAP problem. The key novelty is tha...

work page

[6] [6]

An introduction to the conjugate gradient method without the agonizing pain,

J. R Shewchuk, “An introduction to the conjugate gradient method without the agonizing pain,” Tech. Rep., USA, 1994

work page 1994

[7] [7]

SwinIR: Image restoration using Swin transformer,

Jingyun Liang, Jiezhang Cao, Guolei Sun, Kai Zhang, Luc Van Gool, and Radu Timofte, “SwinIR: Image restoration using Swin transformer,” inProceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, October 2021, pp. 1833–1844

work page 2021

[8] [8]

Restormer: Efficient transformer for high-resolution image restoration,

Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, and Ming-Hsuan Yang, “Restormer: Efficient transformer for high-resolution image restoration,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 5728– 5739

work page 2022

[9] [9]

ImageNet classification with deep convolutional neural networks,

Alex Krizhevsky, I. Sutskever, and G. Hinton, “ImageNet classification with deep convolutional neural networks,” in NIPS, 2012

work page 2012

[10] [10]

Attention is all you need,

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Lukasz Kaiser, and Illia Polosukhin, “Attention is all you need,”Advances in neural information processing systems, vol. 30, 2017

work page 2017

[11] [11]

Why are adaptive methods good for attention models?,

Jingzhao Zhang, Sai Praneeth Karimireddy, Andreas Veit, Seungyeon Kim, Sashank Reddi, Sanjiv Kumar, and Suvrit Sra, “Why are adaptive methods good for attention models?,” inAdvances in Neural Information Processing Systems, H. Larochelle, M. Ranzato, R. Hadsell, M.F. Balcan, and H. Lin, Eds. 2020, vol. 33, pp. 15383–15393, Curran Associates, Inc

work page 2020

[12] [12]

Algorithm unrolling: Interpretable, efficient deep learning for signal and image processing,

Vishal Monga, Yuelong Li, and Yonina C. Eldar, “Algorithm unrolling: Interpretable, efficient deep learning for signal and image processing,”IEEE Signal Processing Magazine, vol. 38, no. 2, pp. 18–44, 2021

work page 2021

[13] [13]

White- box transformers via sparse rate reduction,

Yaodong Yu, Sam Buchanan, Druv Pai, Tianzhe Chu, Ziyang Wu, Shengbang Tong, Benjamin Haeffele, and Yi Ma, “White- box transformers via sparse rate reduction,” inAdvances in Neural Information Processing Systems, A. Oh, T. Neumann, A. Globerson, K. Saenko, M. Hardt, and S. Levine, Eds. 2023, vol. 36, pp. 9422–9457, Curran Associates, Inc

work page 2023

[14] [14]

Graph signal processing: Overview, challenges, and applications,

A. Ortega, P. Frossard, J. Kovacevic, J. M. F. Moura, and P. Vandergheynst, “Graph signal processing: Overview, challenges, and applications,” inProceedings of the IEEE, May 2018, vol. 106, no.5, pp. 808–828

work page 2018

[15] [15]

Graph spectral image processing,

G. Cheung, E. Magli, Y . Tanaka, and M. Ng, “Graph spectral image processing,” inProceedings of the IEEE, May 2018, vol. 106, no.5, pp. 907–930

work page 2018

[16] [16]

Graph Laplacian regularization for image denoising: Analysis in the continuous domain,

Jiahao Pang and Gene Cheung, “Graph Laplacian regularization for image denoising: Analysis in the continuous domain,”IEEE Transactions on Image Processing, vol. 26, no. 4, pp. 1770–1785, 2017

work page 2017

[17] [17]

Graph-based blind image deblurring from a single photograph,

Y . Bai, G. Cheung, X. Liu, and W. Gao, “Graph-based blind image deblurring from a single photograph,”IEEE Transactions on Image Processing, vol. 28, no.3, pp. 1404– 1418, 2019

work page 2019

[18] [18]

Interpretable lightweight transformer via unrolling of learned graph smoothness priors,

VIET HO TAM THUC DO, Parham Eftekhar, Seyed Alireza Hosseini, Gene Cheung, and Philip Chou, “Interpretable lightweight transformer via unrolling of learned graph smoothness priors,” inAdvances in Neural Information Processing Systems, A. Globerson, L. Mackey, D. Belgrave, A. Fan, U. Paquet, J. Tomczak, and C. Zhang, Eds. 2024, vol. 37, pp. 6393–6416, Curr...

work page 2024

[19] [19]

Neural Machine Translation by Jointly Learning to Align and Translate

Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio, “Neural machine translation by jointly learning to align and translate,”CoRR, vol. abs/1409.0473, 2014

work page internal anchor Pith review Pith/arXiv arXiv 2014

[20] [20]

Mixed graph signal analysis of joint image denoising / interpolation,

Niruhan Viswarupan, Gene Cheung, Fengbo Lan, and Michael S. Brown, “Mixed graph signal analysis of joint image denoising / interpolation,” inICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2024, pp. 9431–9435

work page 2024

[21] [21]

The little engine that could: Regularization by denoising (RED),

Michael Elad Yaniv Romano and Peyman Milanfar, “The little engine that could: Regularization by denoising (RED),” in SIAM Journal on Imaging Sciences, 2017, vol. 10, no.4, pp. 1804–1844

work page 2017

[22] [22]

Linear convergence and metric selection for Douglas-Rachford splitting and ADMM,

Pontus Giselsson and Stephen Boyd, “Linear convergence and metric selection for Douglas-Rachford splitting and ADMM,” IEEE Transactions on Automatic Control, vol. 62, no. 2, pp. 532–544, 2017

work page 2017

[23] [23]

Bi-cgstab: A fast and smoothly converging variant of bi-cg for the solution of nonsymmetric linear systems,

Henk A. Van Der V orst, “Bi-cgstab: A fast and smoothly converging variant of bi-cg for the solution of nonsymmetric linear systems,”SIAM Journal on Scientific and Statistical Computing, vol. 13, pp. 631, 1992

work page 1992

[24] [24]

Methods of conjugate gradients for solving linear systems,

Magnus R Hestenes and E. L. Stiefel, “Methods of conjugate gradients for solving linear systems,”Journal of Research of the National Bureau of Standards (United States), vol. 49, 1952

work page 1952

[25] [25]

Graph algorithm unrolling with douglas-rachford iterations for image interpolation with guaranteed initialization,

Xue Zhang, Bingshuo Hu, and Gene Cheung, “Graph algorithm unrolling with douglas-rachford iterations for image interpolation with guaranteed initialization,”arXiv preprint URL https://arxiv.org/abs/2509.1192, 2025

work page arXiv 2025

[26] [26]

Ntire 2017 challenge on single image super-resolution: Dataset and study,

Eirikur Agustsson and Radu Timofte, “Ntire 2017 challenge on single image super-resolution: Dataset and study,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2017, pp. 126–135

work page 2017

[27] [27]

Kodak lossless true color image suite (photocd pcd0992),

Eastman Kodak, “Kodak lossless true color image suite (photocd pcd0992),”URL http://r0k. us/graphics/kodak, vol. 6, pp. 2, 1993

work page 1993

[28] [28]

Single image super-resolution from transformed self-exemplars,

Jia-Bin Huang, Abhishek Singh, and Narendra Ahuja, “Single image super-resolution from transformed self-exemplars,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2015, pp. 5197–5206

work page 2015

[29] [29]

Color demosaicking by local directional interpolation and nonlocal adaptive thresholding,

Lei Zhang, Xiaolin Wu, Antoni Buades, and Xin Li, “Color demosaicking by local directional interpolation and nonlocal adaptive thresholding,”Journal of Electronic imaging, vol. 20, no. 2, pp. 023016–023016, 2011

work page 2011

[30] [30]

Image super-resolution using deep convolutional networks,

Chao Dong, Chen Change Loy, Kaiming He, and Xiaoou Tang, “Image super-resolution using deep convolutional networks,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 38, no. 2, pp. 295–307, 2016

work page 2016

[31] [31]

Image quality assessment: from error visibility to structural similarity,

Zhou Wang, A.C. Bovik, H.R. Sheikh, and E.P. Simoncelli, “Image quality assessment: from error visibility to structural similarity,”IEEE Transactions on Image Processing, vol. 13, no. 4, pp. 600–612, 2004

work page 2004