A General Decoupled Learning Framework for Parameterized Image Operators

Baoquan Chen; Dongdong Chen; Gang Hua; Lu Yuan; Nenghai Yu; Qingnan Fan

arxiv: 1907.05852 · v1 · pith:CXM2Y7P7new · submitted 2019-07-11 · 💻 cs.CV

A General Decoupled Learning Framework for Parameterized Image Operators

Qingnan Fan , Dongdong Chen , Lu Yuan , Gang Hua , Nenghai Yu , Baoquan Chen This is my paper

Pith reviewed 2026-05-24 23:38 UTC · model grok-4.3

classification 💻 cs.CV

keywords decoupled learningparameterized image operatorsweight learning networkbase networkimage processingdynamic weightsdeep networks

0 comments

The pith

A weight learning network dynamically adjusts a base network's weights according to any parameter value of an image operator.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces a decoupled training setup in which a base network approximates a traditional image operator while a second network learns to map the operator's parameter to weight adjustments for the base network. Both networks train end-to-end so that one pair of networks handles every possible parameter setting instead of requiring a separate base network for each setting. The method is demonstrated on several classic parameterized operators such as filtering and enhancement tasks. A further single-layer variant keeps most computation shared across parameter values while still allowing on-the-fly adjustment.

Core claim

The decoupled learning framework trains a weight learning network to predict weight adjustments for a base network based on the input parameter of a parameterized image operator, enabling the base network to adapt to arbitrary parameter values through end-to-end training.

What carries the argument

The weight learning network, which takes the operator parameter as input and outputs adjustments to the base network's convolutional weights.

If this is right

The same base network can be reused across all parameter settings of a given operator after joint training.
The single-layer extension changes only one layer's weights at runtime while sharing the rest of the computation.
The approach applies directly to multiple traditional parameterized operators without redesigning the base architecture.
Parameter tuning becomes faster because weight adjustments are generated by a forward pass rather than full retraining.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The framework could be tested on operators whose parameters vary continuously rather than in discrete steps to check stability.
If the weight learning network is made parameter-free in its own architecture, the overall system might further reduce memory for multiple operators.
The method might extend to video or 3D operators if the weight adjustments can be made temporally consistent.

Load-bearing premise

A separate weight learning network can be trained to produce stable and effective weight adjustments for arbitrary parameter values without per-parameter retraining of the base network.

What would settle it

Train the framework on a discrete set of parameter values, then evaluate on a held-out parameter value never seen during training; if accuracy falls below that of separately trained per-parameter networks, the framework does not generalize as claimed.

Figures

Figures reproduced from arXiv: 1907.05852 by Baoquan Chen, Dongdong Chen, Gang Hua, Lu Yuan, Nenghai Yu, Qingnan Fan.

**Figure 2.** Figure 2: As can be seen, our single network trained on continuous [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 2.** Figure 2: Visual examples produced by our framework trained on continuous parameter settings of six image filters independently. Note all the visual [PITH_FULL_IMAGE:figures/full_fig_p006_2.png] view at source ↗

**Figure 3.** Figure 3: Visual examples produced by our framework trained on continuous parameter settings of three image restoration tasks independently. Note [PITH_FULL_IMAGE:figures/full_fig_p007_3.png] view at source ↗

**Figure 4.** Figure 4: Effective receptive field of L0 smoothing for different spatial positions and parameter λ. The top to bottom indicate the effective receptive field of a non-edge point, a moderate edge point, and a strong edge point. the larger the smoothing parameter λ is, the larger the effective field is, and most effective points fall within the object boundary. 2) For a moderate edge point, its receptive field stays s… view at source ↗

**Figure 5.** Figure 5: Equivalent analysis of the connection between the [PITH_FULL_IMAGE:figures/full_fig_p013_5.png] view at source ↗

read the original abstract

Many different deep networks have been used to approximate, accelerate or improve traditional image operators. Among these traditional operators, many contain parameters which need to be tweaked to obtain the satisfactory results, which we refer to as parameterized image operators. However, most existing deep networks trained for these operators are only designed for one specific parameter configuration, which does not meet the needs of real scenarios that usually require flexible parameters settings. To overcome this limitation, we propose a new decoupled learning algorithm to learn from the operator parameters to dynamically adjust the weights of a deep network for image operators, denoted as the base network. The learned algorithm is formed as another network, namely the weight learning network, which can be end-to-end jointly trained with the base network. Experiments demonstrate that the proposed framework can be successfully applied to many traditional parameterized image operators. To accelerate the parameter tuning for practical scenarios, the proposed framework can be further extended to dynamically change the weights of only one single layer of the base network while sharing most computation cost. We demonstrate that this cheap parameter-tuning extension of the proposed decoupled learning framework even outperforms the state-of-the-art alternative approaches.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper introduces a weight-learning network to let one base model handle varying parameters in image operators, but the abstract gives no numbers so the performance claims stay unverified.

read the letter

The main new piece is the decoupled setup: a small weight-learning network takes the operator parameter as input and produces adjustments to the base network's weights, trained end-to-end. This lets the same model cover a range of parameter values instead of needing separate models or retraining for each. The single-layer extension keeps most of the computation shared while still allowing parameter changes, which is a practical touch for tuning speed.

Referee Report

2 major / 1 minor

Summary. The manuscript proposes a decoupled learning framework for parameterized image operators consisting of a base network approximating the operator and a separate weight-learning network that maps operator parameter values to dynamic weight adjustments for the base network; the two are trained jointly end-to-end. It further introduces a single-layer variant of this framework that adjusts only one layer while sharing most computation and claims that the approach applies successfully to multiple traditional operators and that the single-layer extension outperforms state-of-the-art alternatives.

Significance. If the generalization and performance claims hold, the framework would provide a practical mechanism for flexible parameter control in deep approximations of classical image operators without per-parameter retraining, which could reduce overhead in applications such as filtering, enhancement, and restoration. The explicit separation of weight learning from the base network and the single-layer efficiency extension are concrete strengths that, if empirically validated with proper controls, would be useful contributions.

major comments (2)

[Abstract] Abstract: the central claim that 'experiments demonstrate that the proposed framework can be successfully applied to many traditional parameterized image operators' and that the single-layer extension 'even outperforms the state-of-the-art alternative approaches' is asserted without any quantitative metrics, error bars, dataset specifications, or ablation results. Because the soundness of the empirical superiority and generalization arguments rests on these unshown results, the abstract's assertion cannot be evaluated from the provided information.
[Framework description and Experiments] Framework description and Experiments: the weakest assumption is that the weight-learning network, trained on a finite set of sampled parameter values, produces effective adjustments for arbitrary or unseen parameter values without per-parameter retraining or instability. No explicit out-of-distribution testing, interpolation experiments, or continuous-parameter evaluation protocol is described that would substantiate this extrapolation, which is load-bearing for both the general framework and the single-layer claim.

minor comments (1)

The notation distinguishing the base network weights from the outputs of the weight-learning network could be made more explicit, ideally with a clear diagram or pseudocode in the method section.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback. We address the two major comments point by point below and will revise the manuscript to improve clarity and empirical support.

read point-by-point responses

Referee: [Abstract] Abstract: the central claim that 'experiments demonstrate that the proposed framework can be successfully applied to many traditional parameterized image operators' and that the single-layer extension 'even outperforms the state-of-the-art alternative approaches' is asserted without any quantitative metrics, error bars, dataset specifications, or ablation results. Because the soundness of the empirical superiority and generalization arguments rests on these unshown results, the abstract's assertion cannot be evaluated from the provided information.

Authors: We agree that the abstract would be strengthened by including brief quantitative highlights. In the revision we will update the abstract to reference key metrics (e.g., average PSNR/SSIM gains across operators and datasets) and the number of operators tested, while preserving conciseness. revision: yes
Referee: [Framework description and Experiments] Framework description and Experiments: the weakest assumption is that the weight-learning network, trained on a finite set of sampled parameter values, produces effective adjustments for arbitrary or unseen parameter values without per-parameter retraining or instability. No explicit out-of-distribution testing, interpolation experiments, or continuous-parameter evaluation protocol is described that would substantiate this extrapolation, which is load-bearing for both the general framework and the single-layer claim.

Authors: We acknowledge that the current manuscript does not explicitly describe out-of-distribution testing or dedicated interpolation protocols. To substantiate the generalization claim we will add a new subsection that details the parameter sampling strategy, reports results on interpolated and held-out parameter values, and clarifies the continuous evaluation protocol used for each operator. revision: yes

Circularity Check

0 steps flagged

No circularity; standard end-to-end training with external evaluation

full rationale

The paper proposes a decoupled framework with a base network and a jointly trained weight-learning network. All performance claims rest on experimental results evaluated on held-out images and operators, not on any quantity defined from the training loss or fitted parameters. No self-definitional equations, fitted inputs renamed as predictions, or load-bearing self-citations appear in the abstract or framework description. The derivation chain is self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the assumption that operator parameters can be mapped to useful network weights via a learned auxiliary network, plus standard supervised learning assumptions about data distribution and optimization convergence. No new physical entities or ad-hoc constants are introduced.

axioms (1)

domain assumption End-to-end gradient descent on paired input-output examples will produce a weight generator that generalizes across parameter settings of the target operator.
Invoked implicitly when the authors state that the two networks can be jointly trained to handle flexible parameter settings.

pith-pipeline@v0.9.0 · 5736 in / 1340 out tokens · 14496 ms · 2026-05-24T23:38:40.624851+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

48 extracted references · 48 canonical work pages

[1]

A generic deep architecture for single image reﬂection removal and image smoothing,

Q. Fan, J. Yang, G. Hua, B. Chen, and D. Wipf, “A generic deep architecture for single image reﬂection removal and image smoothing,” in Proceedings of the 16th International Conference on Computer Vision (ICCV), 2017, pp. 3238–3247

work page 2017
[2]

Accurate image super-resolution using very deep convolutional networks,

J. Kim, J. Kwon Lee, and K. Mu Lee, “Accurate image super-resolution using very deep convolutional networks,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 1646– 1654

work page 2016
[3]

Deep edge-aware ﬁlters,

L. Xu, J. Ren, Q. Yan, R. Liao, and J. Jia, “Deep edge-aware ﬁlters,” in International Conference on Machine Learning, 2015, pp. 1669–1678

work page 2015
[4]

Bilateral guided upsampling,

J. Chen, A. Adams, N. Wadhwa, and S. W. Hasinoff, “Bilateral guided upsampling,” ACM Transactions on Graphics (TOG) , vol. 35, no. 6, p. 203, 2016

work page 2016
[5]

Deep bilateral learning for real-time image enhancement,

M. Gharbi, J. Chen, J. T. Barron, S. W. Hasinoff, and F. Durand, “Deep bilateral learning for real-time image enhancement,” ACM Transactions on Graphics (TOG), vol. 36, no. 4, p. 118, 2017

work page 2017
[6]

Image smoothing via l 0 gradient minimization,

L. Xu, C. Lu, Y . Xu, and J. Jia, “Image smoothing via l 0 gradient minimization,” in ACM Transactions on Graphics (TOG), vol. 30, no. 6. ACM, 2011, p. 174

work page 2011
[7]

Structure-preserving image smoothing via region covariances,

L. Karacan, E. Erdem, and A. Erdem, “Structure-preserving image smoothing via region covariances,” ACM Transactions on Graphics (TOG), vol. 32, no. 6, p. 176, 2013

work page 2013
[8]

Structure extraction from texture via relative total variation,

L. Xu, Q. Yan, Y . Xia, and J. Jia, “Structure extraction from texture via relative total variation,” ACM Transactions on Graphics (TOG), vol. 31, no. 6, p. 139, 2012

work page 2012
[9]

Rolling guidance ﬁlter,

Q. Zhang, X. Shen, L. Xu, and J. Jia, “Rolling guidance ﬁlter,” in European conference on computer vision. Springer, 2014, pp. 815–830

work page 2014
[10]

A non-local algorithm for image denoising,

A. Buades, B. Coll, and J.-M. Morel, “A non-local algorithm for image denoising,” in Computer Vision and Pattern Recognition, 2005. CVPR

work page 2005
[11]

IEEE Computer Society Conference on , vol. 2. IEEE, 2005, pp. 60–65

work page 2005
[12]

Image denoising via sparse and redundant representations over learned dictionaries,

M. Elad and M. Aharon, “Image denoising via sparse and redundant representations over learned dictionaries,” IEEE Transactions on Image processing, vol. 15, no. 12, pp. 3736–3745, 2006

work page 2006
[13]

Image super-resolution via sparse representation,

J. Yang, J. Wright, T. S. Huang, and Y . Ma, “Image super-resolution via sparse representation,” IEEE transactions on image processing , vol. 19, no. 11, pp. 2861–2873, 2010

work page 2010
[14]

Image super-resolution using gradient proﬁle prior,

J. Sun, Z. Xu, and H.-Y . Shum, “Image super-resolution using gradient proﬁle prior,” in Computer Vision and Pattern Recognition, 2008. CVPR

work page 2008
[15]

IEEE, 2008, pp

IEEE Conference on. IEEE, 2008, pp. 1–8

work page 2008
[16]

Bayesian image super-resolution,

M. E. Tipping and C. M. Bishop, “Bayesian image super-resolution,” in Advances in neural information processing systems , 2003, pp. 1303– 1310

work page 2003
[17]

Image super-resolution using deep convolutional networks,

C. Dong, C. C. Loy, K. He, and X. Tang, “Image super-resolution using deep convolutional networks,”IEEE transactions on pattern analysis and machine intelligence, vol. 38, no. 2, pp. 295–307, 2016

work page 2016
[18]

Natural image denoising with convolutional networks,

V . Jain and S. Seung, “Natural image denoising with convolutional networks,” in Advances in Neural Information Processing Systems, 2009, pp. 769–776

work page 2009
[19]

Learning recursive ﬁlters for low- level vision via a hybrid neural network,

S. Liu, J. Pan, and M.-H. Yang, “Learning recursive ﬁlters for low- level vision via a hybrid neural network,” in European Conference on Computer Vision. Springer, 2016, pp. 560–576

work page 2016
[20]

Fast image processing with fully- convolutional networks,

Q. Chen, J. Xu, and V . Koltun, “Fast image processing with fully- convolutional networks,” inIEEE International Conference on Computer Vision, vol. 9, 2017

work page 2017
[21]

Joint bilateral upsampling,

J. Kopf, M. F. Cohen, D. Lischinski, and M. Uyttendaele, “Joint bilateral upsampling,” ACM Transactions on Graphics (ToG), vol. 26, no. 3, p. 96, 2007

work page 2007
[22]

Fast guided ﬁlter,

K. He and J. Sun, “Fast guided ﬁlter,” arXiv preprint arXiv:1505.0099

work page arXiv
[23]

Learning to control fast-weight memories: An alterna- tive to dynamic recurrent networks,

J. Schmidhuber, “Learning to control fast-weight memories: An alterna- tive to dynamic recurrent networks,” Neural Computation, vol. 4, no. 1, pp. 131–139, 1992

work page 1992
[24]

Learning to learn by gradient descent by gradient descent,

M. Andrychowicz, M. Denil, S. Gomez, M. W. Hoffman, D. Pfau, T. Schaul, B. Shillingford, and N. De Freitas, “Learning to learn by gradient descent by gradient descent,” inAdvances in Neural Information Processing Systems, 2016, pp. 3981–3989

work page 2016
[25]

Learned optimizers that scale and generalize,

O. Wichrowska, N. Maheswaranathan, M. W. Hoffman, S. G. Col- menarejo, M. Denil, N. de Freitas, and J. Sohl-Dickstein, “Learned optimizers that scale and generalize,” in International Conference on Machine Learning, 2017

work page 2017
[26]

Learning to learn for global optimization of black box JOURNAL OF LATEX CLASS FILES, AUGUST 2019 14 functions,

Y . Chen, M. W. Hoffman, S. G. Colmenarejo, M. Denil, T. P. Lillicrap, and N. de Freitas, “Learning to learn for global optimization of black box JOURNAL OF LATEX CLASS FILES, AUGUST 2019 14 functions,” in International Conference on Machine Learning , 2017, pp. 748–756

work page 2019
[27]

Hypernetworks,

D. Ha, A. Dai, and Q. V . Le, “Hypernetworks,” ICLR, 2017

work page 2017
[28]

Image smoothing via l0 gradient minimization,

L. Xu, C. Lu, Y . Xu, and J. Jia, “Image smoothing via l0 gradient minimization,” ACM Transactions on Graphics (SIGGRAPH Asia), 2011

work page 2011
[29]

Improved texture networks: Maximizing quality and diversity in feed-forward stylization and texture synthesis,

D. Ulyanov, A. Vedaldi, and V . Lempitsky, “Improved texture networks: Maximizing quality and diversity in feed-forward stylization and texture synthesis,” in Proc. CVPR, 2017

work page 2017
[30]

Multi-scale context aggregation by dilated convo- lutions,

F. Yu and V . Koltun, “Multi-scale context aggregation by dilated convo- lutions,” in ICLR, 2016

work page 2016
[31]

Edge-preserving decompositions for multi-scale tone and detail manipulation,

Z. Farbman, R. Fattal, D. Lischinski, and R. Szeliski, “Edge-preserving decompositions for multi-scale tone and detail manipulation,” in ACM Transactions on Graphics (TOG), vol. 27, no. 3. ACM, 2008, p. 67

work page 2008
[32]

100+ times faster weighted median ﬁlter (wmf),

Q. Zhang, L. Xu, and J. Jia, “100+ times faster weighted median ﬁlter (wmf),” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2014, pp. 2830–2837

work page 2014
[33]

Fast local laplacian ﬁlters: Theory and applications,

M. Aubry, S. Paris, S. W. Hasinoff, J. Kautz, and F. Durand, “Fast local laplacian ﬁlters: Theory and applications,” ACM Transactions on Graphics (TOG), vol. 33, no. 5, p. 167, 2014

work page 2014
[34]

Learning a deep convolu- tional network for image super-resolution,

C. Dong, C. C. Loy, K. He, and X. Tang, “Learning a deep convolu- tional network for image super-resolution,” in European conference on computer vision. Springer, 2014, pp. 184–199

work page 2014
[35]

Image restoration using very deep convolutional encoder-decoder networks with symmetric skip connec- tions,

X. Mao, C. Shen, and Y .-B. Yang, “Image restoration using very deep convolutional encoder-decoder networks with symmetric skip connec- tions,” in Advances in neural information processing systems , 2016, pp. 2802–2810

work page 2016
[36]

Compression artifacts reduction by a deep convolutional network,

C. Dong, Y . Deng, C. Change Loy, and X. Tang, “Compression artifacts reduction by a deep convolutional network,” in Proceedings of the IEEE International Conference on Computer Vision, 2015, pp. 576–584

work page 2015
[37]

Removing rain from single images via a deep detail network,

X. Fu, J. Huang, D. Zeng, Y . Huang, X. Ding, and J. Paisley, “Removing rain from single images via a deep detail network,” in IEEE Conference on Computer Vision and Pattern Recognition, 2017, pp. 1715–1723

work page 2017
[38]

Xception: Deep learning with depthwise separable convolu- tions,

F. Chollet, “Xception: Deep learning with depthwise separable convolu- tions,” CVPR, 2017

work page 2017
[39]

Combining sketch and tone for pencil drawing production,

C. Lu, L. Xu, and J. Jia, “Combining sketch and tone for pencil drawing production,” in Proceedings of the Symposium on Non-Photorealistic Animation and Rendering . Eurographics Association, 2012, pp. 65– 73

work page 2012
[40]

Trainable nonlinear reaction diffusion: A ﬂexible framework for fast and effective image restoration,

Y . Chen and T. Pock, “Trainable nonlinear reaction diffusion: A ﬂexible framework for fast and effective image restoration,”IEEE transactions on pattern analysis and machine intelligence, vol. 39, no. 6, pp. 1256–1272, 2017

work page 2017
[41]

Beyond a gaussian denoiser: Residual learning of deep cnn for image denoising,

K. Zhang, W. Zuo, Y . Chen, D. Meng, and L. Zhang, “Beyond a gaussian denoiser: Residual learning of deep cnn for image denoising,” IEEE Transactions on Image Processing, vol. 26, no. 7, pp. 3142–3155, 2017

work page 2017
[42]

Memnet: A persistent memory network for image restoration,

Y . Tai, J. Yang, X. Liu, and C. Xu, “Memnet: A persistent memory network for image restoration,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017, pp. 4539–4547

work page 2017
[43]

Accurate image super-resolution using very deep convolutional networks,

J. Kim, J. K. Lee, and K. M. Lee, “Accurate image super-resolution using very deep convolutional networks,” in The IEEE Conference on Computer Vision and Pattern Recognition (CVPR Oral), June 2016

work page 2016
[44]

Deeply-recursive convolutional network for image super-resolution,

J. Kim, J. Kwon Lee, and K. Mu Lee, “Deeply-recursive convolutional network for image super-resolution,” in Proceedings of the IEEE confer- ence on computer vision and pattern recognition , 2016, pp. 1637–1645

work page 2016
[45]

Image denoising by sparse 3-d transform-domain collaborative ﬁltering,

K. Dabov, A. Foi, V . Katkovnik, and K. Egiazarian, “Image denoising by sparse 3-d transform-domain collaborative ﬁltering,” IEEE Transactions on image processing, vol. 16, no. 8, pp. 2080–2095, 2007

work page 2080
[46]

Weighted nuclear norm minimization with application to image denoising,

S. Gu, L. Zhang, W. Zuo, and X. Feng, “Weighted nuclear norm minimization with application to image denoising,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , 2014, pp. 2862–2869

work page 2014
[47]

Learning deep cnn denoiser prior for image restoration,

K. Zhang, W. Zuo, S. Gu, and L. Zhang, “Learning deep cnn denoiser prior for image restoration,” inIEEE Conference on Computer Vision and Pattern Recognition, vol. 2, 2017

work page 2017
[48]

Clearing the skies: A deep network architecture for single-image rain removal,

X. Fu, J. Huang, X. Ding, Y . Liao, and J. Paisley, “Clearing the skies: A deep network architecture for single-image rain removal,” IEEE Transactions on Image Processing, vol. 26, no. 6, pp. 2944–2956, 2017. Qingnan Fan is a Postdoctoral Scholar in the Computer Science Department of Stanford Uni- versity. He received his PhD degree from Shan- dong Univer...

work page 2017

[1] [1]

A generic deep architecture for single image reﬂection removal and image smoothing,

Q. Fan, J. Yang, G. Hua, B. Chen, and D. Wipf, “A generic deep architecture for single image reﬂection removal and image smoothing,” in Proceedings of the 16th International Conference on Computer Vision (ICCV), 2017, pp. 3238–3247

work page 2017

[2] [2]

Accurate image super-resolution using very deep convolutional networks,

J. Kim, J. Kwon Lee, and K. Mu Lee, “Accurate image super-resolution using very deep convolutional networks,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 1646– 1654

work page 2016

[3] [3]

Deep edge-aware ﬁlters,

L. Xu, J. Ren, Q. Yan, R. Liao, and J. Jia, “Deep edge-aware ﬁlters,” in International Conference on Machine Learning, 2015, pp. 1669–1678

work page 2015

[4] [4]

Bilateral guided upsampling,

J. Chen, A. Adams, N. Wadhwa, and S. W. Hasinoff, “Bilateral guided upsampling,” ACM Transactions on Graphics (TOG) , vol. 35, no. 6, p. 203, 2016

work page 2016

[5] [5]

Deep bilateral learning for real-time image enhancement,

M. Gharbi, J. Chen, J. T. Barron, S. W. Hasinoff, and F. Durand, “Deep bilateral learning for real-time image enhancement,” ACM Transactions on Graphics (TOG), vol. 36, no. 4, p. 118, 2017

work page 2017

[6] [6]

Image smoothing via l 0 gradient minimization,

L. Xu, C. Lu, Y . Xu, and J. Jia, “Image smoothing via l 0 gradient minimization,” in ACM Transactions on Graphics (TOG), vol. 30, no. 6. ACM, 2011, p. 174

work page 2011

[7] [7]

Structure-preserving image smoothing via region covariances,

L. Karacan, E. Erdem, and A. Erdem, “Structure-preserving image smoothing via region covariances,” ACM Transactions on Graphics (TOG), vol. 32, no. 6, p. 176, 2013

work page 2013

[8] [8]

Structure extraction from texture via relative total variation,

L. Xu, Q. Yan, Y . Xia, and J. Jia, “Structure extraction from texture via relative total variation,” ACM Transactions on Graphics (TOG), vol. 31, no. 6, p. 139, 2012

work page 2012

[9] [9]

Rolling guidance ﬁlter,

Q. Zhang, X. Shen, L. Xu, and J. Jia, “Rolling guidance ﬁlter,” in European conference on computer vision. Springer, 2014, pp. 815–830

work page 2014

[10] [10]

A non-local algorithm for image denoising,

A. Buades, B. Coll, and J.-M. Morel, “A non-local algorithm for image denoising,” in Computer Vision and Pattern Recognition, 2005. CVPR

work page 2005

[11] [11]

IEEE Computer Society Conference on , vol. 2. IEEE, 2005, pp. 60–65

work page 2005

[12] [12]

Image denoising via sparse and redundant representations over learned dictionaries,

M. Elad and M. Aharon, “Image denoising via sparse and redundant representations over learned dictionaries,” IEEE Transactions on Image processing, vol. 15, no. 12, pp. 3736–3745, 2006

work page 2006

[13] [13]

Image super-resolution via sparse representation,

J. Yang, J. Wright, T. S. Huang, and Y . Ma, “Image super-resolution via sparse representation,” IEEE transactions on image processing , vol. 19, no. 11, pp. 2861–2873, 2010

work page 2010

[14] [14]

Image super-resolution using gradient proﬁle prior,

J. Sun, Z. Xu, and H.-Y . Shum, “Image super-resolution using gradient proﬁle prior,” in Computer Vision and Pattern Recognition, 2008. CVPR

work page 2008

[15] [15]

IEEE, 2008, pp

IEEE Conference on. IEEE, 2008, pp. 1–8

work page 2008

[16] [16]

Bayesian image super-resolution,

M. E. Tipping and C. M. Bishop, “Bayesian image super-resolution,” in Advances in neural information processing systems , 2003, pp. 1303– 1310

work page 2003

[17] [17]

Image super-resolution using deep convolutional networks,

C. Dong, C. C. Loy, K. He, and X. Tang, “Image super-resolution using deep convolutional networks,”IEEE transactions on pattern analysis and machine intelligence, vol. 38, no. 2, pp. 295–307, 2016

work page 2016

[18] [18]

Natural image denoising with convolutional networks,

V . Jain and S. Seung, “Natural image denoising with convolutional networks,” in Advances in Neural Information Processing Systems, 2009, pp. 769–776

work page 2009

[19] [19]

Learning recursive ﬁlters for low- level vision via a hybrid neural network,

S. Liu, J. Pan, and M.-H. Yang, “Learning recursive ﬁlters for low- level vision via a hybrid neural network,” in European Conference on Computer Vision. Springer, 2016, pp. 560–576

work page 2016

[20] [20]

Fast image processing with fully- convolutional networks,

Q. Chen, J. Xu, and V . Koltun, “Fast image processing with fully- convolutional networks,” inIEEE International Conference on Computer Vision, vol. 9, 2017

work page 2017

[21] [21]

Joint bilateral upsampling,

J. Kopf, M. F. Cohen, D. Lischinski, and M. Uyttendaele, “Joint bilateral upsampling,” ACM Transactions on Graphics (ToG), vol. 26, no. 3, p. 96, 2007

work page 2007

[22] [22]

Fast guided ﬁlter,

K. He and J. Sun, “Fast guided ﬁlter,” arXiv preprint arXiv:1505.0099

work page arXiv

[23] [23]

Learning to control fast-weight memories: An alterna- tive to dynamic recurrent networks,

J. Schmidhuber, “Learning to control fast-weight memories: An alterna- tive to dynamic recurrent networks,” Neural Computation, vol. 4, no. 1, pp. 131–139, 1992

work page 1992

[24] [24]

Learning to learn by gradient descent by gradient descent,

M. Andrychowicz, M. Denil, S. Gomez, M. W. Hoffman, D. Pfau, T. Schaul, B. Shillingford, and N. De Freitas, “Learning to learn by gradient descent by gradient descent,” inAdvances in Neural Information Processing Systems, 2016, pp. 3981–3989

work page 2016

[25] [25]

Learned optimizers that scale and generalize,

O. Wichrowska, N. Maheswaranathan, M. W. Hoffman, S. G. Col- menarejo, M. Denil, N. de Freitas, and J. Sohl-Dickstein, “Learned optimizers that scale and generalize,” in International Conference on Machine Learning, 2017

work page 2017

[26] [26]

Learning to learn for global optimization of black box JOURNAL OF LATEX CLASS FILES, AUGUST 2019 14 functions,

Y . Chen, M. W. Hoffman, S. G. Colmenarejo, M. Denil, T. P. Lillicrap, and N. de Freitas, “Learning to learn for global optimization of black box JOURNAL OF LATEX CLASS FILES, AUGUST 2019 14 functions,” in International Conference on Machine Learning , 2017, pp. 748–756

work page 2019

[27] [27]

Hypernetworks,

D. Ha, A. Dai, and Q. V . Le, “Hypernetworks,” ICLR, 2017

work page 2017

[28] [28]

Image smoothing via l0 gradient minimization,

L. Xu, C. Lu, Y . Xu, and J. Jia, “Image smoothing via l0 gradient minimization,” ACM Transactions on Graphics (SIGGRAPH Asia), 2011

work page 2011

[29] [29]

Improved texture networks: Maximizing quality and diversity in feed-forward stylization and texture synthesis,

D. Ulyanov, A. Vedaldi, and V . Lempitsky, “Improved texture networks: Maximizing quality and diversity in feed-forward stylization and texture synthesis,” in Proc. CVPR, 2017

work page 2017

[30] [30]

Multi-scale context aggregation by dilated convo- lutions,

F. Yu and V . Koltun, “Multi-scale context aggregation by dilated convo- lutions,” in ICLR, 2016

work page 2016

[31] [31]

Edge-preserving decompositions for multi-scale tone and detail manipulation,

Z. Farbman, R. Fattal, D. Lischinski, and R. Szeliski, “Edge-preserving decompositions for multi-scale tone and detail manipulation,” in ACM Transactions on Graphics (TOG), vol. 27, no. 3. ACM, 2008, p. 67

work page 2008

[32] [32]

100+ times faster weighted median ﬁlter (wmf),

Q. Zhang, L. Xu, and J. Jia, “100+ times faster weighted median ﬁlter (wmf),” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2014, pp. 2830–2837

work page 2014

[33] [33]

Fast local laplacian ﬁlters: Theory and applications,

M. Aubry, S. Paris, S. W. Hasinoff, J. Kautz, and F. Durand, “Fast local laplacian ﬁlters: Theory and applications,” ACM Transactions on Graphics (TOG), vol. 33, no. 5, p. 167, 2014

work page 2014

[34] [34]

Learning a deep convolu- tional network for image super-resolution,

C. Dong, C. C. Loy, K. He, and X. Tang, “Learning a deep convolu- tional network for image super-resolution,” in European conference on computer vision. Springer, 2014, pp. 184–199

work page 2014

[35] [35]

Image restoration using very deep convolutional encoder-decoder networks with symmetric skip connec- tions,

X. Mao, C. Shen, and Y .-B. Yang, “Image restoration using very deep convolutional encoder-decoder networks with symmetric skip connec- tions,” in Advances in neural information processing systems , 2016, pp. 2802–2810

work page 2016

[36] [36]

Compression artifacts reduction by a deep convolutional network,

C. Dong, Y . Deng, C. Change Loy, and X. Tang, “Compression artifacts reduction by a deep convolutional network,” in Proceedings of the IEEE International Conference on Computer Vision, 2015, pp. 576–584

work page 2015

[37] [37]

Removing rain from single images via a deep detail network,

X. Fu, J. Huang, D. Zeng, Y . Huang, X. Ding, and J. Paisley, “Removing rain from single images via a deep detail network,” in IEEE Conference on Computer Vision and Pattern Recognition, 2017, pp. 1715–1723

work page 2017

[38] [38]

Xception: Deep learning with depthwise separable convolu- tions,

F. Chollet, “Xception: Deep learning with depthwise separable convolu- tions,” CVPR, 2017

work page 2017

[39] [39]

Combining sketch and tone for pencil drawing production,

C. Lu, L. Xu, and J. Jia, “Combining sketch and tone for pencil drawing production,” in Proceedings of the Symposium on Non-Photorealistic Animation and Rendering . Eurographics Association, 2012, pp. 65– 73

work page 2012

[40] [40]

Trainable nonlinear reaction diffusion: A ﬂexible framework for fast and effective image restoration,

Y . Chen and T. Pock, “Trainable nonlinear reaction diffusion: A ﬂexible framework for fast and effective image restoration,”IEEE transactions on pattern analysis and machine intelligence, vol. 39, no. 6, pp. 1256–1272, 2017

work page 2017

[41] [41]

Beyond a gaussian denoiser: Residual learning of deep cnn for image denoising,

K. Zhang, W. Zuo, Y . Chen, D. Meng, and L. Zhang, “Beyond a gaussian denoiser: Residual learning of deep cnn for image denoising,” IEEE Transactions on Image Processing, vol. 26, no. 7, pp. 3142–3155, 2017

work page 2017

[42] [42]

Memnet: A persistent memory network for image restoration,

Y . Tai, J. Yang, X. Liu, and C. Xu, “Memnet: A persistent memory network for image restoration,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017, pp. 4539–4547

work page 2017

[43] [43]

Accurate image super-resolution using very deep convolutional networks,

J. Kim, J. K. Lee, and K. M. Lee, “Accurate image super-resolution using very deep convolutional networks,” in The IEEE Conference on Computer Vision and Pattern Recognition (CVPR Oral), June 2016

work page 2016

[44] [44]

Deeply-recursive convolutional network for image super-resolution,

J. Kim, J. Kwon Lee, and K. Mu Lee, “Deeply-recursive convolutional network for image super-resolution,” in Proceedings of the IEEE confer- ence on computer vision and pattern recognition , 2016, pp. 1637–1645

work page 2016

[45] [45]

Image denoising by sparse 3-d transform-domain collaborative ﬁltering,

K. Dabov, A. Foi, V . Katkovnik, and K. Egiazarian, “Image denoising by sparse 3-d transform-domain collaborative ﬁltering,” IEEE Transactions on image processing, vol. 16, no. 8, pp. 2080–2095, 2007

work page 2080

[46] [46]

Weighted nuclear norm minimization with application to image denoising,

S. Gu, L. Zhang, W. Zuo, and X. Feng, “Weighted nuclear norm minimization with application to image denoising,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , 2014, pp. 2862–2869

work page 2014

[47] [47]

Learning deep cnn denoiser prior for image restoration,

K. Zhang, W. Zuo, S. Gu, and L. Zhang, “Learning deep cnn denoiser prior for image restoration,” inIEEE Conference on Computer Vision and Pattern Recognition, vol. 2, 2017

work page 2017

[48] [48]

Clearing the skies: A deep network architecture for single-image rain removal,

X. Fu, J. Huang, X. Ding, Y . Liao, and J. Paisley, “Clearing the skies: A deep network architecture for single-image rain removal,” IEEE Transactions on Image Processing, vol. 26, no. 6, pp. 2944–2956, 2017. Qingnan Fan is a Postdoctoral Scholar in the Computer Science Department of Stanford Uni- versity. He received his PhD degree from Shan- dong Univer...

work page 2017