Stable and Controllable Neural Texture Synthesis and Style Transfer Using Histogram Losses

Eric Risser , Pierre Wilmot , Connelly Barnes

Authors on Pith no claims yet

classification 💻 cs.GR cs.CVcs.NE

keywords lossesqualitystyleimprovemethodsneuralsynthesistexture

read the original abstract

Recently, methods have been proposed that perform texture synthesis and style transfer by using convolutional neural networks (e.g. Gatys et al. [2015,2016]). These methods are exciting because they can in some cases create results with state-of-the-art quality. However, in this paper, we show these methods also have limitations in texture quality, stability, requisite parameter tuning, and lack of user controls. This paper presents a multiscale synthesis pipeline based on convolutional neural networks that ameliorates these issues. We first give a mathematical explanation of the source of instabilities in many previous approaches. We then improve these instabilities by using histogram losses to synthesize textures that better statistically match the exemplar. We also show how to integrate localized style losses in our multiscale framework. These losses can improve the quality of large features, improve the separation of content and style, and offer artistic controls such as paint by numbers. We demonstrate that our approach offers improved quality, convergence in fewer iterations, and more stability over the optimization.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Image-Guided Geometric Stylization of 3D Meshes
cs.CV 2026-04 unverdicted novelty 7.0

A coarse-to-fine pipeline deforms 3D meshes to reflect geometric features from an image using diffusion model representations while preserving topology and part-level semantics.
Toward Real-World Adoption of Portrait Relighting via Hybrid Domain Knowledge Fusion
cs.CV 2026-04 unverdicted novelty 6.0

Hybrid Domain Knowledge Fusion distills expertise from specialized models across synthetic, OLAT, and real datasets into a lightweight student model for state-of-the-art portrait relighting with 6x-240x faster inference.