Recognition: unknown
Stable and Controllable Neural Texture Synthesis and Style Transfer Using Histogram Losses
read the original abstract
Recently, methods have been proposed that perform texture synthesis and style transfer by using convolutional neural networks (e.g. Gatys et al. [2015,2016]). These methods are exciting because they can in some cases create results with state-of-the-art quality. However, in this paper, we show these methods also have limitations in texture quality, stability, requisite parameter tuning, and lack of user controls. This paper presents a multiscale synthesis pipeline based on convolutional neural networks that ameliorates these issues. We first give a mathematical explanation of the source of instabilities in many previous approaches. We then improve these instabilities by using histogram losses to synthesize textures that better statistically match the exemplar. We also show how to integrate localized style losses in our multiscale framework. These losses can improve the quality of large features, improve the separation of content and style, and offer artistic controls such as paint by numbers. We demonstrate that our approach offers improved quality, convergence in fewer iterations, and more stability over the optimization.
This paper has not been read by Pith yet.
Forward citations
Cited by 2 Pith papers
-
Image-Guided Geometric Stylization of 3D Meshes
A coarse-to-fine pipeline deforms 3D meshes to reflect geometric features from an image using diffusion model representations while preserving topology and part-level semantics.
-
Toward Real-World Adoption of Portrait Relighting via Hybrid Domain Knowledge Fusion
Hybrid Domain Knowledge Fusion distills expertise from specialized models across synthetic, OLAT, and real datasets into a lightweight student model for state-of-the-art portrait relighting with 6x-240x faster inference.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.