A learned representation for artistic style

Vincent Dumoulin, Jonathon Shlens, Manjunath Kudlur · 2016 · cs.CV · arXiv 1610.07629

6 Pith papers cite this work. Polarity classification is still indexing.

abstract

The diversity of painting styles represents a rich visual vocabulary for the construction of an image. The degree to which one may learn and parsimoniously capture this visual vocabulary measures our understanding of the higher level features of paintings, if not images in general. In this work we investigate the construction of a single, scalable deep network that can parsimoniously capture the artistic style of a diversity of paintings. We demonstrate that such a network generalizes across a diversity of artistic styles by reducing a painting to a point in an embedding space. Importantly, this model permits a user to explore new painting styles by arbitrarily combining the styles learned from individual paintings. We hope that this work provides a useful step towards building rich models of paintings and offers a window on to the structure of the learned representation of artistic style.
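The abstract does not name the mechanism, but the paper is known for conditional instance normalization: each style is reduced to a per-channel scale and shift applied after normalization, so a style is a point in a small parameter space and new styles can be formed by mixing those points. The block below is a minimal NumPy sketch of that idea, not the authors' code. The function names (`conditional_instance_norm`, `blend_styles`), the array shapes, and the randomly initialized parameters are assumptions made for illustration.

```python
import numpy as np


def conditional_instance_norm(x, gamma, beta, eps=1e-5):
    """Normalize each channel of x over its spatial dims, then apply a
    style-specific scale (gamma) and shift (beta).

    x: (C, H, W) feature map; gamma, beta: (C,) parameters for one style.
    """
    mean = x.mean(axis=(1, 2), keepdims=True)
    var = x.var(axis=(1, 2), keepdims=True)
    x_hat = (x - mean) / np.sqrt(var + eps)
    return gamma[:, None, None] * x_hat + beta[:, None, None]


def blend_styles(params, weights):
    """Convex combination of per-style parameters.

    params: (S, C) array, one row per style.
    weights: (S,) non-negative weights that sum to 1.
    """
    weights = np.asarray(weights, dtype=float)
    if np.any(weights < 0) or not np.isclose(weights.sum(), 1.0):
        raise ValueError("weights must be non-negative and sum to 1")
    return weights @ params


# Hypothetical setup: 3 styles, 4 channels, random stand-ins for learned values.
rng = np.random.default_rng(0)
num_styles, channels = 3, 4
gammas = rng.normal(1.0, 0.1, size=(num_styles, channels))
betas = rng.normal(0.0, 0.1, size=(num_styles, channels))
features = rng.normal(size=(channels, 8, 8))

# Mix the first two styles equally and ignore the third.
weights = [0.5, 0.5, 0.0]
gamma_mix = blend_styles(gammas, weights)
beta_mix = blend_styles(betas, weights)
stylized = conditional_instance_norm(features, gamma_mix, beta_mix)
```

In this sketch the convolutional weights would be shared across styles and only the `(gamma, beta)` rows differ, which is what lets one network cover many paintings; a one-hot `weights` vector recovers a single learned style.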

citation-role summary

background: 1 · method: 1

citation-polarity summary

background: 1 · use method: 1

representative citing papers

MAST: Mask-Guided Attention Mass Allocation for Training-Free Multi-Style Transfer

cs.CV · 2026-04-14 · unverdicted · novelty 7.0

MAST is a mask-guided attention allocation method that enables artifact-free multi-style transfer in diffusion models by anchoring layout, distributing attention mass, scaling sharpness, and injecting details.

Diffusion Models Beat GANs on Image Synthesis

cs.LG · 2021-05-11 · accept · novelty 7.0

Diffusion models with architecture improvements and classifier guidance achieve superior FID scores to GANs on unconditional and conditional ImageNet image synthesis.

CanonCGT: Reference-Based Color Grading via Canonical Pivot Representation

cs.CV · 2026-06-01 · unverdicted · novelty 6.0

CanonCGT introduces a canonical pivot representation and dual-phase training (DP-CGT) for stable, photorealistic reference-based color grading that outperforms prior methods in consistency.

The Override Gap: A Magnitude Account of Knowledge Conflict Failure in Hypernetwork-Based Instant LLM Adaptation

cs.LG · 2026-04-26 · conditional · novelty 6.0 · 2 refs

Knowledge conflicts in hypernetwork LLM adaptation stem from constant adapter margins losing to frequency-dependent pretrained margins; selective layer boosting and conflict-aware triggering raise deep-conflict accuracy to 71-72.5% on Gemma-2B and Mistral-7B.

Head-Pose-Aware Visual Speech Recognition with FiLM Modulation

cs.CV · 2026-05-30 · unverdicted · novelty 5.0

HP-VSR-ResFiLM adds a single residual FiLM modulation block conditioned on head pose to a CNN visual encoder, yielding WER of 25.0% on LRS2 and 33.2% on LRS3 under standard training conditions.

Principles and Practice of Deep Representation Learning: or a Mathematical Theory of Memory

cs.LG · 2026-06-04 · unverdicted · novelty 3.0

The book presents principles from optimization and information theory to explain deep network architectures and enable new interpretable models.

citing papers explorer

Showing 6 of 6 citing papers.

MAST: Mask-Guided Attention Mass Allocation for Training-Free Multi-Style Transfer

cs.CV · 2026-04-14 · unverdicted · none · ref 7

Diffusion Models Beat GANs on Image Synthesis

cs.LG · 2021-05-11 · accept · none · ref 16

CanonCGT: Reference-Based Color Grading via Canonical Pivot Representation

cs.CV · 2026-06-01 · unverdicted · none · ref 10 · internal anchor

The Override Gap: A Magnitude Account of Knowledge Conflict Failure in Hypernetwork-Based Instant LLM Adaptation

cs.LG · 2026-04-26 · conditional · none · ref 8 · 2 links

Head-Pose-Aware Visual Speech Recognition with FiLM Modulation

cs.CV · 2026-05-30 · unverdicted · none · ref 36 · internal anchor

Principles and Practice of Deep Representation Learning: or a Mathematical Theory of Memory

cs.LG · 2026-06-04 · unverdicted · none · ref 23 · internal anchor
