Unsupervised Image to Sequence Translation with Canvas-Drawer Networks
read the original abstract
Encoding images as a series of high-level constructs, such as brush strokes or discrete shapes, can often be key to both human and machine understanding. In many cases, however, data is only available in pixel form. We present a method for generating images directly in a high-level domain (e.g. brush strokes), without the need for real pairwise data. Specifically, we train a "canvas" network to imitate the mapping of high-level constructs to pixels, followed by a high-level "drawing" network which is optimized through this mapping towards solving a desired image recreation or translation task. We successfully discover sequential vector representations of symbols, large sketches, and 3D objects, utilizing only pixel data. We display applications of our method in image segmentation, and present several ablation studies comparing various configurations.
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
PortraVec: Image-Based Portrait Vectorization with Text-Guided Manipulation
PortraVec converts portrait images to vector sketches via attention-aware offset sampling for structure and region-based parameter freezing for text-guided local edits, claiming better consistency and controllability ...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.