PortraVec: Image-Based Portrait Vectorization with Text-Guided Manipulation

Dandan Long; Ruihui Li; Ying Liu; Yiqi Liang

arxiv: 2410.04182 · v2 · submitted 2024-10-05 · 💻 cs.CV

PortraVec: Image-Based Portrait Vectorization with Text-Guided Manipulation

Yiqi Liang , Ying Liu , Dandan Long , Ruihui Li This is my paper

Pith reviewed 2026-05-23 19:37 UTC · model grok-4.3

classification 💻 cs.CV

keywords portrait vectorizationtext-guided editingvector sketchesimage to vectorsemantic manipulationface structure preservationlocal editing

0 comments

The pith

PortraVec turns portrait photos into vector sketches editable by text while preserving facial structure.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents PortraVec as a way to convert pixel portraits into vector sketches that accept text instructions for local changes. Prior vector methods lose face wholeness or fine details and offer no semantic control. The method splits into an image-guided stage that samples offsets to lock structure and fix deviations, then a manipulation stage that freezes parameters in regions to change only selected parts. If the modules work as described, the output vectors stay coherent globally yet respond to text for targeted edits. Experiments position it ahead of existing approaches on consistency, fidelity, and controllability.

Core claim

PortraVec converts pixel-based portrait images into vector sketches via a two-stage image-guided generation module that employs Attention-aware Offset Sampling to capture face structure while correcting detail deviations, paired with a text-guided manipulation module that uses Region-based Parameter Freezing to enable local semantic editing while maintaining global consistency.

What carries the argument

Attention-aware Offset Sampling for structure capture and correction plus Region-based Parameter Freezing for selective local edits.

If this is right

Vector outputs retain better structural consistency than prior vectorization techniques.
Text instructions change only targeted facial regions without affecting the rest of the sketch.
Generated vectors show higher visual fidelity to the input image.
The approach supports semantic controllability not available in existing methods.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same sampling and freezing pattern could apply to non-portrait images if structure detection generalizes.
Design tools might adopt the output vectors for faster iteration on client-specific edits.
Integration with existing text-to-image models could allow mixed pixel-vector workflows.

Load-bearing premise

The two modules capture facial integrity and support local edits without introducing artifacts or losing global coherence.

What would settle it

Quantitative or visual comparison on a held-out portrait set where text edits produce measurable drops in facial landmark alignment or introduce visible artifacts relative to baselines.

Figures

Figures reproduced from arXiv: 2410.04182 by Dandan Long, Ruihui Li, Ying Liu, Yiqi Liang.

**Figure 2.** Figure 2: Editing the brush style on SVGs. Our method gen [PITH_FULL_IMAGE:figures/full_fig_p002_2.png] view at source ↗

**Figure 4.** Figure 4: More details on VectorPD. The top path represents [PITH_FULL_IMAGE:figures/full_fig_p003_4.png] view at source ↗

**Figure 3.** Figure 3: An illustration of our framework. Given a tar [PITH_FULL_IMAGE:figures/full_fig_p003_3.png] view at source ↗

**Figure 5.** Figure 5: The optimization iteration process reflects two-round optimization mechanism of VectorPD. [PITH_FULL_IMAGE:figures/full_fig_p004_5.png] view at source ↗

**Figure 9.** Figure 9: Different stroke selection methods in the first [PITH_FULL_IMAGE:figures/full_fig_p005_9.png] view at source ↗

**Figure 10.** Figure 10: Trade-off between facial strokes Nf and contour strokes Nc. As shown in the sketch, Nf increases and the face of the portrait sketch becomes more detailed. As Nc increases, the contour of the portrait sketch becomes more complete. In [PITH_FULL_IMAGE:figures/full_fig_p005_10.png] view at source ↗

**Figure 11.** Figure 11: Portrait sketching results and comparisons. From left to right, the images showcase the results from Virtual Sketching, [PITH_FULL_IMAGE:figures/full_fig_p006_11.png] view at source ↗

**Figure 12.** Figure 12: Comparison of portrait sketches at different levels of abstraction. For the woman on the left, the top result is generated [PITH_FULL_IMAGE:figures/full_fig_p007_12.png] view at source ↗

read the original abstract

While portrait sketch generation is a special task in sketch synthesis, most existing methods are pixel-based, limiting their interpretability and editability. With the rise of vector generation techniques, representing sketches using vector elements may provide more flexible manipulation. However, due to the overlapping nature of vector graphics and the coarse detail modeling, existing vectorization methods struggle to capture facial integrity and fine-grained details, and lack semantic control. To address these issues, we propose PortraVec, a framework for converting pixel-based portrait images into vector sketches with text control. Specifically, we propose a two-stage image-guided generation module using Attention-aware Offset Sampling to capture face structure while correcting detail deviations, and a text-guided manipulation module based on Region-based Parameter Freezing to enable local semantic editing while maintaining global consistency. Experiments show that PortraVec achieves superior structural consistency, visual fidelity, and semantic controllability compared to state-of-the-art methods.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

PortraVec pairs two new modules for portrait vectorization with text edits, but the abstract supplies zero numbers or comparisons to support the superiority claims.

read the letter

The paper's core contribution is a two-stage setup for turning portrait photos into vector sketches that can be locally edited via text. The first stage uses Attention-aware Offset Sampling to pull structure from the image while fixing detail deviations. The second uses Region-based Parameter Freezing so text prompts can change one area without breaking global coherence. That combination targets a real pain point in vector methods: overlapping strokes and weak semantic control on faces. The framing is straightforward and the modules are described clearly enough that someone could try to reimplement the ideas. Credit for focusing on a narrow but common use case instead of claiming broad advances in vector graphics. The abstract does not include any equations, so there is no circularity issue there. The main weakness is that every performance claim rests on the two modules working exactly as stated, yet the text gives no metrics, no baselines, no ablations, and no failure examples. The stress-test note is accurate: if either module adds artifacts or fails to isolate edits, the headline gains do not follow, and nothing in the provided material lets us check. This is aimed at people already working on sketch vectorization or portrait editing tools. A reader in that niche might pick up the module designs for inspiration, but the lack of evidence makes it hard to judge whether the approach actually improves on prior work. I would not cite it from the abstract alone. It could merit peer review if the full manuscript contains proper experiments and comparisons, but the current version is too thin to evaluate on its own terms.

Referee Report

2 major / 1 minor

Summary. The paper proposes PortraVec, a framework for converting pixel-based portrait images into vector sketches with text-guided manipulation. It introduces a two-stage image-guided generation module using Attention-aware Offset Sampling to capture face structure and correct deviations, and a text-guided manipulation module based on Region-based Parameter Freezing to enable local semantic editing while maintaining global consistency. The abstract claims that experiments demonstrate superior structural consistency, visual fidelity, and semantic controllability compared to state-of-the-art methods.

Significance. If the proposed modules prove effective as described, the work could contribute to more interpretable and editable vector representations for portraits, addressing limitations in existing vectorization methods regarding facial integrity and semantic control.

major comments (2)

[Abstract] Abstract: The central claim that 'Experiments show that PortraVec achieves superior structural consistency, visual fidelity, and semantic controllability compared to state-of-the-art methods' is asserted without any quantitative metrics, baseline comparisons, ablation studies, dataset details, error bars, or failure cases, rendering the efficacy of the two modules unevaluable.
[Abstract] Abstract (modules description): The load-bearing assumption that Attention-aware Offset Sampling successfully captures facial integrity while correcting deviations and that Region-based Parameter Freezing enables local text edits without introducing artifacts or losing global coherence is not supported by any isolated quantitative validation or component-wise analysis.

minor comments (1)

[Abstract] Abstract: Consider adding a sentence specifying the evaluation metrics (e.g., FID, LPIPS, or vector-specific measures) and datasets used to ground the superiority claims.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the review and the opportunity to clarify the presentation of our results. We agree that the abstract would benefit from greater specificity regarding the supporting evidence and will revise it to better summarize the quantitative evaluations, ablations, and dataset details already present in the full manuscript.

read point-by-point responses

Referee: [Abstract] Abstract: The central claim that 'Experiments show that PortraVec achieves superior structural consistency, visual fidelity, and semantic controllability compared to state-of-the-art methods' is asserted without any quantitative metrics, baseline comparisons, ablation studies, dataset details, error bars, or failure cases, rendering the efficacy of the two modules unevaluable.

Authors: The abstract is a high-level summary constrained by length. The full manuscript (Section 4) provides the requested details: quantitative metrics (e.g., LPIPS, SSIM, FID for fidelity and consistency), comparisons against multiple SOTA baselines, ablation studies on both modules, dataset information (portrait images from standard benchmarks with train/test splits), error bars from repeated runs, and analysis of failure cases. We will revise the abstract to incorporate key quantitative highlights and dataset references to improve evaluability. revision: yes
Referee: [Abstract] Abstract (modules description): The load-bearing assumption that Attention-aware Offset Sampling successfully captures facial integrity while correcting deviations and that Region-based Parameter Freezing enables local text edits without introducing artifacts or losing global coherence is not supported by any isolated quantitative validation or component-wise analysis.

Authors: The manuscript contains component-wise ablations (Section 4.3) that isolate Attention-aware Offset Sampling (quantified via structure preservation metrics before/after offset correction) and Region-based Parameter Freezing (measured by local edit accuracy vs. global coherence scores, with artifact analysis). These directly validate the modules' contributions. We will expand cross-references from the abstract to these ablations and consider adding further isolated metrics in a revised version. revision: partial

Circularity Check

0 steps flagged

No circularity; empirical method proposal with external validation

full rationale

The paper describes a proposed framework consisting of an image-guided generation module and a text-guided manipulation module. No equations, parameter fits, predictions, or self-citations appear in the abstract or described content that reduce any claimed result to its own inputs by construction. Superiority is asserted via experiments against state-of-the-art methods, rendering the work self-contained against external benchmarks rather than internally referential.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 2 invented entities

The central claim depends on the unverified effectiveness of two newly named modules and on standard assumptions that deep generative models can be guided by attention and parameter freezing without side effects. No free parameters, axioms, or invented physical entities are stated in the abstract.

axioms (1)

domain assumption Deep generative models for images can be conditioned on both image and text inputs while preserving structural integrity.
Implicit in the two-stage generation and manipulation pipeline described in the abstract.

invented entities (2)

Attention-aware Offset Sampling no independent evidence
purpose: Capture face structure while correcting detail deviations in vector generation
New module introduced to address limitations of existing vectorization methods.
Region-based Parameter Freezing no independent evidence
purpose: Enable local semantic editing via text while maintaining global consistency
New module introduced for text-guided manipulation.

pith-pipeline@v0.9.0 · 5690 in / 1412 out tokens · 20448 ms · 2026-05-23T19:37:10.613549+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

43 extracted references · 43 canonical work pages · 5 internal anchors

[1]

Berger, I.; Shamir, A.; Mahler, M.; Carter, E.; and Hodgins, J. 2013. Style and abstraction in portrait sketching. ACM Transactions on Graphics (TOG), 32(4): 1--12

work page 2013
[2]

Bessmeltsev, M.; and Solomon, J. 2019. Vectorization of line drawings via polyvector fields. ACM Transactions on Graphics (TOG), 38(1): 1--12

work page 2019
[3]

Biederman, I.; and Ju, G. 1988. Surface versus edge-based determinants of visual recognition. Cognitive psychology, 20(1): 38--64

work page 1988
[4]

DeCarlo, D.; Finkelstein, A.; Rusinkiewicz, S.; and Santella, A. 2003. Suggestive contours for conveying shape. In ACM SIGGRAPH 2003 Papers, 848--855

work page 2003
[5]

Ding, L.; and Goshtasby, A. 2001. On the Canny edge detector. Pattern recognition, 34(3): 721--725

work page 2001
[6]

E.; Yamins, D

Fan, J. E.; Yamins, D. L.; and Turk-Browne, N. B. 2018. Common object representations for visual production and recognition. Cognitive science, 42(8): 2670--2698

work page 2018
[7]

Frans, K.; and Cheng, C.-Y. 2018. Unsupervised image to sequence translation with canvas-drawer networks. arXiv preprint arXiv:1809.08340

work page internal anchor Pith review Pith/arXiv arXiv 2018
[8]

A.; Ecker, A

Gatys, L. A.; Ecker, A. S.; and Bethge, M. 2016. Image style transfer using convolutional neural networks. In Proceedings of the IEEE conference on computer vision and pattern recognition, 2414--2423

work page 2016
[9]

W.; Pont, S

Gryaditskaya, Y.; Sypesteyn, M.; Hoftijzer, J. W.; Pont, S. C.; Durand, F.; and Bousseau, A. 2019. OpenSketch: a richly-annotated dataset of product design sketches. ACM Trans. Graph., 38(6): 232--1

work page 2019
[10]

Hertzmann, A. 2020. Why do line drawings work? a realism hypothesis. Perception, 49(4): 439--451

work page 2020
[11]

Huang, Z.; Heng, W.; and Zhou, S. 2019. Learning to paint with model-based deep reinforcement learning. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 8709--8718

work page 2019
[12]

Huang, Z.; Peng, Y.; Hibino, T.; Zhao, C.; Xie, H.; Fukusato, T.; and Miyata, K. 2022. dualface: Two-stage drawing guidance for freehand portrait sketching. Computational Visual Media, 8: 63--77

work page 2022
[13]

Johnson, J.; Alahi, A.; and Fei-Fei, L. 2016. Perceptual losses for real-time style transfer and super-resolution. In Computer Vision--ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part II 14, 694--711. Springer

work page 2016
[14]

Karras, T.; Aila, T.; Laine, S.; and Lehtinen, J. 2017. Progressive growing of gans for improved quality, stability, and variation. arXiv preprint arXiv:1710.10196

work page internal anchor Pith review Pith/arXiv arXiv 2017
[15]

Kazemi, V.; and Sullivan, J. 2014. One millisecond face alignment with an ensemble of regression trees. In Proceedings of the IEEE conference on computer vision and pattern recognition, 1867--1874

work page 2014
[16]

Lee, C.-H.; Liu, Z.; Wu, L.; and Luo, P. 2020. Maskgan: Towards diverse and interactive facial image manipulation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 5549--5558

work page 2020
[17]

Li, C.; and Wand, M. 2016. Combining markov random fields and convolutional neural networks for image synthesis. In Proceedings of the IEEE conference on computer vision and pattern recognition, 2479--2486

work page 2016
[18]

Li, H.; and Mould, D. 2011. Structure-preserving stippling by priority-based error diffusion. In Proceedings of Graphics Interface 2011, 127--134

work page 2011
[19]

Li, M.; Lin, Z.; Mech, R.; Yumer, E.; and Ramanan, D. 2019. Photo-sketching: Inferring contour drawings from images. In 2019 IEEE Winter Conference on Applications of Computer Vision (WACV), 1403--1412. IEEE

work page 2019
[20]

Li, T.-M.; Luk \'a c , M.; Gharbi, M.; and Ragan-Kelley, J. 2020. Differentiable vector graphics rasterization for editing and learning. ACM Transactions on Graphics (TOG), 39(6): 1--15

work page 2020
[21]

Ma, X.; Zhou, Y.; Xu, X.; Sun, B.; Filev, V.; Orlov, N.; Fu, Y.; and Shi, H. 2022. Towards layer-wise image vectorization. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 16314--16323

work page 2022
[22]

Meng, M.; Zhao, M.; and Zhu, S.-C. 2010. Artistic paper-cut of human portraits. In Proceedings of the 18th ACM international conference on Multimedia, 931--934

work page 2010
[23]

Mo, H.; Simo-Serra, E.; Gao, C.; Zou, C.; and Wang, R. 2021. General virtual sketching framework for vector line art. ACM Transactions on Graphics (TOG), 40(4): 1--14

work page 2021
[24]

FaceShop: Deep Sketch-based Face Image Editing

Portenier, T.; Hu, Q.; Szabo, A.; Bigdeli, S. A.; Favaro, P.; and Zwicker, M. 2018. Faceshop: Deep sketch-based face image editing. arXiv preprint arXiv:1804.08972

work page internal anchor Pith review Pith/arXiv arXiv 2018
[25]

W.; Hallacy, C.; Ramesh, A.; Goh, G.; Agarwal, S.; Sastry, G.; Askell, A.; Mishkin, P.; Clark, J.; et al

Radford, A.; Kim, J. W.; Hallacy, C.; Ramesh, A.; Goh, G.; Agarwal, S.; Sastry, G.; Askell, A.; Mishkin, P.; Clark, J.; et al. 2021. Learning transferable visual models from natural language supervision. In International conference on machine learning, 8748--8763. PMLR

work page 2021
[26]

Reddy, P.; Gharbi, M.; Lukac, M.; and Mitra, N. J. 2021. Im2vec: Synthesizing vector graphics without vector supervision. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 7342--7351

work page 2021
[27]

L.; and Lai, Y.-K

Rosin, P. L.; and Lai, Y.-K. 2018. Watercolour rendering of portraits. In Image and Video Technology: PSIVT 2017 International Workshops, Wuhan, China, November 20-24, 2017, Revised Selected Papers 8, 268--282. Springer

work page 2018
[28]

Shao, H.; Weng, X.; and He, S. 2017. Functional organization of the face-sensitive areas in human occipital-temporal cortex. Neuroimage, 157: 129--143

work page 2017
[29]

Shen, I.-C.; and Chen, B.-Y. 2021. Clipgen: A deep generative model for clipart vectorization and synthesis. IEEE Transactions on Visualization and Computer Graphics, 28(12): 4211--4224

work page 2021
[30]

Simo-Serra, E.; Iizuka, S.; and Ishikawa, H. 2018. Mastering sketching: adversarial augmentation for structured prediction. ACM Transactions on Graphics (TOG), 37(1): 1--13

work page 2018
[31]

Simonyan, K.; and Zisserman, A. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556

work page internal anchor Pith review Pith/arXiv arXiv 2014
[32]

Tian, Y.; and Ha, D. 2022. Modern evolution strategies for creativity: Fitting concrete images and abstract concepts. In Artificial Intelligence in Music, Sound, Art and Design: 11th International Conference, EvoMUSART 2022, Held as Part of EvoStar 2022, Madrid, Spain, April 20--22, 2022, Proceedings, 275--291. Springer

work page 2022
[33]

Ulyanov, D.; Lebedev, V.; Vedaldi, A.; and Lempitsky, V. 2016. Texture networks: Feed-forward synthesis of textures and stylized images. arXiv preprint arXiv:1603.03417

work page internal anchor Pith review Pith/arXiv arXiv 2016
[34]

Y.; Bachmann, R

Vinker, Y.; Pajouheshgar, E.; Bo, J. Y.; Bachmann, R. C.; Bermano, A. H.; Cohen-Or, D.; Zamir, A.; and Shamir, A. 2022. Clipasso: Semantically-aware object sketching. ACM Transactions on Graphics (TOG), 41(4): 1--11

work page 2022
[35]

Wang, A.; Ren, M.; and Zemel, R. 2021. Sketchembednet: Learning novel concepts by imitating drawings. In International Conference on Machine Learning, 10870--10881. PMLR

work page 2021
[36]

P.; Hunter, A.; and Greig, D

Wang, T.; Collomosse, J. P.; Hunter, A.; and Greig, D. 2013. Learnable stroke models for example-based portrait painting. In British Machine Vision Conference (BMVC)

work page 2013
[37]

C.; Sheikh, H

Wang, Z.; Bovik, A. C.; Sheikh, H. R.; and Simoncelli, E. P. 2004. Image quality assessment: from error visibility to structural similarity. IEEE transactions on image processing, 13(4): 600--612

work page 2004
[38]

E.; and Olsen, S

Winnem \"o ller, H.; Kyprianidis, J. E.; and Olsen, S. C. 2012. XDoG: An eXtended difference-of-Gaussians compendium including advanced image stylization. Computers & Graphics, 36(6): 740--753

work page 2012
[39]

M.; Yin, Q.; Song, Y.-Z.; Xiang, T.; and Wang, L

Xu, P.; Hospedales, T. M.; Yin, Q.; Song, Y.-Z.; Xiang, T.; and Wang, L. 2022. Deep learning for free-hand sketch: A survey. IEEE transactions on pattern analysis and machine intelligence, 45(1): 285--312

work page 2022
[40]

Xu, X.; Xie, M.; Miao, P.; Qu, W.; Xiao, W.; Zhang, H.; Liu, X.; and Wong, T.-T. 2019. Perceptual-aware sketch simplification based on integrated VGG layers. IEEE transactions on visualization and computer graphics, 27(1): 178--189

work page 2019
[41]

A.; Shechtman, E.; and Wang, O

Zhang, R.; Isola, P.; Efros, A. A.; Shechtman, E.; and Wang, O. 2018. The unreasonable effectiveness of deep features as a perceptual metric. In Proceedings of the IEEE conference on computer vision and pattern recognition, 586--595

work page 2018
[42]

, " * write output.state after.block = add.period write newline

ENTRY address archivePrefix author booktitle chapter edition editor eid eprint howpublished institution isbn journal key month note number organization pages publisher school series title type volume year label extra.label sort.label short.list INTEGERS output.state before.all mid.sentence after.sentence after.block FUNCTION init.state.consts #0 'before.a...

work page
[43]

write newline

" write newline "" before.all 'output.state := FUNCTION n.dashify 't := "" t empty not t #1 #1 substring "-" = t #1 #2 substring "--" = not "--" * t #2 global.max substring 't := t #1 #1 substring "-" = "-" * t #2 global.max substring 't := while if t #1 #1 substring * t #2 global.max substring 't := if while FUNCTION word.in bbl.in capitalize " " * FUNCT...

work page

[1] [1]

Berger, I.; Shamir, A.; Mahler, M.; Carter, E.; and Hodgins, J. 2013. Style and abstraction in portrait sketching. ACM Transactions on Graphics (TOG), 32(4): 1--12

work page 2013

[2] [2]

Bessmeltsev, M.; and Solomon, J. 2019. Vectorization of line drawings via polyvector fields. ACM Transactions on Graphics (TOG), 38(1): 1--12

work page 2019

[3] [3]

Biederman, I.; and Ju, G. 1988. Surface versus edge-based determinants of visual recognition. Cognitive psychology, 20(1): 38--64

work page 1988

[4] [4]

DeCarlo, D.; Finkelstein, A.; Rusinkiewicz, S.; and Santella, A. 2003. Suggestive contours for conveying shape. In ACM SIGGRAPH 2003 Papers, 848--855

work page 2003

[5] [5]

Ding, L.; and Goshtasby, A. 2001. On the Canny edge detector. Pattern recognition, 34(3): 721--725

work page 2001

[6] [6]

E.; Yamins, D

Fan, J. E.; Yamins, D. L.; and Turk-Browne, N. B. 2018. Common object representations for visual production and recognition. Cognitive science, 42(8): 2670--2698

work page 2018

[7] [7]

Frans, K.; and Cheng, C.-Y. 2018. Unsupervised image to sequence translation with canvas-drawer networks. arXiv preprint arXiv:1809.08340

work page internal anchor Pith review Pith/arXiv arXiv 2018

[8] [8]

A.; Ecker, A

Gatys, L. A.; Ecker, A. S.; and Bethge, M. 2016. Image style transfer using convolutional neural networks. In Proceedings of the IEEE conference on computer vision and pattern recognition, 2414--2423

work page 2016

[9] [9]

W.; Pont, S

Gryaditskaya, Y.; Sypesteyn, M.; Hoftijzer, J. W.; Pont, S. C.; Durand, F.; and Bousseau, A. 2019. OpenSketch: a richly-annotated dataset of product design sketches. ACM Trans. Graph., 38(6): 232--1

work page 2019

[10] [10]

Hertzmann, A. 2020. Why do line drawings work? a realism hypothesis. Perception, 49(4): 439--451

work page 2020

[11] [11]

Huang, Z.; Heng, W.; and Zhou, S. 2019. Learning to paint with model-based deep reinforcement learning. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 8709--8718

work page 2019

[12] [12]

Huang, Z.; Peng, Y.; Hibino, T.; Zhao, C.; Xie, H.; Fukusato, T.; and Miyata, K. 2022. dualface: Two-stage drawing guidance for freehand portrait sketching. Computational Visual Media, 8: 63--77

work page 2022

[13] [13]

Johnson, J.; Alahi, A.; and Fei-Fei, L. 2016. Perceptual losses for real-time style transfer and super-resolution. In Computer Vision--ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part II 14, 694--711. Springer

work page 2016

[14] [14]

Karras, T.; Aila, T.; Laine, S.; and Lehtinen, J. 2017. Progressive growing of gans for improved quality, stability, and variation. arXiv preprint arXiv:1710.10196

work page internal anchor Pith review Pith/arXiv arXiv 2017

[15] [15]

Kazemi, V.; and Sullivan, J. 2014. One millisecond face alignment with an ensemble of regression trees. In Proceedings of the IEEE conference on computer vision and pattern recognition, 1867--1874

work page 2014

[16] [16]

Lee, C.-H.; Liu, Z.; Wu, L.; and Luo, P. 2020. Maskgan: Towards diverse and interactive facial image manipulation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 5549--5558

work page 2020

[17] [17]

Li, C.; and Wand, M. 2016. Combining markov random fields and convolutional neural networks for image synthesis. In Proceedings of the IEEE conference on computer vision and pattern recognition, 2479--2486

work page 2016

[18] [18]

Li, H.; and Mould, D. 2011. Structure-preserving stippling by priority-based error diffusion. In Proceedings of Graphics Interface 2011, 127--134

work page 2011

[19] [19]

Li, M.; Lin, Z.; Mech, R.; Yumer, E.; and Ramanan, D. 2019. Photo-sketching: Inferring contour drawings from images. In 2019 IEEE Winter Conference on Applications of Computer Vision (WACV), 1403--1412. IEEE

work page 2019

[20] [20]

Li, T.-M.; Luk \'a c , M.; Gharbi, M.; and Ragan-Kelley, J. 2020. Differentiable vector graphics rasterization for editing and learning. ACM Transactions on Graphics (TOG), 39(6): 1--15

work page 2020

[21] [21]

Ma, X.; Zhou, Y.; Xu, X.; Sun, B.; Filev, V.; Orlov, N.; Fu, Y.; and Shi, H. 2022. Towards layer-wise image vectorization. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 16314--16323

work page 2022

[22] [22]

Meng, M.; Zhao, M.; and Zhu, S.-C. 2010. Artistic paper-cut of human portraits. In Proceedings of the 18th ACM international conference on Multimedia, 931--934

work page 2010

[23] [23]

Mo, H.; Simo-Serra, E.; Gao, C.; Zou, C.; and Wang, R. 2021. General virtual sketching framework for vector line art. ACM Transactions on Graphics (TOG), 40(4): 1--14

work page 2021

[24] [24]

FaceShop: Deep Sketch-based Face Image Editing

Portenier, T.; Hu, Q.; Szabo, A.; Bigdeli, S. A.; Favaro, P.; and Zwicker, M. 2018. Faceshop: Deep sketch-based face image editing. arXiv preprint arXiv:1804.08972

work page internal anchor Pith review Pith/arXiv arXiv 2018

[25] [25]

W.; Hallacy, C.; Ramesh, A.; Goh, G.; Agarwal, S.; Sastry, G.; Askell, A.; Mishkin, P.; Clark, J.; et al

Radford, A.; Kim, J. W.; Hallacy, C.; Ramesh, A.; Goh, G.; Agarwal, S.; Sastry, G.; Askell, A.; Mishkin, P.; Clark, J.; et al. 2021. Learning transferable visual models from natural language supervision. In International conference on machine learning, 8748--8763. PMLR

work page 2021

[26] [26]

Reddy, P.; Gharbi, M.; Lukac, M.; and Mitra, N. J. 2021. Im2vec: Synthesizing vector graphics without vector supervision. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 7342--7351

work page 2021

[27] [27]

L.; and Lai, Y.-K

Rosin, P. L.; and Lai, Y.-K. 2018. Watercolour rendering of portraits. In Image and Video Technology: PSIVT 2017 International Workshops, Wuhan, China, November 20-24, 2017, Revised Selected Papers 8, 268--282. Springer

work page 2018

[28] [28]

Shao, H.; Weng, X.; and He, S. 2017. Functional organization of the face-sensitive areas in human occipital-temporal cortex. Neuroimage, 157: 129--143

work page 2017

[29] [29]

Shen, I.-C.; and Chen, B.-Y. 2021. Clipgen: A deep generative model for clipart vectorization and synthesis. IEEE Transactions on Visualization and Computer Graphics, 28(12): 4211--4224

work page 2021

[30] [30]

Simo-Serra, E.; Iizuka, S.; and Ishikawa, H. 2018. Mastering sketching: adversarial augmentation for structured prediction. ACM Transactions on Graphics (TOG), 37(1): 1--13

work page 2018

[31] [31]

Simonyan, K.; and Zisserman, A. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556

work page internal anchor Pith review Pith/arXiv arXiv 2014

[32] [32]

Tian, Y.; and Ha, D. 2022. Modern evolution strategies for creativity: Fitting concrete images and abstract concepts. In Artificial Intelligence in Music, Sound, Art and Design: 11th International Conference, EvoMUSART 2022, Held as Part of EvoStar 2022, Madrid, Spain, April 20--22, 2022, Proceedings, 275--291. Springer

work page 2022

[33] [33]

Ulyanov, D.; Lebedev, V.; Vedaldi, A.; and Lempitsky, V. 2016. Texture networks: Feed-forward synthesis of textures and stylized images. arXiv preprint arXiv:1603.03417

work page internal anchor Pith review Pith/arXiv arXiv 2016

[34] [34]

Y.; Bachmann, R

Vinker, Y.; Pajouheshgar, E.; Bo, J. Y.; Bachmann, R. C.; Bermano, A. H.; Cohen-Or, D.; Zamir, A.; and Shamir, A. 2022. Clipasso: Semantically-aware object sketching. ACM Transactions on Graphics (TOG), 41(4): 1--11

work page 2022

[35] [35]

Wang, A.; Ren, M.; and Zemel, R. 2021. Sketchembednet: Learning novel concepts by imitating drawings. In International Conference on Machine Learning, 10870--10881. PMLR

work page 2021

[36] [36]

P.; Hunter, A.; and Greig, D

Wang, T.; Collomosse, J. P.; Hunter, A.; and Greig, D. 2013. Learnable stroke models for example-based portrait painting. In British Machine Vision Conference (BMVC)

work page 2013

[37] [37]

C.; Sheikh, H

Wang, Z.; Bovik, A. C.; Sheikh, H. R.; and Simoncelli, E. P. 2004. Image quality assessment: from error visibility to structural similarity. IEEE transactions on image processing, 13(4): 600--612

work page 2004

[38] [38]

E.; and Olsen, S

Winnem \"o ller, H.; Kyprianidis, J. E.; and Olsen, S. C. 2012. XDoG: An eXtended difference-of-Gaussians compendium including advanced image stylization. Computers & Graphics, 36(6): 740--753

work page 2012

[39] [39]

M.; Yin, Q.; Song, Y.-Z.; Xiang, T.; and Wang, L

Xu, P.; Hospedales, T. M.; Yin, Q.; Song, Y.-Z.; Xiang, T.; and Wang, L. 2022. Deep learning for free-hand sketch: A survey. IEEE transactions on pattern analysis and machine intelligence, 45(1): 285--312

work page 2022

[40] [40]

Xu, X.; Xie, M.; Miao, P.; Qu, W.; Xiao, W.; Zhang, H.; Liu, X.; and Wong, T.-T. 2019. Perceptual-aware sketch simplification based on integrated VGG layers. IEEE transactions on visualization and computer graphics, 27(1): 178--189

work page 2019

[41] [41]

A.; Shechtman, E.; and Wang, O

Zhang, R.; Isola, P.; Efros, A. A.; Shechtman, E.; and Wang, O. 2018. The unreasonable effectiveness of deep features as a perceptual metric. In Proceedings of the IEEE conference on computer vision and pattern recognition, 586--595

work page 2018

[42] [42]

, " * write output.state after.block = add.period write newline

ENTRY address archivePrefix author booktitle chapter edition editor eid eprint howpublished institution isbn journal key month note number organization pages publisher school series title type volume year label extra.label sort.label short.list INTEGERS output.state before.all mid.sentence after.sentence after.block FUNCTION init.state.consts #0 'before.a...

work page

[43] [43]

write newline

" write newline "" before.all 'output.state := FUNCTION n.dashify 't := "" t empty not t #1 #1 substring "-" = t #1 #2 substring "--" = not "--" * t #2 global.max substring 't := t #1 #1 substring "-" = "-" * t #2 global.max substring 't := while if t #1 #1 substring * t #2 global.max substring 't := if while FUNCTION word.in bbl.in capitalize " " * FUNCT...

work page