pith. sign in

arxiv: 1812.01083 · v1 · pith:L4AHJ3UZnew · submitted 2018-12-03 · 💻 cs.CL

A System for Automated Image Editing from Natural Language Commands

classification 💻 cs.CL
keywords imageeditingcommandslanguagenaturalactionscorpusentities
0
0 comments X
read the original abstract

This work presents the task of modifying images in an image editing program using natural language written commands. We utilize a corpus of over 6000 image edit text requests to alter real world images collected via crowdsourcing. A novel framework composed of actions and entities to map a user's natural language request to executable commands in an image editing program is described. We resolve previously labeled annotator disagreement through a voting process and complete annotation of the corpus. We experimented with different machine learning models and found that the LSTM, the SVM, and the bidirectional LSTM-CRF joint models are the best to detect image editing actions and associated entities in a given utterance.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.