A. Bhunia, Ayan Das, U. Muhammad, Yongxin Yang, Timothy M. Hospedales, T. Xiang, Yulia Gryaditskaya, Yi-Zhe Song
{"title":"Pixelor: a competitive sketching AI agent. so you think you can sketch?","authors":"A. Bhunia, Ayan Das, U. Muhammad, Yongxin Yang, Timothy M. Hospedales, T. Xiang, Yulia Gryaditskaya, Yi-Zhe Song","doi":"10.1145/3414685.3417840","DOIUrl":null,"url":null,"abstract":"We present the first competitive drawing agent Pixelor that exhibits human-level \nperformance at a Pictionary-like sketching game, where the participant whose sketch is recognized first is a winner. Our AI agent can autonomously \nsketch a given visual concept, and achieve a recognizable rendition as quickly \nor faster than a human competitor. The key to victory for the agent’s goal \nis to learn the optimal stroke sequencing strategies that generate the most \nrecognizable and distinguishable strokes first. Training Pixelor is done in two \nsteps. First, we infer the stroke order that maximizes early recognizability of \nhuman training sketches. Second, this order is used to supervise the training \nof a sequence-to-sequence stroke generator. Our key technical contributions \nare a tractable search of the exponential space of orderings using neural \nsorting; and an improved Seq2Seq Wasserstein (S2S-WAE) generator that \nuses an optimal-transport loss to accommodate the multi-modal nature of the \noptimal stroke distribution. Our analysis shows that Pixelor is better than the \nhuman players of the Quick, Draw! game, under both AI and human judging \nof early recognition. To analyze the impact of human competitors’ strategies, \nwe conducted a further human study with participants being given unlimited \nthinking time and training in early recognizability by feedback from an AI \njudge. The study shows that humans do gradually improve their strategies \nwith training, but overall Pixelor still matches human performance. The code \nand the dataset are available at http://sketchx.ai/pixelor.","PeriodicalId":7121,"journal":{"name":"ACM Trans. Graph.","volume":"5 1","pages":"166:1-166:15"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"23","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Trans. Graph.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3414685.3417840","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 23
Abstract
We present the first competitive drawing agent Pixelor that exhibits human-level
performance at a Pictionary-like sketching game, where the participant whose sketch is recognized first is a winner. Our AI agent can autonomously
sketch a given visual concept, and achieve a recognizable rendition as quickly
or faster than a human competitor. The key to victory for the agent’s goal
is to learn the optimal stroke sequencing strategies that generate the most
recognizable and distinguishable strokes first. Training Pixelor is done in two
steps. First, we infer the stroke order that maximizes early recognizability of
human training sketches. Second, this order is used to supervise the training
of a sequence-to-sequence stroke generator. Our key technical contributions
are a tractable search of the exponential space of orderings using neural
sorting; and an improved Seq2Seq Wasserstein (S2S-WAE) generator that
uses an optimal-transport loss to accommodate the multi-modal nature of the
optimal stroke distribution. Our analysis shows that Pixelor is better than the
human players of the Quick, Draw! game, under both AI and human judging
of early recognition. To analyze the impact of human competitors’ strategies,
we conducted a further human study with participants being given unlimited
thinking time and training in early recognizability by feedback from an AI
judge. The study shows that humans do gradually improve their strategies
with training, but overall Pixelor still matches human performance. The code
and the dataset are available at http://sketchx.ai/pixelor.