AP: Artificial Programming

Summit on Advances in Programming Languages Pub Date : 2017-06-06 DOI:10.4230/LIPIcs.SNAPL.2017.16

Rishabh Singh, Pushmeet Kohli

{"title":"AP: Artificial Programming","authors":"Rishabh Singh, Pushmeet Kohli","doi":"10.4230/LIPIcs.SNAPL.2017.16","DOIUrl":null,"url":null,"abstract":"The ability to automatically discover a program consistent with a given user intent (specification) is the holy grail of Computer Science. While significant progress has been made on the so-called problem of Program Synthesis, a number of challenges remain; particularly for the case of synthesizing richer and larger programs. This is in large part due to the difficulty of search over the space of programs. In this paper, we argue that the above-mentioned challenge can be tackled by learning synthesizers automatically from a large amount of training data. We present a first step in this direction by describing our novel synthesis approach based on two neural architectures for tackling the two key challenges of Learning to understand partial input-output specifications and Learning to search programs. The first neural architecture called the Spec Encoder computes a continuous representation of the specification, whereas the second neural architecture called the Program Generator incrementally constructs programs in a hypothesis space that is conditioned by the specification vector. The key idea of the approach is to train these architectures using a large set of (spec,P) pairs, where P denotes a program sampled from the DSL L and spec denotes the corresponding specification satisfied by P. We demonstrate the effectiveness of our approach on two preliminary instantiations. The first instantiation, called Neural FlashFill, corresponds to the domain of string manipulation programs similar to that of FlashFill. The second domain considers string transformation programs consisting of composition of API functions. We show that a neural system is able to perform quite well in learning a large majority of programs from few input-output examples. We believe this new approach will not only dramatically expand the applicability and effectiveness of Program Synthesis, but also would lead to the coming together of the Program Synthesis and Machine Learning research disciplines.","PeriodicalId":231548,"journal":{"name":"Summit on Advances in Programming Languages","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Summit on Advances in Programming Languages","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4230/LIPIcs.SNAPL.2017.16","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 13

Abstract

The ability to automatically discover a program consistent with a given user intent (specification) is the holy grail of Computer Science. While significant progress has been made on the so-called problem of Program Synthesis, a number of challenges remain; particularly for the case of synthesizing richer and larger programs. This is in large part due to the difficulty of search over the space of programs. In this paper, we argue that the above-mentioned challenge can be tackled by learning synthesizers automatically from a large amount of training data. We present a first step in this direction by describing our novel synthesis approach based on two neural architectures for tackling the two key challenges of Learning to understand partial input-output specifications and Learning to search programs. The first neural architecture called the Spec Encoder computes a continuous representation of the specification, whereas the second neural architecture called the Program Generator incrementally constructs programs in a hypothesis space that is conditioned by the specification vector. The key idea of the approach is to train these architectures using a large set of (spec,P) pairs, where P denotes a program sampled from the DSL L and spec denotes the corresponding specification satisfied by P. We demonstrate the effectiveness of our approach on two preliminary instantiations. The first instantiation, called Neural FlashFill, corresponds to the domain of string manipulation programs similar to that of FlashFill. The second domain considers string transformation programs consisting of composition of API functions. We show that a neural system is able to perform quite well in learning a large majority of programs from few input-output examples. We believe this new approach will not only dramatically expand the applicability and effectiveness of Program Synthesis, but also would lead to the coming together of the Program Synthesis and Machine Learning research disciplines.

查看原文本刊更多论文

美联社:人工编程

自动发现与给定用户意图(规范)一致的程序的能力是计算机科学的圣杯。虽然在所谓的方案综合问题上取得了重大进展，但仍然存在一些挑战;特别是对于合成更丰富和更大的程序的情况。这在很大程度上是由于程序空间搜索的困难。在本文中，我们认为上述挑战可以通过从大量训练数据中自动学习合成器来解决。我们在这个方向上迈出了第一步，描述了我们基于两种神经体系结构的新型合成方法，用于解决学习理解部分输入输出规范和学习搜索程序这两个关键挑战。第一个被称为规范编码器的神经结构计算规范的连续表示，而第二个被称为程序生成器的神经结构在一个由规范向量限定的假设空间中增量地构建程序。该方法的关键思想是使用大量(spec,P)对来训练这些体系结构，其中P表示从DSL L中采样的程序，spec表示P满足的相应规范。我们在两个初步实例上证明了我们的方法的有效性。第一个实例称为Neural FlashFill，对应于类似于FlashFill的字符串处理程序的领域。第二个领域考虑由API函数组合组成的字符串转换程序。我们表明，神经系统能够从很少的输入输出示例中学习绝大多数程序。我们相信这种新方法不仅会极大地扩展程序综合的适用性和有效性，而且还会导致程序综合和机器学习研究学科的结合。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Summit on Advances in Programming Languages

自引率

0.00%

发文量