Artem Popov, Dmitrii Orekhov, Denis V. Litvinov, N. Korolev, Gleb Morgachev
{"title":"Time-Efficient Code Completion Model for the R Programming Language","authors":"Artem Popov, Dmitrii Orekhov, Denis V. Litvinov, N. Korolev, Gleb Morgachev","doi":"10.18653/v1/2021.nlp4prog-1.4","DOIUrl":null,"url":null,"abstract":"In this paper we present a deep learning code completion model for the R language. We introduce several techniques to utilize language modeling based architecture in the code completion task. With these techniques, the model requires low resources, but still achieves high quality. We also present an evaluation dataset for the R language completion task. Our dataset contains multiple autocompletion usage contexts that provides robust validation results. The dataset is publicly available.","PeriodicalId":435990,"journal":{"name":"Proceedings of the 1st Workshop on Natural Language Processing for Programming (NLP4Prog 2021)","volume":"99 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 1st Workshop on Natural Language Processing for Programming (NLP4Prog 2021)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18653/v1/2021.nlp4prog-1.4","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
In this paper we present a deep learning code completion model for the R language. We introduce several techniques to utilize language modeling based architecture in the code completion task. With these techniques, the model requires low resources, but still achieves high quality. We also present an evaluation dataset for the R language completion task. Our dataset contains multiple autocompletion usage contexts that provides robust validation results. The dataset is publicly available.