LoGenText: Automatically Generating Logging Texts Using Neural Machine Translation

2022 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER) Pub Date : 2022-03-01 DOI:10.1109/saner53432.2022.00051

Zishuo Ding, Heng Li, Weiyi Shang

{"title":"LoGenText: Automatically Generating Logging Texts Using Neural Machine Translation","authors":"Zishuo Ding, Heng Li, Weiyi Shang","doi":"10.1109/saner53432.2022.00051","DOIUrl":null,"url":null,"abstract":"The textual descriptions in logging statements (i.e., logging texts) are printed during system executions and exposed to multiple stakeholders including developers, operators, users, and regulatory authorities. Writing proper logging texts is an important but often challenging task for developers. However, despite extensive research on automated logging suggestions, research on suggesting logging texts rarely exists. In this paper, we present LoGenText, an automated approach that generates logging texts by translating the related source code into short textual descriptions. LoGenText takes the preceding source code of a logging text as the input and considers other context information such as the location of the logging statement, to automatically generate the logging text using neural machine translation models. We evaluate LoGenText on 10 open-source projects, and compare the automatically generated logging texts with the developer-inserted logging texts in the source code. We find that LoGenText generates logging texts that achieve BLEU scores of 23.3 to 41.8 and ROUGE-L scores of 42.1 to 53.9, which outperforms the state-of-the-art approach by a large margin. In addition, we perform a human evaluation involving 42 participants, which further demonstrates the quality of the logging texts generated by LoGenText. Our work is an important step towards automated generation of logging statements, which can potentially save developers' efforts and improve the quality of software logging.","PeriodicalId":437520,"journal":{"name":"2022 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/saner53432.2022.00051","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 12

Abstract

The textual descriptions in logging statements (i.e., logging texts) are printed during system executions and exposed to multiple stakeholders including developers, operators, users, and regulatory authorities. Writing proper logging texts is an important but often challenging task for developers. However, despite extensive research on automated logging suggestions, research on suggesting logging texts rarely exists. In this paper, we present LoGenText, an automated approach that generates logging texts by translating the related source code into short textual descriptions. LoGenText takes the preceding source code of a logging text as the input and considers other context information such as the location of the logging statement, to automatically generate the logging text using neural machine translation models. We evaluate LoGenText on 10 open-source projects, and compare the automatically generated logging texts with the developer-inserted logging texts in the source code. We find that LoGenText generates logging texts that achieve BLEU scores of 23.3 to 41.8 and ROUGE-L scores of 42.1 to 53.9, which outperforms the state-of-the-art approach by a large margin. In addition, we perform a human evaluation involving 42 participants, which further demonstrates the quality of the logging texts generated by LoGenText. Our work is an important step towards automated generation of logging statements, which can potentially save developers' efforts and improve the quality of software logging.

查看原文本刊更多论文

LoGenText:使用神经机器翻译自动生成日志文本

日志记录语句中的文本描述(即日志记录文本)在系统执行期间打印出来，并公开给多个涉众，包括开发人员、操作人员、用户和监管机构。编写适当的日志文本对开发人员来说是一项重要但往往具有挑战性的任务。然而，尽管对自动日志建议进行了广泛的研究，但对建议日志文本的研究很少。在本文中，我们介绍了LoGenText，这是一种通过将相关的源代码翻译成简短的文本描述来生成日志文本的自动化方法。LoGenText将前面的日志文本源代码作为输入，并考虑其他上下文信息，例如日志语句的位置，从而使用神经机器翻译模型自动生成日志文本。我们在10个开源项目中评估了LoGenText，并将自动生成的日志文本与开发人员在源代码中插入的日志文本进行了比较。我们发现，LoGenText生成的日志文本的BLEU分数达到23.3到41.8,ROUGE-L分数达到42.1到53.9，大大优于最先进的方法。此外，我们执行了一个涉及42名参与者的人工评估，这进一步证明了LoGenText生成的日志文本的质量。我们的工作是朝着自动生成日志语句迈出的重要一步，它可以潜在地节省开发人员的工作并提高软件日志的质量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER)

自引率

0.00%

发文量