What information about code snippets is available in different software-related documents? An exploratory study

2017 IEEE 24th International Conference on Software Analysis, Evolution and Reengineering (SANER). Pub Date: 2017-02-01. DOI: 10.1109/SANER.2017.7884638

Preetha Chatterjee, Manziba Akanda Nishi, Kostadin Damevski, Vinay Augustine, L. Pollock, Nicholas A. Kraft

Citations: 28

Abstract

Large corpora of software-related documents are available on the Web, and these documents offer a unique opportunity to learn from what developers say or ask about the code snippets they are discussing. For example, the natural language in a bug report provides information about what is not functioning properly in a particular code snippet. Previous research has mined information about code snippets from bug reports, emails, and Q&A forums. This paper describes an exploratory study of the kinds of information that are embedded in different software-related documents. The goal of the study is to gain insight into the potential value and difficulty of mining the natural language text associated with the code snippets found in a variety of software-related documents, including blog posts, API documentation, code reviews, and public chats.


