Feature Location Using Crowd-Based Screencasts

2018 IEEE/ACM 15th International Conference on Mining Software Repositories (MSR) Pub Date : 2018-05-28 DOI:10.1145/3196398.3196439

P. Moslehi, Bram Adams, J. Rilling

{"title":"Feature Location Using Crowd-Based Screencasts","authors":"P. Moslehi, Bram Adams, J. Rilling","doi":"10.1145/3196398.3196439","DOIUrl":null,"url":null,"abstract":"Crowd-based multi-media documents such as screencasts have emerged as a source for documenting requirements of agile software projects. For example, screencasts can describe buggy scenarios of a software product, or present new features in an upcoming release. Unfortunately, the binary format of videos makes traceability between the video content and other related software artifacts (e.g., source code, bug reports) difficult. In this paper, we propose an LDA-based feature location approach that takes as input a set of screencasts (i.e., the GUI text and/or spoken words) to establish traceability link between the features described in the screencasts and source code fragments implementing them. We report on a case study conducted on 10 WordPress screencasts, to evaluate the applicability of our approach in linking these screencasts to their relevant source code artifacts. We find that the approach is able to successfully pinpoint relevant source code files at the top 10 hits using speech and GUI text. We also found that term frequency rebalancing can reduce noise and yield more precise results.","PeriodicalId":6639,"journal":{"name":"2018 IEEE/ACM 15th International Conference on Mining Software Repositories (MSR)","volume":"63 1","pages":"192-202"},"PeriodicalIF":0.0000,"publicationDate":"2018-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE/ACM 15th International Conference on Mining Software Repositories (MSR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3196398.3196439","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 19

Abstract

Crowd-based multi-media documents such as screencasts have emerged as a source for documenting requirements of agile software projects. For example, screencasts can describe buggy scenarios of a software product, or present new features in an upcoming release. Unfortunately, the binary format of videos makes traceability between the video content and other related software artifacts (e.g., source code, bug reports) difficult. In this paper, we propose an LDA-based feature location approach that takes as input a set of screencasts (i.e., the GUI text and/or spoken words) to establish traceability link between the features described in the screencasts and source code fragments implementing them. We report on a case study conducted on 10 WordPress screencasts, to evaluate the applicability of our approach in linking these screencasts to their relevant source code artifacts. We find that the approach is able to successfully pinpoint relevant source code files at the top 10 hits using speech and GUI text. We also found that term frequency rebalancing can reduce noise and yield more precise results.

查看原文本刊更多论文

使用基于人群的屏幕录制功能定位

基于群体的多媒体文档(如屏幕视频)已经成为敏捷软件项目需求文档的来源。例如，屏幕视频可以描述软件产品的错误场景，或者在即将发布的版本中呈现新特性。不幸的是，视频的二进制格式使得视频内容和其他相关软件工件(例如，源代码、bug报告)之间的可追溯性变得困难。在本文中，我们提出了一种基于lda的特征定位方法，该方法将一组屏幕视频(即GUI文本和/或口语)作为输入，以在屏幕视频中描述的特征和实现它们的源代码片段之间建立可追溯性链接。我们报告了一个对10个WordPress屏幕视频进行的案例研究，以评估我们的方法在将这些屏幕视频与其相关源代码构件链接方面的适用性。我们发现，该方法能够使用语音和GUI文本成功地在前10个点击中找到相关的源代码文件。我们还发现，词频再平衡可以减少噪声并产生更精确的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 IEEE/ACM 15th International Conference on Mining Software Repositories (MSR)

自引率

0.00%

发文量