REquirements TRacing On target (RETRO) enhanced with an automated thesaurus builder: An empirical study

2013 7th International Workshop on Traceability in Emerging Forms of Software Engineering (TEFSE) Pub Date : 2013-05-19 DOI:10.1109/TEFSE.2013.6620156

Sandeep Pandanaboyana, S. Sridharan, Jesse Yannelli, J. Hayes

引用次数: 11

Abstract

Several techniques have been proposed to increase the performance of the tracing process, including use of a thesaurus. Some thesauri pre-exist and have been shown to improve the recall for some datasets. But the drawback is that they are manually generated by analysts based on study and analysis of the textual artifacts being traced. To alleviate that effort, we developed an application that accepts textual artifacts as input and generates a thesaurus dynamically, we call it Thesaurus Builder. We evaluated the performance of REquirements TRacing On target (RETRO) with a Thesaurus generated by Thesaurus Builder. We found that recall increased from 81.9% with no thesaurus to 87.18% when the dynamic thesaurus was used. We also found that Okapi weighting resulted in better recall and precision than TF-IDF weighting, but only precision was statistically significant.

查看原文本刊更多论文

使用自动化同义词库构建器增强的目标上的需求跟踪(RETRO):一项实证研究

已经提出了几种技术来提高跟踪过程的性能，包括使用同义词库。一些同义词典预先存在，并已被证明可以提高某些数据集的召回率。但是缺点是，它们是由分析人员根据对所跟踪的文本工件的研究和分析手工生成的。为了减轻这种工作量，我们开发了一个应用程序，它接受文本构件作为输入，并动态地生成同义词库，我们将其称为thesaurus Builder。我们使用由Thesaurus Builder生成的Thesaurus来评估需求跟踪目标(RETRO)的性能。我们发现，当使用动态词库时，召回率从无词库时的81.9%提高到87.18%。我们还发现，霍加皮加权比TF-IDF加权具有更好的查全率和查准率，但只有查准率具有统计学意义。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2013 7th International Workshop on Traceability in Emerging Forms of Software Engineering (TEFSE)

自引率

0.00%

发文量