- Book学术

发布求助

文献互助智能选刊最新文献

2019 IEEE International Conference on Software Maintenance and Evolution (ICSME) Pub Date : 2019-09-01 DOI:10.1109/ICSME.2019.00097

Reem S. Alsuhaibani

引用次数: 0

摘要

本文提出了一项实证研究，以评估使用马尔可夫链查找和预测源代码中发现的函数标识符的语法模式的有效性。这项研究使用了一个专门的词性标注器来标注从20个c++开源系统中提取的函数标识符。创建一个包含93K个带注释的唯一函数标识符的数据集用于分析。该分析包括使用一阶马尔可夫链对标识符名称的词性标签序列进行建模，并使用概率转移矩阵。模型的评估是通过对整个带注释的函数标识符名称集进行10倍交叉验证。初步结果具有较好的适用性和准确性。该模型预测测试集上最常见词性标签的准确率中值为91.53%。未来的工作包括利用这些结果创建源代码功能标识符的质量评估和自动修复工具。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Applying Markov Models to Identify Grammatical Patterns of Function Identifiers

An empirical study to evaluate the effectiveness of using Markov chains in finding and predicting the grammatical patterns of function identifiers found in source code is presented. The study uses a specialized part-of-speech tagger to annotate function identifiers extracted from 20 C++ open-source systems. A dataset of 93K annotated unique function identifiers is created for analysis. The analysis includes using a first-order Markov chain to model part of speech tag sequences of the identifier names, using a probability transition matrix. The evaluation of the model is via a 10-fold cross validation over the entire set of annotated function identifier names. The preliminary results are promising in terms of applicability and accuracy. The model achieved an accuracy median of 91.53% in predicting the most common part of speech tag on a test set. Future work involves utilizing these results in creating a quality assessment and automatic repairing tool for source code function identifiers.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2019 IEEE International Conference on Software Maintenance and Evolution (ICSME)

自引率

0.00%

发文量