Bi-Directional LSTM-Based Search Engine for Source Code Retrieval

Nazia Bibi, T. Rana, A. Maqbool
{"title":"Bi-Directional LSTM-Based Search Engine for Source Code Retrieval","authors":"Nazia Bibi, T. Rana, A. Maqbool","doi":"10.1109/ComTech57708.2023.10165061","DOIUrl":null,"url":null,"abstract":"The code retrieval process looks for the most relevant code fragments. For this purpose, many code-search methods support natural-language queries. But still, there are problems with figuring out the pros and cons of each technique and choosing the best one for the task of code search. The proposed approach employs Word2Vec embedding and Bi- LSTM, a neural network model, to identify query title similarities with the existing dataset to search the code fragment. A novel model named BiD-CSE (Bi-Direction LSTM Code Search Engine) is proposed which generates source code recommendations. The proposed system assists programmers in locating appropriate implementations of sample code based on requirements specified in the query. The evaluation of the proposed framework is performed using three datasets. BiD-CSE collects and analyzes the input query which is then parsed to find the best match in the dataset. BiD-CSE finds the desired code by matching variables, functions, doc-strings, and comments. The BiD-CSE model is implemented using a web-based platform that allows users to enter a query and obtain the top ten most effective results. Evaluation results show that the performance of the model is better than the current baseline approaches.","PeriodicalId":203804,"journal":{"name":"2023 International Conference on Communication Technologies (ComTech)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Communication Technologies (ComTech)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ComTech57708.2023.10165061","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The code retrieval process looks for the most relevant code fragments. For this purpose, many code-search methods support natural-language queries. But still, there are problems with figuring out the pros and cons of each technique and choosing the best one for the task of code search. The proposed approach employs Word2Vec embedding and Bi- LSTM, a neural network model, to identify query title similarities with the existing dataset to search the code fragment. A novel model named BiD-CSE (Bi-Direction LSTM Code Search Engine) is proposed which generates source code recommendations. The proposed system assists programmers in locating appropriate implementations of sample code based on requirements specified in the query. The evaluation of the proposed framework is performed using three datasets. BiD-CSE collects and analyzes the input query which is then parsed to find the best match in the dataset. BiD-CSE finds the desired code by matching variables, functions, doc-strings, and comments. The BiD-CSE model is implemented using a web-based platform that allows users to enter a query and obtain the top ten most effective results. Evaluation results show that the performance of the model is better than the current baseline approaches.
基于lstm的双向源代码检索搜索引擎
代码检索过程查找最相关的代码片段。为此,许多代码搜索方法支持自然语言查询。但是,要找出每种技术的优点和缺点,并为代码搜索任务选择最佳技术,仍然存在一些问题。该方法采用Word2Vec嵌入和神经网络模型Bi- LSTM识别查询标题与现有数据集的相似度来搜索代码片段。提出了一种新的生成源代码推荐的模型——双向LSTM代码搜索引擎(BiD-CSE)。建议的系统帮助程序员根据查询中指定的需求定位示例代码的适当实现。使用三个数据集对提出的框架进行评估。BiD-CSE收集并分析输入查询,然后对输入查询进行解析,以在数据集中找到最佳匹配。BiD-CSE通过匹配变量、函数、文档字符串和注释来查找所需的代码。BiD-CSE模型采用基于web的平台实现,用户只需输入查询,即可获得最有效的前10个结果。评估结果表明,该模型的性能优于现有的基线方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信