Machine Learning Techniques to Understand Partial and Implied Data Values for Conversion of Natural Language to SQL Queries on HPCC Systems

A. Prasad, G. Shobha, N. Deepamala, Sourabh S Badhya, Y. Yashwanth, Shetty Rohan
{"title":"Machine Learning Techniques to Understand Partial and Implied Data Values for Conversion of Natural Language to SQL Queries on HPCC Systems","authors":"A. Prasad, G. Shobha, N. Deepamala, Sourabh S Badhya, Y. Yashwanth, Shetty Rohan","doi":"10.1109/CSITSS47250.2019.9031035","DOIUrl":null,"url":null,"abstract":"There has been an exponential growth in the amount of data produced daily in recent years, owing to the widespread use of technology. Ease of access to this data is of utmost importance in this day and age. Although in the past, use of structured query languages to query the data stored in the hard-drive was satisfactory, use of natural language to access the data is more desired. This paper talks about mapping partial data values to its corresponding data values and attributes in the schema to enrich the natural language query. Machine learning algorithm, Long Short Term Memory, preceded by an Embedding layer has been used on the HPCC Systems platform. The resulting model gives an accuracy of 99.6%, while its implementation with the experimental setup gives an accuracy of 92%.","PeriodicalId":236457,"journal":{"name":"2019 4th International Conference on Computational Systems and Information Technology for Sustainable Solution (CSITSS)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 4th International Conference on Computational Systems and Information Technology for Sustainable Solution (CSITSS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSITSS47250.2019.9031035","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

There has been an exponential growth in the amount of data produced daily in recent years, owing to the widespread use of technology. Ease of access to this data is of utmost importance in this day and age. Although in the past, use of structured query languages to query the data stored in the hard-drive was satisfactory, use of natural language to access the data is more desired. This paper talks about mapping partial data values to its corresponding data values and attributes in the schema to enrich the natural language query. Machine learning algorithm, Long Short Term Memory, preceded by an Embedding layer has been used on the HPCC Systems platform. The resulting model gives an accuracy of 99.6%, while its implementation with the experimental setup gives an accuracy of 92%.
理解HPCC系统上自然语言到SQL查询转换的部分和隐含数据值的机器学习技术
近年来,由于技术的广泛使用,每天产生的数据量呈指数级增长。在这个时代,方便地访问这些数据是至关重要的。虽然在过去,使用结构化查询语言来查询存储在硬盘中的数据是令人满意的,但使用自然语言来访问数据是更可取的。本文讨论了将部分数据值映射到模式中相应的数据值和属性,以丰富自然语言查询。机器学习算法,长短期记忆,前嵌入层已在HPCC系统平台上使用。所得模型的准确率为99.6%,而在实验装置上实现的准确率为92%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信