Bengali Informative Chatbot

Md. Kowsher, M. A. Alam, M. J. Uddin, Md. Rafiqul Islam, Nuruzzaman Pias, Abu Rayhan Md Saifullah
{"title":"Bengali Informative Chatbot","authors":"Md. Kowsher, M. A. Alam, M. J. Uddin, Md. Rafiqul Islam, Nuruzzaman Pias, Abu Rayhan Md Saifullah","doi":"10.1109/IC4ME247184.2019.9036585","DOIUrl":null,"url":null,"abstract":"Bengali Informative Chatbot (BIC) is an effective technique that helps a user to trace relevant information by Natural Language Processing (NLP). In this research paper, we introduce an algorithmic Bengali Informative Chatbot (BIC) based on information that is significant mathematically and statistically. This paper is demonstrated by two algorithms for finding out the lemmatization of Bengali words such as Trie and Dictionary Based Search by Removing Affix (DBSRA) as well as compared with Edit Distance for the exact lemmatization. We present the Bengali Anaphora resolution system using the Hobbs’ algorithm to get the correct expression of information. As the actions of chatbot replying algorithms, the TF-IDF and Cosine Similarity are developed to find out the accurate answer from the documents. In this study, we introduce a Bengali Language Toolkit (BLTK) and Bengali Language Expression (BRE) that make the easiest implication of our task. We have also developed Bengali root word’s corpus, synonym word’s corpus, stop word’s corpus and gathered 672 articles as questions and answers form the popular Bengali newspapers ‘The Daily Prothom Alo’ is our inserted information. For testing this system, we have created 19334 questions from the introduced information and got 97.22% accurate answer by proposed BIC.","PeriodicalId":368690,"journal":{"name":"2019 International Conference on Computer, Communication, Chemical, Materials and Electronic Engineering (IC4ME2)","volume":"222 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Computer, Communication, Chemical, Materials and Electronic Engineering (IC4ME2)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC4ME247184.2019.9036585","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Bengali Informative Chatbot (BIC) is an effective technique that helps a user to trace relevant information by Natural Language Processing (NLP). In this research paper, we introduce an algorithmic Bengali Informative Chatbot (BIC) based on information that is significant mathematically and statistically. This paper is demonstrated by two algorithms for finding out the lemmatization of Bengali words such as Trie and Dictionary Based Search by Removing Affix (DBSRA) as well as compared with Edit Distance for the exact lemmatization. We present the Bengali Anaphora resolution system using the Hobbs’ algorithm to get the correct expression of information. As the actions of chatbot replying algorithms, the TF-IDF and Cosine Similarity are developed to find out the accurate answer from the documents. In this study, we introduce a Bengali Language Toolkit (BLTK) and Bengali Language Expression (BRE) that make the easiest implication of our task. We have also developed Bengali root word’s corpus, synonym word’s corpus, stop word’s corpus and gathered 672 articles as questions and answers form the popular Bengali newspapers ‘The Daily Prothom Alo’ is our inserted information. For testing this system, we have created 19334 questions from the introduced information and got 97.22% accurate answer by proposed BIC.
孟加拉语信息聊天机器人
孟加拉语信息聊天机器人(BIC)是一种利用自然语言处理(NLP)帮助用户追踪相关信息的有效技术。在本文中,我们介绍了一种基于数学和统计意义的孟加拉语信息聊天机器人(BIC)算法。本文介绍了三列法(Trie)和基于字典的去词缀搜索法(DBSRA)这两种孟加拉语词的词序查找算法,并与编辑距离法(Edit Distance)进行了比较。本文提出了一种利用Hobbs算法求解孟加拉语回指的系统,以获得正确的信息表达。作为聊天机器人应答算法的动作,开发了TF-IDF和余弦相似度,从文档中找到准确的答案。在这项研究中,我们引入了一个孟加拉语工具包(BLTK)和孟加拉语表达(BRE),使我们的任务最简单的含义。我们还开发了孟加拉语词根词语料库、近义词语料库、停顿词语料库,并从孟加拉语流行报纸中收集了672篇文章作为问答,《每日问答》是我们的插入信息。在系统的测试中,我们从引入的信息中创建了19334个问题,并得到了97.22%的正确率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信