Improve Question Classification Genetic Algorithm Based Feature Selection and Convolution Neural Network

Asmaa Ahmed Shama
{"title":"Improve Question Classification Genetic Algorithm Based Feature Selection and Convolution Neural Network","authors":"Asmaa Ahmed Shama","doi":"10.29072/basjs.20240111","DOIUrl":null,"url":null,"abstract":"Natural Language Processing (NLP) approaches play a crucial role in classifying inquiries and comprehending human language in diverse applications. A Question Answering System (QAS) consists of three components are question processing, information retrieval, and answer selection. Question Answering Systems (QASs) are a distinct form of information retrieval. The most crucial aspect of QAS is deciding on the question type since it influences the other sections following. However, an important question-answering system requires a prominent question classification system. In the past, there are different methods to solve this problem, such as rule-based learning, and hybrid approaches. However, the problem with these methods is that the rules require a lot of effort to create and are very limited. In this study, the utilization of genetic algorithm and deep neural network techniques enhances the quality control problem-solving process. This research utilizes the UIUC dataset. This collection comprises 5452 questions designed for learning purposes and an additional 500 questions specifically intended for assessment. The suggested solution involves converting each query into a matrix, with each row representing the Word2vec of a word. Subsequently, a Genetic Algorithm (GA) is employed to identify the most optimal features. Ultimately, a Convolutional Neural Network is utilized for classification, yielding a remarkable accuracy of 98.2% in our experimentation with the question dataset.","PeriodicalId":326824,"journal":{"name":"BASRA JOURNAL OF SCIENCE","volume":"57 24","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BASRA JOURNAL OF SCIENCE","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.29072/basjs.20240111","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Natural Language Processing (NLP) approaches play a crucial role in classifying inquiries and comprehending human language in diverse applications. A Question Answering System (QAS) consists of three components are question processing, information retrieval, and answer selection. Question Answering Systems (QASs) are a distinct form of information retrieval. The most crucial aspect of QAS is deciding on the question type since it influences the other sections following. However, an important question-answering system requires a prominent question classification system. In the past, there are different methods to solve this problem, such as rule-based learning, and hybrid approaches. However, the problem with these methods is that the rules require a lot of effort to create and are very limited. In this study, the utilization of genetic algorithm and deep neural network techniques enhances the quality control problem-solving process. This research utilizes the UIUC dataset. This collection comprises 5452 questions designed for learning purposes and an additional 500 questions specifically intended for assessment. The suggested solution involves converting each query into a matrix, with each row representing the Word2vec of a word. Subsequently, a Genetic Algorithm (GA) is employed to identify the most optimal features. Ultimately, a Convolutional Neural Network is utilized for classification, yielding a remarkable accuracy of 98.2% in our experimentation with the question dataset.
改进问题分类 基于遗传算法的特征选择和卷积神经网络
自然语言处理(NLP)方法在各种应用中的查询分类和理解人类语言方面发挥着至关重要的作用。问题解答系统(QAS)由问题处理、信息检索和答案选择三个部分组成。问题解答系统(QAS)是信息检索的一种独特形式。问题解答系统最关键的环节是确定问题类型,因为这影响到后面的其他部分。然而,一个重要的问题解答系统需要一个突出的问题分类系统。过去,有不同的方法来解决这个问题,如基于规则的学习和混合方法。然而,这些方法的问题在于,规则的创建需要耗费大量精力,而且非常有限。在本研究中,遗传算法和深度神经网络技术的使用增强了质量控制问题的解决过程。本研究利用了 UIUC 数据集。该数据集包括 5452 道为学习目的而设计的问题和另外 500 道专门用于评估的问题。建议的解决方案包括将每个查询转换成矩阵,每一行代表一个单词的 Word2vec。随后,采用遗传算法(GA)来确定最佳特征。最后,利用卷积神经网络进行分类,在我们对问题数据集的实验中,准确率高达 98.2%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信