基于多目标优化特征选择方法的情感分析

Deeplakshmi Sachin Zingade, Rajesh Keshavrao Deshmukh, D. Kadam
{"title":"基于多目标优化特征选择方法的情感分析","authors":"Deeplakshmi Sachin Zingade, Rajesh Keshavrao Deshmukh, D. Kadam","doi":"10.1109/INCET57972.2023.10169912","DOIUrl":null,"url":null,"abstract":"Polarity categorization affects sentiment analysis. Categorization is a fundamental knowledge discovery challenge. Categorization accuracy depends on data quality. Hence, data must be preprocessed to obtain meaningful information. In real-world applications, data is enormous, and many properties are duplicates or useless. The classification depends on feature selection. It finds the best data representation qualities. Feature selection reduces model training time. When unnecessary attributes are eliminated, models learn better. Combinatorial optimization makes feature selection harder. Feature selection balances decreasing features and enhancing classification performance. We propose two multi-objective optimization techniques for the feature selection. The particle swarm optimization (PSO) and Krill Herd Algorithm (KHA) are applied for optimal feature selection. The proposed model consists of key steps such as review pre-processing, multi-objective optimization-based feature selection, and supervised classification. The performance of both PSO-based and KHA-based models is evaluated using the two sentiment analysis datasets. The results show the efficiency of both models in terms of precision, recall, accuracy, and F1-score parameters.","PeriodicalId":403008,"journal":{"name":"2023 4th International Conference for Emerging Technology (INCET)","volume":"143 ","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Sentiment Analysis using Multi-objective Optimization-based Feature Selection Approach\",\"authors\":\"Deeplakshmi Sachin Zingade, Rajesh Keshavrao Deshmukh, D. Kadam\",\"doi\":\"10.1109/INCET57972.2023.10169912\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Polarity categorization affects sentiment analysis. Categorization is a fundamental knowledge discovery challenge. Categorization accuracy depends on data quality. Hence, data must be preprocessed to obtain meaningful information. In real-world applications, data is enormous, and many properties are duplicates or useless. The classification depends on feature selection. It finds the best data representation qualities. Feature selection reduces model training time. When unnecessary attributes are eliminated, models learn better. Combinatorial optimization makes feature selection harder. Feature selection balances decreasing features and enhancing classification performance. We propose two multi-objective optimization techniques for the feature selection. The particle swarm optimization (PSO) and Krill Herd Algorithm (KHA) are applied for optimal feature selection. The proposed model consists of key steps such as review pre-processing, multi-objective optimization-based feature selection, and supervised classification. The performance of both PSO-based and KHA-based models is evaluated using the two sentiment analysis datasets. The results show the efficiency of both models in terms of precision, recall, accuracy, and F1-score parameters.\",\"PeriodicalId\":403008,\"journal\":{\"name\":\"2023 4th International Conference for Emerging Technology (INCET)\",\"volume\":\"143 \",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 4th International Conference for Emerging Technology (INCET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/INCET57972.2023.10169912\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 4th International Conference for Emerging Technology (INCET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INCET57972.2023.10169912","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

极性分类影响情感分析。分类是一个基本的知识发现挑战。分类的准确性取决于数据质量。因此,必须对数据进行预处理以获得有意义的信息。在真实的应用程序中,数据是巨大的,许多属性是重复的或无用的。分类依赖于特征选择。它找到了最佳的数据表示质量。特征选择减少了模型训练时间。当不必要的属性被消除时,模型学习得更好。组合优化使得特征选择更加困难。特征选择在减少特征和提高分类性能之间取得平衡。我们提出了两种多目标优化技术用于特征选择。采用粒子群算法(PSO)和磷虾群算法(KHA)进行特征选择。该模型包括评论预处理、基于多目标优化的特征选择和监督分类等关键步骤。使用两个情感分析数据集对基于pso和基于ha的模型的性能进行了评估。结果表明,两种模型在准确率、召回率、准确率和f1评分参数方面都是有效的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Sentiment Analysis using Multi-objective Optimization-based Feature Selection Approach
Polarity categorization affects sentiment analysis. Categorization is a fundamental knowledge discovery challenge. Categorization accuracy depends on data quality. Hence, data must be preprocessed to obtain meaningful information. In real-world applications, data is enormous, and many properties are duplicates or useless. The classification depends on feature selection. It finds the best data representation qualities. Feature selection reduces model training time. When unnecessary attributes are eliminated, models learn better. Combinatorial optimization makes feature selection harder. Feature selection balances decreasing features and enhancing classification performance. We propose two multi-objective optimization techniques for the feature selection. The particle swarm optimization (PSO) and Krill Herd Algorithm (KHA) are applied for optimal feature selection. The proposed model consists of key steps such as review pre-processing, multi-objective optimization-based feature selection, and supervised classification. The performance of both PSO-based and KHA-based models is evaluated using the two sentiment analysis datasets. The results show the efficiency of both models in terms of precision, recall, accuracy, and F1-score parameters.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信