Enhance the Performance of Independent Component Analysis for Text Classification by Using Particle Swarm Optimization

H. Shabat, N. Abbas
{"title":"Enhance the Performance of Independent Component Analysis for Text Classification by Using Particle Swarm Optimization","authors":"H. Shabat, N. Abbas","doi":"10.1109/ICOASE51841.2020.9436547","DOIUrl":null,"url":null,"abstract":"Independent component analysis is a statistical model that is used to separate a multivariate signal into additive components. Independent component analysis has gained much attention in recent years in the neural networks and signals processing fields. Several data mining applications with Independent component analysis have been considered, such as latent variable decomposition, analysis of text document data, detection of hidden signals in satellite imagery, and weather data mining. The conventional Independent component analysis search scheme is based on a gradient algorithm, which requires a predefined learning rate. Therefore, it cannot solve the convergence dilemma. To overwhelm the disadvantage, particle swarm optimization is employed in the ICA algorithm. In statistics, negentropy is used as a measure of distance to normality. The present study used a metaheuristic, particle swarm optimization algorithm that employs negentropy as a fitness function to enhance the performance of independent component analysis for the text classification model as one of the text mining applications. The proposed system was applied to a medical corpus, and two experiments were executed. Results show that the performance of the PSO-ICA algorithm is superior to the FastICA for text classification, where it achieves an overall F -measure of 89% for text classification compared with the FastICA algorithm, which provides 85% of an overall F -measure for text classification.","PeriodicalId":126112,"journal":{"name":"2020 International Conference on Advanced Science and Engineering (ICOASE)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Advanced Science and Engineering (ICOASE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOASE51841.2020.9436547","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Independent component analysis is a statistical model that is used to separate a multivariate signal into additive components. Independent component analysis has gained much attention in recent years in the neural networks and signals processing fields. Several data mining applications with Independent component analysis have been considered, such as latent variable decomposition, analysis of text document data, detection of hidden signals in satellite imagery, and weather data mining. The conventional Independent component analysis search scheme is based on a gradient algorithm, which requires a predefined learning rate. Therefore, it cannot solve the convergence dilemma. To overwhelm the disadvantage, particle swarm optimization is employed in the ICA algorithm. In statistics, negentropy is used as a measure of distance to normality. The present study used a metaheuristic, particle swarm optimization algorithm that employs negentropy as a fitness function to enhance the performance of independent component analysis for the text classification model as one of the text mining applications. The proposed system was applied to a medical corpus, and two experiments were executed. Results show that the performance of the PSO-ICA algorithm is superior to the FastICA for text classification, where it achieves an overall F -measure of 89% for text classification compared with the FastICA algorithm, which provides 85% of an overall F -measure for text classification.
利用粒子群算法提高文本分类中独立成分分析的性能
独立分量分析是一种统计模型,用于将多变量信号分离成可加成分。近年来,独立分量分析在神经网络和信号处理领域受到了广泛的关注。本文讨论了独立分量分析在数据挖掘中的应用,如潜在变量分解、文本文档数据分析、卫星图像中隐藏信号的检测和天气数据挖掘等。传统的独立分量分析搜索方案是基于梯度算法的,它需要一个预定义的学习率。因此,它不能解决收敛困境。为了克服这一缺点,在ICA算法中引入了粒子群算法。在统计学中,负熵被用作距离正态距离的度量。本研究采用元启发式粒子群优化算法,以负熵为适应度函数,提高文本分类模型的独立成分分析性能。将该系统应用于医学语料库,并进行了两次实验。结果表明,PSO-ICA算法在文本分类方面的性能优于FastICA算法,其文本分类的总体F值为89%,而FastICA算法的文本分类总体F值为85%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信