{"title":"Enhance the Performance of Independent Component Analysis for Text Classification by Using Particle Swarm Optimization","authors":"H. Shabat, N. Abbas","doi":"10.1109/ICOASE51841.2020.9436547","DOIUrl":null,"url":null,"abstract":"Independent component analysis is a statistical model that is used to separate a multivariate signal into additive components. Independent component analysis has gained much attention in recent years in the neural networks and signals processing fields. Several data mining applications with Independent component analysis have been considered, such as latent variable decomposition, analysis of text document data, detection of hidden signals in satellite imagery, and weather data mining. The conventional Independent component analysis search scheme is based on a gradient algorithm, which requires a predefined learning rate. Therefore, it cannot solve the convergence dilemma. To overwhelm the disadvantage, particle swarm optimization is employed in the ICA algorithm. In statistics, negentropy is used as a measure of distance to normality. The present study used a metaheuristic, particle swarm optimization algorithm that employs negentropy as a fitness function to enhance the performance of independent component analysis for the text classification model as one of the text mining applications. The proposed system was applied to a medical corpus, and two experiments were executed. Results show that the performance of the PSO-ICA algorithm is superior to the FastICA for text classification, where it achieves an overall F -measure of 89% for text classification compared with the FastICA algorithm, which provides 85% of an overall F -measure for text classification.","PeriodicalId":126112,"journal":{"name":"2020 International Conference on Advanced Science and Engineering (ICOASE)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Advanced Science and Engineering (ICOASE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOASE51841.2020.9436547","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Independent component analysis is a statistical model that is used to separate a multivariate signal into additive components. Independent component analysis has gained much attention in recent years in the neural networks and signals processing fields. Several data mining applications with Independent component analysis have been considered, such as latent variable decomposition, analysis of text document data, detection of hidden signals in satellite imagery, and weather data mining. The conventional Independent component analysis search scheme is based on a gradient algorithm, which requires a predefined learning rate. Therefore, it cannot solve the convergence dilemma. To overwhelm the disadvantage, particle swarm optimization is employed in the ICA algorithm. In statistics, negentropy is used as a measure of distance to normality. The present study used a metaheuristic, particle swarm optimization algorithm that employs negentropy as a fitness function to enhance the performance of independent component analysis for the text classification model as one of the text mining applications. The proposed system was applied to a medical corpus, and two experiments were executed. Results show that the performance of the PSO-ICA algorithm is superior to the FastICA for text classification, where it achieves an overall F -measure of 89% for text classification compared with the FastICA algorithm, which provides 85% of an overall F -measure for text classification.