{"title":"Multi-view weighted feature fusion with wavelet transform and CNN for enhanced CT image recognition","authors":"Zilong Zhou, Yue Yu, Chaoyang Song, Zhen Liu, Manman Shi, Jingxiang Zhang","doi":"10.3233/jifs-233373","DOIUrl":null,"url":null,"abstract":"Reducing noise in CT images and extracting key features are crucial for improving the accuracy of medical diagnoses, but it remains a challenging problem due to the complex characteristics of CT images and the limitations of existing methods. It is worth noting that multiple views can provide a richer representation of information compared to a single view, and the unique advantages of the wavelet transform in feature analysis. In this study, a novel Multi-View Weighted Feature Fusion algorithm called MVWF is proposed to address the challenge of enhancing CT image recognition utilizing wavelet transform and convolutional neural networks. In the proposed approach, the wavelet transform is employed to extract both detailed and primary features of CT images from two views, including high frequency and low frequency. To mitigate information loss, the source domain is also considered as a view within the multi-view structure. Furthermore, AlexNet is deployed to extract deeper features from the multi-view structure. Additionally, the MVWF algorithm introduces a balance factor to account for both specific information and global information in CT images. To accentuate significant multi-view features and reduce feature dimensionality, random forest is used to assess feature importance followed by weighted fusion. Finally, CT image recognition is accomplished using the SVM classifier. The performance of the MVWF algorithm has been compared with classical multi-view algorithms and common single-view methods on COVID-CT and SARS-COV-2 datasets. The experimental results indicate that an average improvement of 6.8% in CT image recognition accuracy can be achieved by utilizing the proposed algorithm. Particularly, the MVF algorithm and MVWF algorithm have attained AUC values of 0.9972 and 0.9982, respectively, under the SARS-COV-2 dataset, demonstrating outstanding recognition performance. The proposed algorithms can capture more robust and comprehensive high-quality feature representation by considering feature correlations across views and feature importance based on Multi-view.","PeriodicalId":54795,"journal":{"name":"Journal of Intelligent & Fuzzy Systems","volume":"9 3","pages":"0"},"PeriodicalIF":1.7000,"publicationDate":"2023-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Intelligent & Fuzzy Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3233/jifs-233373","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Reducing noise in CT images and extracting key features are crucial for improving the accuracy of medical diagnoses, but it remains a challenging problem due to the complex characteristics of CT images and the limitations of existing methods. It is worth noting that multiple views can provide a richer representation of information compared to a single view, and the unique advantages of the wavelet transform in feature analysis. In this study, a novel Multi-View Weighted Feature Fusion algorithm called MVWF is proposed to address the challenge of enhancing CT image recognition utilizing wavelet transform and convolutional neural networks. In the proposed approach, the wavelet transform is employed to extract both detailed and primary features of CT images from two views, including high frequency and low frequency. To mitigate information loss, the source domain is also considered as a view within the multi-view structure. Furthermore, AlexNet is deployed to extract deeper features from the multi-view structure. Additionally, the MVWF algorithm introduces a balance factor to account for both specific information and global information in CT images. To accentuate significant multi-view features and reduce feature dimensionality, random forest is used to assess feature importance followed by weighted fusion. Finally, CT image recognition is accomplished using the SVM classifier. The performance of the MVWF algorithm has been compared with classical multi-view algorithms and common single-view methods on COVID-CT and SARS-COV-2 datasets. The experimental results indicate that an average improvement of 6.8% in CT image recognition accuracy can be achieved by utilizing the proposed algorithm. Particularly, the MVF algorithm and MVWF algorithm have attained AUC values of 0.9972 and 0.9982, respectively, under the SARS-COV-2 dataset, demonstrating outstanding recognition performance. The proposed algorithms can capture more robust and comprehensive high-quality feature representation by considering feature correlations across views and feature importance based on Multi-view.
期刊介绍:
The purpose of the Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology is to foster advancements of knowledge and help disseminate results concerning recent applications and case studies in the areas of fuzzy logic, intelligent systems, and web-based applications among working professionals and professionals in education and research, covering a broad cross-section of technical disciplines.