Multimodal fish maw type recognition based on Wasserstein generative adversarial network combined with gradient penalty and spectral fusion

IF 4.3 2区 化学 Q1 SPECTROSCOPY
Hai Yin , Qihang Yang , Fangyuan Huang , Hongjie Li , Hui Wang , Huadan Zheng , Furong Huang
{"title":"Multimodal fish maw type recognition based on Wasserstein generative adversarial network combined with gradient penalty and spectral fusion","authors":"Hai Yin ,&nbsp;Qihang Yang ,&nbsp;Fangyuan Huang ,&nbsp;Hongjie Li ,&nbsp;Hui Wang ,&nbsp;Huadan Zheng ,&nbsp;Furong Huang","doi":"10.1016/j.saa.2024.125430","DOIUrl":null,"url":null,"abstract":"<div><div>There are many types of fish maw with significantly varying prices. The specific type directly affects its market value and medicinal efficacy. This paper proposes a fish maw type recognition method based on Wasserstein generative adversarial network combined with gradient penalty (WGAN-GP) and spectral fusion. By collecting Raman and near-infrared (NIR) spectral data of four types of fish maw (Beihai Male Fish Maw, Beihai Female Fish Maw, Yellow Croaker Fish Maw, and Red Mouth Croaker Fish Maw), we used WGAN-GP for data augmentation. The performance of three spectral fusion strategies (data layer, feature layer, and decision layer) was explored based on two one-dimensional convolutional neural network (1D-CNN) models. The results indicate that, after applying data augmentation and expanding the training set to 3,600 samples, the performances of the 1D-VGG (NIR), 1D-VGG (Raman), 1D-ResNet (NIR), and 1D-ResNet (Raman) models all reach optimal levels. The accuracies on the test set are improved by 15.48%, 13.10%, 1.19%, and 5.95%, respectively. Under different fusion strategies, the 1D-VGG (Raman)-1D-VGG (NIR) model at the feature layer and 1D-ResNet (Raman)(1.0)-1D-ResNet (NIR)(1.0) model at the decision layer achieved the same classification results. They exceeded other models in accuracy (98.21%), precision (98.27%), recall (98.21%), and F1-score (98.21%) on the test set. In summary, this study demonstrated the great potential of data enhancement and multimodal spectral data fusion in fish maw type identification, providing analytical tools for the development of fish maw detection equipment based on multimodal techniques.</div></div>","PeriodicalId":433,"journal":{"name":"Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy","volume":"327 ","pages":"Article 125430"},"PeriodicalIF":4.3000,"publicationDate":"2024-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1386142524015968","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SPECTROSCOPY","Score":null,"Total":0}
引用次数: 0

Abstract

There are many types of fish maw with significantly varying prices. The specific type directly affects its market value and medicinal efficacy. This paper proposes a fish maw type recognition method based on Wasserstein generative adversarial network combined with gradient penalty (WGAN-GP) and spectral fusion. By collecting Raman and near-infrared (NIR) spectral data of four types of fish maw (Beihai Male Fish Maw, Beihai Female Fish Maw, Yellow Croaker Fish Maw, and Red Mouth Croaker Fish Maw), we used WGAN-GP for data augmentation. The performance of three spectral fusion strategies (data layer, feature layer, and decision layer) was explored based on two one-dimensional convolutional neural network (1D-CNN) models. The results indicate that, after applying data augmentation and expanding the training set to 3,600 samples, the performances of the 1D-VGG (NIR), 1D-VGG (Raman), 1D-ResNet (NIR), and 1D-ResNet (Raman) models all reach optimal levels. The accuracies on the test set are improved by 15.48%, 13.10%, 1.19%, and 5.95%, respectively. Under different fusion strategies, the 1D-VGG (Raman)-1D-VGG (NIR) model at the feature layer and 1D-ResNet (Raman)(1.0)-1D-ResNet (NIR)(1.0) model at the decision layer achieved the same classification results. They exceeded other models in accuracy (98.21%), precision (98.27%), recall (98.21%), and F1-score (98.21%) on the test set. In summary, this study demonstrated the great potential of data enhancement and multimodal spectral data fusion in fish maw type identification, providing analytical tools for the development of fish maw detection equipment based on multimodal techniques.

Abstract Image

基于 Wasserstein 生成式对抗网络结合梯度惩罚和频谱融合的多模态鱼肚类型识别。
鱼肚的种类很多,价格也大不相同。具体种类直接影响其市场价值和药用功效。本文提出了一种基于 Wasserstein 成因对抗网络结合梯度惩罚(WGAN-GP)和光谱融合的鱼肚类型识别方法。通过收集四种鱼肚(北海雄鱼肚、北海雌鱼肚、黄花鱼鱼肚和红嘴大黄鱼鱼肚)的拉曼和近红外光谱数据,利用 WGAN-GP 进行数据增强。基于两个一维卷积神经网络(1D-CNN)模型,探讨了三种光谱融合策略(数据层、特征层和决策层)的性能。结果表明,在应用数据增强并将训练集扩展到 3,600 个样本后,1D-VGG(近红外)、1D-VGG(拉曼)、1D-ResNet(近红外)和 1D-ResNet(拉曼)模型的性能都达到了最佳水平。测试集上的准确率分别提高了 15.48%、13.10%、1.19% 和 5.95%。在不同的融合策略下,特征层的 1D-VGG (Raman)-1D-VGG (NIR) 模型和决策层的 1D-ResNet (Raman)(1.0)-1D-ResNet (NIR)(1.0) 模型取得了相同的分类结果。它们在测试集上的准确率(98.21%)、精确率(98.27%)、召回率(98.21%)和 F1 分数(98.21%)都超过了其他模型。总之,本研究证明了数据增强和多模态光谱数据融合在鱼肚类型识别中的巨大潜力,为开发基于多模态技术的鱼肚检测设备提供了分析工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
8.40
自引率
11.40%
发文量
1364
审稿时长
40 days
期刊介绍: Spectrochimica Acta, Part A: Molecular and Biomolecular Spectroscopy (SAA) is an interdisciplinary journal which spans from basic to applied aspects of optical spectroscopy in chemistry, medicine, biology, and materials science. The journal publishes original scientific papers that feature high-quality spectroscopic data and analysis. From the broad range of optical spectroscopies, the emphasis is on electronic, vibrational or rotational spectra of molecules, rather than on spectroscopy based on magnetic moments. Criteria for publication in SAA are novelty, uniqueness, and outstanding quality. Routine applications of spectroscopic techniques and computational methods are not appropriate. Topics of particular interest of Spectrochimica Acta Part A include, but are not limited to: Spectroscopy and dynamics of bioanalytical, biomedical, environmental, and atmospheric sciences, Novel experimental techniques or instrumentation for molecular spectroscopy, Novel theoretical and computational methods, Novel applications in photochemistry and photobiology, Novel interpretational approaches as well as advances in data analysis based on electronic or vibrational spectroscopy.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信