{"title":"[基于EfficientNet-B4模型的云南钉螺视觉智能识别模型构建]。","authors":"S Bai, J Zhou, Y Dong, J Zhang, L Shi, K Yang","doi":"10.16250/j.32.1915.2024194","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>To construct a visual intelligent recognition model for <i>Oncomelania hupensis robertsoni</i> in Yunnan Province based on the EfficientNet-B4 model, and to evaluate the impact of data augmentation methods and model hyperparameters on the recognition of <i>O. hupensis robertsoni</i>.</p><p><strong>Methods: </strong>A total of 400 <i>O. hupensis robertsoni</i> and 400 <i>Tricula</i> snails were collected from Yongsheng County, Yunnan Province in June 2024, and snail images were captured following identification and classification of 300 <i>O. hupensis robertsoni</i> and 300 <i>Tricula</i> snails. A total of 925 <i>O. hupensis robertsoni</i> images and 1 062 <i>Tricula</i> snail images were collected as a dataset and divided into a training set and a validation set at a ratio of 8:2, while 352 images captured from the remaining 100 <i>O. hupensis robertsoni</i> and 354 images from the remaining 100 <i>Tricula</i> snails served as an external test set. All acquired images were subjected to preprocessing, including cropping and resizing. Three data augmentation approaches were employed, including baseline, Mixup and Gaussian blurring, and model hyperparameters included two optimization algorithms of adaptive moment estimation (Adam) and stochastic gradient descent (SGD), two loss functions of focal loss and cross entropy loss, and two learning rate decay strategies of cosine annealing and multi-step. The intelligent recognition models of <i>O. hupensis robertsoni</i> and <i>Tricula</i> snails were constructed based on the EfficientNet-B4 model, and 7 training strategy groups were generated by combinations of different data augmentation approaches and hyperparameters. The performance of intelligent recognition models was tested with external test sets, and evaluated with accuracy, precision, recall, F1 score, loss, Youden's index, and the area under the receiver operating characteristic curve (AUC) under different training strategies.</p><p><strong>Results: </strong>The variation of loss values was comparable among intelligent recognition models with different data augmentation approaches. The Group 4 model constructed with Mixup and Gaussian blurring data augmentation approaches showed the optimal performance, with an accuracy of 90.38%, precision of 90.07%, F1 score of 89.44%, Youden's index of 0.81 and AUC of 0.961 in the external test set. The accuracy of models using the SGD optimizer reduced by 29.16% as compared to those using the Adam optimizer (χ<sup>2</sup> = 81.325, <i>P</i> < 0.001), and the accuracy of models using the cross entropy loss function reduced by 0.80% as compared to the Group 4 model (χ<sup>2</sup> = 3.147, <i>P</i> > 0.05), while the accuracy of models using the multi-step learning rate decay strategy increased by 0.65% as compared to the Group 4 model (χ<sup>2</sup> = 0.208, <i>P</i> > 0.05). In addition, the model with the baseline + Mixup + Gaussianblurring data augmentation approach and hyperparameters of Adam optimizer, focal loss function and multi-step learning rate decay strategy showed the highest performance, with an accuracy of 91.03%, precision of 91.97%, recall of 88.11%, F1 score of 90.00%, Youden's index of 0.82 and AUC values of 0.969 in external test set, respectively.</p><p><strong>Conclusions: </strong>The intelligent recognition model of <i>O. hupensis robertsoni</i> based on EfficientNet-B4 model is accurate for identification of <i>O. hupensis robertsoni</i> and <i>Tricula</i> snails in Yunnan Province.</p>","PeriodicalId":38874,"journal":{"name":"中国血吸虫病防治杂志","volume":"36 6","pages":"555-561"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"[Construction of a visual intelligent identification model for <i>Oncomelania hupensis robertsoni</i> in Yunnan Province based on the EfficientNet-B4 model].\",\"authors\":\"S Bai, J Zhou, Y Dong, J Zhang, L Shi, K Yang\",\"doi\":\"10.16250/j.32.1915.2024194\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objective: </strong>To construct a visual intelligent recognition model for <i>Oncomelania hupensis robertsoni</i> in Yunnan Province based on the EfficientNet-B4 model, and to evaluate the impact of data augmentation methods and model hyperparameters on the recognition of <i>O. hupensis robertsoni</i>.</p><p><strong>Methods: </strong>A total of 400 <i>O. hupensis robertsoni</i> and 400 <i>Tricula</i> snails were collected from Yongsheng County, Yunnan Province in June 2024, and snail images were captured following identification and classification of 300 <i>O. hupensis robertsoni</i> and 300 <i>Tricula</i> snails. A total of 925 <i>O. hupensis robertsoni</i> images and 1 062 <i>Tricula</i> snail images were collected as a dataset and divided into a training set and a validation set at a ratio of 8:2, while 352 images captured from the remaining 100 <i>O. hupensis robertsoni</i> and 354 images from the remaining 100 <i>Tricula</i> snails served as an external test set. All acquired images were subjected to preprocessing, including cropping and resizing. Three data augmentation approaches were employed, including baseline, Mixup and Gaussian blurring, and model hyperparameters included two optimization algorithms of adaptive moment estimation (Adam) and stochastic gradient descent (SGD), two loss functions of focal loss and cross entropy loss, and two learning rate decay strategies of cosine annealing and multi-step. The intelligent recognition models of <i>O. hupensis robertsoni</i> and <i>Tricula</i> snails were constructed based on the EfficientNet-B4 model, and 7 training strategy groups were generated by combinations of different data augmentation approaches and hyperparameters. The performance of intelligent recognition models was tested with external test sets, and evaluated with accuracy, precision, recall, F1 score, loss, Youden's index, and the area under the receiver operating characteristic curve (AUC) under different training strategies.</p><p><strong>Results: </strong>The variation of loss values was comparable among intelligent recognition models with different data augmentation approaches. The Group 4 model constructed with Mixup and Gaussian blurring data augmentation approaches showed the optimal performance, with an accuracy of 90.38%, precision of 90.07%, F1 score of 89.44%, Youden's index of 0.81 and AUC of 0.961 in the external test set. The accuracy of models using the SGD optimizer reduced by 29.16% as compared to those using the Adam optimizer (χ<sup>2</sup> = 81.325, <i>P</i> < 0.001), and the accuracy of models using the cross entropy loss function reduced by 0.80% as compared to the Group 4 model (χ<sup>2</sup> = 3.147, <i>P</i> > 0.05), while the accuracy of models using the multi-step learning rate decay strategy increased by 0.65% as compared to the Group 4 model (χ<sup>2</sup> = 0.208, <i>P</i> > 0.05). In addition, the model with the baseline + Mixup + Gaussianblurring data augmentation approach and hyperparameters of Adam optimizer, focal loss function and multi-step learning rate decay strategy showed the highest performance, with an accuracy of 91.03%, precision of 91.97%, recall of 88.11%, F1 score of 90.00%, Youden's index of 0.82 and AUC values of 0.969 in external test set, respectively.</p><p><strong>Conclusions: </strong>The intelligent recognition model of <i>O. hupensis robertsoni</i> based on EfficientNet-B4 model is accurate for identification of <i>O. hupensis robertsoni</i> and <i>Tricula</i> snails in Yunnan Province.</p>\",\"PeriodicalId\":38874,\"journal\":{\"name\":\"中国血吸虫病防治杂志\",\"volume\":\"36 6\",\"pages\":\"555-561\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-01-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"中国血吸虫病防治杂志\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.16250/j.32.1915.2024194\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Medicine\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"中国血吸虫病防治杂志","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.16250/j.32.1915.2024194","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0
摘要
目的:构建基于EfficientNet-B4模型的云南钉螺视觉智能识别模型,并评价数据增强方法和模型超参数对钉螺识别的影响。方法:于2024年6月在云南省永胜县采集罗氏钉螺400只、三角螺400只,对其中的罗氏钉螺300只、三角螺300只进行鉴定分类,采集钉螺图像。选取925张罗氏湖北钉螺图像和1 062张三角螺图像作为数据集,按8:2的比例划分为训练集和验证集,剩余100张罗氏湖北钉螺图像中的352张和剩余100张三角螺图像中的354张作为外部测试集。所有获得的图像进行预处理,包括裁剪和调整大小。采用基线、Mixup和高斯模糊三种数据增强方法,模型超参数包括自适应矩估计(Adam)和随机梯度下降(SGD)两种优化算法,焦点损失和交叉熵损失两种损失函数,余弦退火和多步学习率衰减策略。基于EfficientNet-B4模型构建了robertensis robertsoni和Tricula蜗牛的智能识别模型,并结合不同的数据增强方法和超参数生成了7个训练策略组。采用外部测试集对智能识别模型的性能进行测试,并对不同训练策略下智能识别模型的准确率、精密度、召回率、F1分数、损失、约登指数和接收者工作特征曲线下面积(AUC)进行评价。结果:不同数据增强方法的智能识别模型的损失值变化具有可比性。采用Mixup和高斯模糊数据增强方法构建的第4组模型表现出最优的性能,外部测试集的准确率为90.38%,精密度为90.07%,F1得分为89.44%,约登指数为0.81,AUC为0.961。使用SGD优化器的模型的准确率比使用Adam优化器的模型降低了29.16% (χ2 = 81.325, P < 0.001),使用交叉熵损失函数的模型的准确率比第4组模型降低了0.80% (χ2 = 3.147, P > 0.05),而使用多步学习率衰减策略的模型的准确率比第4组模型提高了0.65% (χ2 = 0.208, P > 0.05)。此外,采用baseline + Mixup + Gaussianblurring数据增强方法和Adam优化器、焦点损失函数和多步学习率衰减策略的超参数模型表现出最高的性能,准确率为91.03%,精密度为91.97%,召回率为88.11%,F1分数为90.00%,外部测试集的Youden指数为0.82,AUC值为0.969。结论:基于EfficientNet-B4模型的罗氏钉螺智能识别模型可准确识别云南省罗氏钉螺和三尾螺。
[Construction of a visual intelligent identification model for Oncomelania hupensis robertsoni in Yunnan Province based on the EfficientNet-B4 model].
Objective: To construct a visual intelligent recognition model for Oncomelania hupensis robertsoni in Yunnan Province based on the EfficientNet-B4 model, and to evaluate the impact of data augmentation methods and model hyperparameters on the recognition of O. hupensis robertsoni.
Methods: A total of 400 O. hupensis robertsoni and 400 Tricula snails were collected from Yongsheng County, Yunnan Province in June 2024, and snail images were captured following identification and classification of 300 O. hupensis robertsoni and 300 Tricula snails. A total of 925 O. hupensis robertsoni images and 1 062 Tricula snail images were collected as a dataset and divided into a training set and a validation set at a ratio of 8:2, while 352 images captured from the remaining 100 O. hupensis robertsoni and 354 images from the remaining 100 Tricula snails served as an external test set. All acquired images were subjected to preprocessing, including cropping and resizing. Three data augmentation approaches were employed, including baseline, Mixup and Gaussian blurring, and model hyperparameters included two optimization algorithms of adaptive moment estimation (Adam) and stochastic gradient descent (SGD), two loss functions of focal loss and cross entropy loss, and two learning rate decay strategies of cosine annealing and multi-step. The intelligent recognition models of O. hupensis robertsoni and Tricula snails were constructed based on the EfficientNet-B4 model, and 7 training strategy groups were generated by combinations of different data augmentation approaches and hyperparameters. The performance of intelligent recognition models was tested with external test sets, and evaluated with accuracy, precision, recall, F1 score, loss, Youden's index, and the area under the receiver operating characteristic curve (AUC) under different training strategies.
Results: The variation of loss values was comparable among intelligent recognition models with different data augmentation approaches. The Group 4 model constructed with Mixup and Gaussian blurring data augmentation approaches showed the optimal performance, with an accuracy of 90.38%, precision of 90.07%, F1 score of 89.44%, Youden's index of 0.81 and AUC of 0.961 in the external test set. The accuracy of models using the SGD optimizer reduced by 29.16% as compared to those using the Adam optimizer (χ2 = 81.325, P < 0.001), and the accuracy of models using the cross entropy loss function reduced by 0.80% as compared to the Group 4 model (χ2 = 3.147, P > 0.05), while the accuracy of models using the multi-step learning rate decay strategy increased by 0.65% as compared to the Group 4 model (χ2 = 0.208, P > 0.05). In addition, the model with the baseline + Mixup + Gaussianblurring data augmentation approach and hyperparameters of Adam optimizer, focal loss function and multi-step learning rate decay strategy showed the highest performance, with an accuracy of 91.03%, precision of 91.97%, recall of 88.11%, F1 score of 90.00%, Youden's index of 0.82 and AUC values of 0.969 in external test set, respectively.
Conclusions: The intelligent recognition model of O. hupensis robertsoni based on EfficientNet-B4 model is accurate for identification of O. hupensis robertsoni and Tricula snails in Yunnan Province.
期刊介绍:
Chinese Journal of Schistosomiasis Control (ISSN: 1005-6661, CN: 32-1374/R), founded in 1989, is a technical and scientific journal under the supervision of Jiangsu Provincial Health Commission and organised by Jiangsu Institute of Schistosomiasis Control. It is a scientific and technical journal under the supervision of Jiangsu Provincial Health Commission and sponsored by Jiangsu Institute of Schistosomiasis Prevention and Control. The journal carries out the policy of prevention-oriented, control-oriented, nationwide and grassroots, adheres to the tenet of scientific research service for the prevention and treatment of schistosomiasis and other parasitic diseases, and mainly publishes academic papers reflecting the latest achievements and dynamics of prevention and treatment of schistosomiasis and other parasitic diseases, scientific research and management, etc. The main columns are Guest Contributions, Experts‘ Commentary, Experts’ Perspectives, Experts' Forums, Theses, Prevention and Treatment Research, Experimental Research, The main columns include Guest Contributions, Expert Commentaries, Expert Perspectives, Expert Forums, Treatises, Prevention and Control Studies, Experimental Studies, Clinical Studies, Prevention and Control Experiences, Prevention and Control Management, Reviews, Case Reports, and Information, etc. The journal is a useful reference material for the professional and technical personnel of schistosomiasis and parasitic disease prevention and control research, management workers, and teachers and students of medical schools.
The journal is now included in important domestic databases, such as Chinese Core List (8th edition), China Science Citation Database (Core Edition), China Science and Technology Core Journals (Statistical Source Journals), and is also included in MEDLINE/PubMed, Scopus, EBSCO, Chemical Abstract, Embase, Zoological Record, JSTChina, Ulrichsweb, Western Pacific Region Index Medicus, CABI and other international authoritative databases.