Applying machine learning with MobileNetV2 model for rapid screening of vaginal discharge samples in vaginitis diagnosis.

IF 3.9 2区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES
Thi Bang-Suong Nguyen, Hoang-Bac Nguyen, Thi Xuan-Thao Le, Thi Hong-Chau Bui, Le Song-Toan Nguyen, Thao-Huong Nguyen, Truong Cong-Minh Nguyen
{"title":"Applying machine learning with MobileNetV2 model for rapid screening of vaginal discharge samples in vaginitis diagnosis.","authors":"Thi Bang-Suong Nguyen, Hoang-Bac Nguyen, Thi Xuan-Thao Le, Thi Hong-Chau Bui, Le Song-Toan Nguyen, Thao-Huong Nguyen, Truong Cong-Minh Nguyen","doi":"10.1038/s41598-025-04626-9","DOIUrl":null,"url":null,"abstract":"<p><p>Vaginitis is a prevalent gynecological condition that impacts women's quality of life, with most women likely to experience it at least once. Traditional diagnosis involves manually observing vaginal discharge samples under a microscope. This process relies heavily on the technician's expertise and is vulnerable to subjective biases. The study aimed to improve diagnostic accuracy by applying machine learning, specifically the MobileNetV2 model, to automate the classification of vaginal discharge samples. This model supports doctors in identifying causative agents of vaginitis, including Gardnerella vaginalis, fungi, and other pathogens like bacteria or Trichomonas vaginalis. A dataset of 3,164 images from 1,582 vaginal discharge samples of women aged 18 and over was analyzed. Images were taken under a 40x optical microscope with a resolution of 800 × 800 pixels and classified into three groups: Group B (mixed bacteria or Trichomonas vaginalis), Group C (Gardnerella vaginalis, identified by clue cells), and Group F (fungi, e.g., Candida albicans, which appear as hyphae or yeast cells in samples). The model was trained using 80% of data for training, 10% for validation, and 10% for testing. Performance was evaluated using two statistical metrics: the F1 score (a measure of accuracy balancing precision and recall) and the AUC-PR (Area Under the Curve of the Precision-Recall curve, a measure of model reliability for imbalanced datasets). The MobileNetV2 model performed well across all datasets, achieving an F1 score > 0.75 and an AUC-PR > 0.80. It demonstrated the best performance in identifying Gardnerella vaginalis (Group C), with both metrics exceeding 0.90. In conclusion, this study highlights MobileNetV2's potential as a rapid screening tool for vaginitis, particularly in identifying Gardnerella vaginalis (F1 score and AUC-PR > 0.90). While challenges have remained in classifying co-infections (e.g., Groups B vs. F), the model's stability across datasets underscores its practical utility. Integrating AI into vaginitis diagnosis could enhance efficiency, reduce human error, and improve early detection, ultimately advancing patient care.</p>","PeriodicalId":21811,"journal":{"name":"Scientific Reports","volume":"15 1","pages":"19171"},"PeriodicalIF":3.9000,"publicationDate":"2025-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12126550/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Reports","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41598-025-04626-9","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Vaginitis is a prevalent gynecological condition that impacts women's quality of life, with most women likely to experience it at least once. Traditional diagnosis involves manually observing vaginal discharge samples under a microscope. This process relies heavily on the technician's expertise and is vulnerable to subjective biases. The study aimed to improve diagnostic accuracy by applying machine learning, specifically the MobileNetV2 model, to automate the classification of vaginal discharge samples. This model supports doctors in identifying causative agents of vaginitis, including Gardnerella vaginalis, fungi, and other pathogens like bacteria or Trichomonas vaginalis. A dataset of 3,164 images from 1,582 vaginal discharge samples of women aged 18 and over was analyzed. Images were taken under a 40x optical microscope with a resolution of 800 × 800 pixels and classified into three groups: Group B (mixed bacteria or Trichomonas vaginalis), Group C (Gardnerella vaginalis, identified by clue cells), and Group F (fungi, e.g., Candida albicans, which appear as hyphae or yeast cells in samples). The model was trained using 80% of data for training, 10% for validation, and 10% for testing. Performance was evaluated using two statistical metrics: the F1 score (a measure of accuracy balancing precision and recall) and the AUC-PR (Area Under the Curve of the Precision-Recall curve, a measure of model reliability for imbalanced datasets). The MobileNetV2 model performed well across all datasets, achieving an F1 score > 0.75 and an AUC-PR > 0.80. It demonstrated the best performance in identifying Gardnerella vaginalis (Group C), with both metrics exceeding 0.90. In conclusion, this study highlights MobileNetV2's potential as a rapid screening tool for vaginitis, particularly in identifying Gardnerella vaginalis (F1 score and AUC-PR > 0.90). While challenges have remained in classifying co-infections (e.g., Groups B vs. F), the model's stability across datasets underscores its practical utility. Integrating AI into vaginitis diagnosis could enhance efficiency, reduce human error, and improve early detection, ultimately advancing patient care.

Abstract Image

Abstract Image

Abstract Image

应用机器学习与MobileNetV2模型快速筛选阴道分泌物样本在阴道炎诊断中的应用。
阴道炎是一种影响女性生活质量的普遍妇科疾病,大多数女性可能至少经历过一次。传统的诊断包括在显微镜下手工观察阴道分泌物样本。这个过程严重依赖于技术人员的专业知识,容易受到主观偏见的影响。该研究旨在通过应用机器学习(特别是MobileNetV2模型)来自动分类阴道分泌物样本,从而提高诊断准确性。该模型支持医生识别阴道炎的病原体,包括阴道加德纳菌、真菌和其他病原体,如细菌或阴道毛滴虫。研究人员分析了来自1582名18岁及以上女性阴道分泌物样本的3164张图像。在分辨率为800 × 800像素的40倍光学显微镜下拍摄图像,并将图像分为三组:B组(混合细菌或阴道毛滴虫),C组(阴道加德纳菌,通过线索细胞识别)和F组(真菌,如白色念珠菌,以菌丝或酵母细胞的形式出现在样品中)。该模型使用80%的数据进行训练,10%用于验证,10%用于测试。使用两个统计指标评估性能:F1分数(准确度平衡精度和召回率的度量)和AUC-PR(精确度-召回率曲线下面积,衡量不平衡数据集的模型可靠性)。MobileNetV2模型在所有数据集上表现良好,F1得分为> 0.75,AUC-PR得分为> 0.80。该方法在识别阴道加德纳菌(C组)方面表现最佳,两项指标均超过0.90。总之,本研究强调了MobileNetV2作为阴道炎快速筛查工具的潜力,特别是在识别阴道加德纳菌(F1评分和AUC-PR > 0.90)方面。虽然在对合并感染进行分类(例如,B组与F组)方面仍然存在挑战,但该模型在数据集上的稳定性强调了其实用性。将人工智能集成到阴道炎诊断中可以提高效率,减少人为错误,改善早期发现,最终提高患者护理水平。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Scientific Reports
Scientific Reports Natural Science Disciplines-
CiteScore
7.50
自引率
4.30%
发文量
19567
审稿时长
3.9 months
期刊介绍: We publish original research from all areas of the natural sciences, psychology, medicine and engineering. You can learn more about what we publish by browsing our specific scientific subject areas below or explore Scientific Reports by browsing all articles and collections. Scientific Reports has a 2-year impact factor: 4.380 (2021), and is the 6th most-cited journal in the world, with more than 540,000 citations in 2020 (Clarivate Analytics, 2021). •Engineering Engineering covers all aspects of engineering, technology, and applied science. It plays a crucial role in the development of technologies to address some of the world''s biggest challenges, helping to save lives and improve the way we live. •Physical sciences Physical sciences are those academic disciplines that aim to uncover the underlying laws of nature — often written in the language of mathematics. It is a collective term for areas of study including astronomy, chemistry, materials science and physics. •Earth and environmental sciences Earth and environmental sciences cover all aspects of Earth and planetary science and broadly encompass solid Earth processes, surface and atmospheric dynamics, Earth system history, climate and climate change, marine and freshwater systems, and ecology. It also considers the interactions between humans and these systems. •Biological sciences Biological sciences encompass all the divisions of natural sciences examining various aspects of vital processes. The concept includes anatomy, physiology, cell biology, biochemistry and biophysics, and covers all organisms from microorganisms, animals to plants. •Health sciences The health sciences study health, disease and healthcare. This field of study aims to develop knowledge, interventions and technology for use in healthcare to improve the treatment of patients.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信