A comprehensive machine learning framework with particle swarm optimization for improved polycystic ovary syndrome (PCOS) diagnosis

IF 1.5 Q2 ENGINEERING, MULTIDISCIPLINARY
Ankur Kumar, Jaspreet Singh and Asim Ali Khan
{"title":"A comprehensive machine learning framework with particle swarm optimization for improved polycystic ovary syndrome (PCOS) diagnosis","authors":"Ankur Kumar, Jaspreet Singh and Asim Ali Khan","doi":"10.1088/2631-8695/ad76f9","DOIUrl":null,"url":null,"abstract":"Polycystic Ovary Syndrome (PCOS) is a hormonal disorder primarily affecting women of reproductive age, characterized by irregular menstrual cycles, elevated male hormones, and ovarian cysts. Early detection and treatment are crucial to prevent long-term complications. This research utilizes clinical data from Kaggle to develop a non-invasive PCOS diagnostic system. The authors conducted comprehensive data preprocessing, feature engineering, and exploratory data analysis (EDA). The refined dataset was incorporated into various default machine learning (ML) algorithms, including LR, LDA, GNB, SVM, XGB, DT, AB, RF, and KNN, for PCOS classification with varying train test ratios 70:30 to 80:20. To further enhance the model’s performance, the authors hybridized all the ML models with Particle Swarm Optimization (PSO). Remarkably, the proposed LR+PSO model achieved the highest accuracy at 96.30%, demonstrating exceptional proficiency with an 80:20 train-test ratio. It significantly improved sensitivity to 94.44%, indicating enhanced detection of positive cases, all while maintaining the highest specificity at 97.22% and precision at 94.44% compared to other models. These results highlight a substantial improvement in integrated models, emphasizing the potential of this novel approach to enhance PCOS diagnosis in terms of accuracy and efficiency, ultimately benefiting individuals with PCOS in their treatment journey.","PeriodicalId":11753,"journal":{"name":"Engineering Research Express","volume":"30 1","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Research Express","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1088/2631-8695/ad76f9","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Polycystic Ovary Syndrome (PCOS) is a hormonal disorder primarily affecting women of reproductive age, characterized by irregular menstrual cycles, elevated male hormones, and ovarian cysts. Early detection and treatment are crucial to prevent long-term complications. This research utilizes clinical data from Kaggle to develop a non-invasive PCOS diagnostic system. The authors conducted comprehensive data preprocessing, feature engineering, and exploratory data analysis (EDA). The refined dataset was incorporated into various default machine learning (ML) algorithms, including LR, LDA, GNB, SVM, XGB, DT, AB, RF, and KNN, for PCOS classification with varying train test ratios 70:30 to 80:20. To further enhance the model’s performance, the authors hybridized all the ML models with Particle Swarm Optimization (PSO). Remarkably, the proposed LR+PSO model achieved the highest accuracy at 96.30%, demonstrating exceptional proficiency with an 80:20 train-test ratio. It significantly improved sensitivity to 94.44%, indicating enhanced detection of positive cases, all while maintaining the highest specificity at 97.22% and precision at 94.44% compared to other models. These results highlight a substantial improvement in integrated models, emphasizing the potential of this novel approach to enhance PCOS diagnosis in terms of accuracy and efficiency, ultimately benefiting individuals with PCOS in their treatment journey.
利用粒子群优化的综合机器学习框架改进多囊卵巢综合征(PCOS)诊断
多囊卵巢综合症(PCOS)是一种主要影响育龄妇女的荷尔蒙失调症,其特点是月经周期不规律、雄性激素升高和卵巢囊肿。早期发现和治疗对预防长期并发症至关重要。这项研究利用来自 Kaggle 的临床数据开发了一种无创多囊卵巢综合症诊断系统。作者进行了全面的数据预处理、特征工程和探索性数据分析(EDA)。改进后的数据集被纳入各种默认的机器学习(ML)算法,包括 LR、LDA、GNB、SVM、XGB、DT、AB、RF 和 KNN,用于 PCOS 分类,训练测试比例为 70:30 到 80:20。为了进一步提高模型的性能,作者将所有 ML 模型与粒子群优化(PSO)进行了混合。值得注意的是,所提出的 LR+PSO 模型达到了 96.30% 的最高准确率,在 80:20 的训练测试比下表现出了非凡的能力。与其他模型相比,该模型的灵敏度大幅提高到 94.44%,表明阳性病例的检测能力得到了增强,同时特异度和精确度分别保持在 97.22% 和 94.44% 的最高水平。这些结果凸显了综合模型的大幅改进,强调了这种新方法在提高多囊卵巢综合症诊断的准确性和效率方面的潜力,最终使多囊卵巢综合症患者在治疗过程中受益。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Engineering Research Express
Engineering Research Express Engineering-Engineering (all)
CiteScore
2.20
自引率
5.90%
发文量
192
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信