Healthcare analytics (New York, N.Y.)最新文献

筛选
英文 中文
A triplanar ensemble model for brain tumor segmentation with volumetric multiparametric magnetic resonance images 利用容积多参数磁共振图像进行脑肿瘤分割的三平面集合模型
Healthcare analytics (New York, N.Y.) Pub Date : 2024-02-04 DOI: 10.1016/j.health.2024.100307
Snehal Rajput , Rupal Kapdi , Mohendra Roy , Mehul S. Raval
{"title":"A triplanar ensemble model for brain tumor segmentation with volumetric multiparametric magnetic resonance images","authors":"Snehal Rajput ,&nbsp;Rupal Kapdi ,&nbsp;Mohendra Roy ,&nbsp;Mehul S. Raval","doi":"10.1016/j.health.2024.100307","DOIUrl":"https://doi.org/10.1016/j.health.2024.100307","url":null,"abstract":"<div><p>Automated segmentation methods can produce faster segmentation of tumors in medical images, aiding medical professionals in diagnosis and treatment plans. A 3D U-Net method excels in this task but has high computational costs due to large model parameters, which limits their application under resource constraints. This study targets an optimized triplanar (2.5D) model ensemble to generate accurate segmentation with fewer parameters. The proposed triplanar model uses spatial and channel attention mechanisms and information from multiple orthogonal planar views to predict segmentation labels. In particular, we studied the optimum filter size to improve the accuracy without increasing the network complexity. The model generated output is further post-processed to fine-tune the segmentation results. The Dice similarity coefficients (Dice-score) of the Brain Tumor Segmentation (BraTS) 2020 training set for enhancing tumor (ET), whole tumor (WT), and tumor core (TC) are 0.736, 0.896, and 0.841, whereas, for the validation set, they are 0.713, 0.873, and 0.778, respectively. The proposed base model has only <span><math><mrow><mn>10</mn><mo>.</mo><mn>25</mn><mspace></mspace><mi>M</mi></mrow></math></span> parameters, three times less than BraTS 2020’s best-performing model (ET 0.798, WT 0.912, TC 0.857) on the validation set. The proposed ensemble model has <span><math><mrow><mn>93</mn><mo>.</mo><mn>5</mn><mspace></mspace><mi>M</mi></mrow></math></span> parameters, 1.6 times less than the top-ranked model and two times less than the third-ranked model (ET 0.793, WT 0.911, TC 0.853 on validation set) of BraTS2020 challenge.</p></div>","PeriodicalId":73222,"journal":{"name":"Healthcare analytics (New York, N.Y.)","volume":"5 ","pages":"Article 100307"},"PeriodicalIF":0.0,"publicationDate":"2024-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772442524000091/pdfft?md5=d29fc0533e483abd517c7cab8004bdcb&pid=1-s2.0-S2772442524000091-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139710235","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using process mining algorithms for process improvement in healthcare 使用流程挖掘算法改进医疗保健流程
Healthcare analytics (New York, N.Y.) Pub Date : 2024-02-01 DOI: 10.1016/j.health.2024.100305
Fazla Rabbi , Debapriya Banik , Niamat Ullah Ibne Hossain , Alexandr Sokolov
{"title":"Using process mining algorithms for process improvement in healthcare","authors":"Fazla Rabbi ,&nbsp;Debapriya Banik ,&nbsp;Niamat Ullah Ibne Hossain ,&nbsp;Alexandr Sokolov","doi":"10.1016/j.health.2024.100305","DOIUrl":"https://doi.org/10.1016/j.health.2024.100305","url":null,"abstract":"<div><p>Healthcare professionals must provide their patients with the best possible service and be well-informed and expert at carrying out complex surgical procedures to fulfill this responsibility. The aim of the medical treatments is fewer complications, shorter hospital stays, and a better patient experience. Through continuous learning and training, medical practitioners trained in up-to-date and state-of-the-art surgical techniques and technologies make productive and effective healthcare systems possible. Healthcare systems often report on problems with surgical processes, skipped procedures, unusual activities during operations, and lengthy transition times. This event log data allows implementing process mining methods to deliver medical professionals with simple and understandable findings using Petri nets for process analysis and enhancement. This study identifies the parallels and discrepancies between the pre-and post-stages and their respective frequency on each typical Central Venous Catheter (CVC) installation activity. The Process Mining for Python (PM4Py) frameworks used four major mining algorithms to view the event log (i.e., alpha miner, directly-follows graph (DFG), heuristic miner, and inductive miner). This study's findings indicate that medical residents are more susceptible to error during pre-operative procedures.</p></div>","PeriodicalId":73222,"journal":{"name":"Healthcare analytics (New York, N.Y.)","volume":"5 ","pages":"Article 100305"},"PeriodicalIF":0.0,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772442524000078/pdfft?md5=d25e53aa28e307b96560fec95871fd89&pid=1-s2.0-S2772442524000078-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139674635","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A mathematical tumor growth model for exploring saturated response of M2 macrophages 用于探索 M2 巨噬细胞饱和反应的肿瘤生长数学模型
Healthcare analytics (New York, N.Y.) Pub Date : 2024-01-31 DOI: 10.1016/j.health.2024.100306
Kaushik Dehingia , Yamen Alharbi , Vikas Pandey
{"title":"A mathematical tumor growth model for exploring saturated response of M2 macrophages","authors":"Kaushik Dehingia ,&nbsp;Yamen Alharbi ,&nbsp;Vikas Pandey","doi":"10.1016/j.health.2024.100306","DOIUrl":"https://doi.org/10.1016/j.health.2024.100306","url":null,"abstract":"<div><p>This study addresses a tumor–macrophage interaction model to examine the role of the saturated response of M2 macrophages. We find the equilibrium point of the model and analyze local stability at each equilibrium. We show that tumor-free equilibrium is always stable, whereas, under certain conditions, the tumor-dominant and interior equilibrium are asymptotically stable. Moreover, stable and unstable limit cycles and period-doubling bifurcation have been observed at the interior equilibrium point. A remarkable result has been observed: in the presence of a saturated response of M2 macrophages, with a relatively higher activation rate of M2 macrophages due to tumor cells, the disease spreads more quickly in the body. Hence, M1 macrophages cannot stabilize the system, and aperiodic oscillations are observed. Furthermore, we show that a better immune response can reverse that system’s unstable nature. Numerical simulations verify the analytical results.</p></div>","PeriodicalId":73222,"journal":{"name":"Healthcare analytics (New York, N.Y.)","volume":"5 ","pages":"Article 100306"},"PeriodicalIF":0.0,"publicationDate":"2024-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S277244252400008X/pdfft?md5=56bbf5f1b26299586ec2ca78c05789d3&pid=1-s2.0-S277244252400008X-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139653032","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A systematic review of artificial intelligence techniques for oral cancer detection 口腔癌检测人工智能技术系统综述
Healthcare analytics (New York, N.Y.) Pub Date : 2024-01-22 DOI: 10.1016/j.health.2024.100304
Kavyashree C. , H.S. Vimala , Shreyas J.
{"title":"A systematic review of artificial intelligence techniques for oral cancer detection","authors":"Kavyashree C. ,&nbsp;H.S. Vimala ,&nbsp;Shreyas J.","doi":"10.1016/j.health.2024.100304","DOIUrl":"10.1016/j.health.2024.100304","url":null,"abstract":"<div><p>Oral cancer is a form of cancer that develops in the tissue of an oral cavity. Detection at an early stage is necessary to prevent the mortality rate in cancer patients. Artificial intelligence (AI) techniques play a significant role in assisting with diagnosing oral cancer. The AI techniques provide better detection accuracy and help automate oral cancer detection. The study shows that AI has a wide range of algorithms and provides outcomes in the most precise manner possible. We provide an overview of different input types and apply an appropriate algorithm to detect oral cancer. We aim to provide an overview of various AI techniques that can be used to automate oral cancer detection and to analyze these techniques to improve the efficiency and accuracy of oral cancer screening. We provide a summary of various methods available for oral cancer detection. We cover different input image formats, their processing, and the need for segmentation and feature extraction. We further include a list of other conventional strategies. We focus on various AI techniques for detecting oral cancer, including deep learning, machine learning, fuzzy computing, data mining, and genetic algorithms, and evaluates their benefits and drawbacks. The larger part of the articles focused on deep learning (37%) methods, followed by machine learning (32%), genetic algorithms (12%), data mining techniques (10%), and fuzzy computing (9%) for oral cancer detection.</p></div>","PeriodicalId":73222,"journal":{"name":"Healthcare analytics (New York, N.Y.)","volume":"5 ","pages":"Article 100304"},"PeriodicalIF":0.0,"publicationDate":"2024-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772442524000066/pdfft?md5=4271ee0a4378ec8144ed336855cbfa61&pid=1-s2.0-S2772442524000066-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139636955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An advanced deep neural network for fundus image analysis and enhancing diabetic retinopathy detection 用于眼底图像分析和增强糖尿病视网膜病变检测的高级深度神经网络
Healthcare analytics (New York, N.Y.) Pub Date : 2024-01-20 DOI: 10.1016/j.health.2024.100303
F M Javed Mehedi Shamrat , Rashiduzzaman Shakil , Sharmin , Nazmul Hoque ovy , Bonna Akter , Md Zunayed Ahmed , Kawsar Ahmed , Francis M. Bui , Mohammad Ali Moni
{"title":"An advanced deep neural network for fundus image analysis and enhancing diabetic retinopathy detection","authors":"F M Javed Mehedi Shamrat ,&nbsp;Rashiduzzaman Shakil ,&nbsp;Sharmin ,&nbsp;Nazmul Hoque ovy ,&nbsp;Bonna Akter ,&nbsp;Md Zunayed Ahmed ,&nbsp;Kawsar Ahmed ,&nbsp;Francis M. Bui ,&nbsp;Mohammad Ali Moni","doi":"10.1016/j.health.2024.100303","DOIUrl":"https://doi.org/10.1016/j.health.2024.100303","url":null,"abstract":"<div><p>Diabetic retinopathy (DR) involves retina damage due to diabetes, often leading to blindness. It is diagnosed via color fundus injections, but the manual analysis is cumbersome and error-prone. While computer vision techniques can predict DR stages, they are computationally intensive and struggle with complex data extraction. In this research, our prime objective was to automate the process of DR classification into its various stages using convolutional neural network (CNN) models. We employed the performance of fifteen pre-trained models with our novel proposed diabetic retinopathy network (DRNet13) model. We aimed to discern the most efficient model for accurate diabetic retinopathy (DR) staging based on fundus images from five DR classes. We preprocessed the image using a median filter for noise reduction and Gamma correction for image enhancement. We expanded our dataset from 3662 to 7500 images to create a more generalized training model through various augmentation techniques. We also evaluated multiple evaluation metrics, including accuracy, precision, F1-score, Sensitivity, Specificity, Area under the curve (AUC), Mean Squared Error (MSE), False Positive Rate (FPR), False Negative Rate (FNR), in addition to confusion matrices for an in-depth comparison of the performance of these models. Feature maps were employed to illuminate decision making areas in the DRNet13 model, which achieved a 97 % accuracy rate for DR detection, surpassing other CNN architectures in speed and efficiency. Despite a few misclassifications, the model's capability to identify critical features demonstrates its potential as an impactful diagnostic tool for timely and accurate identification of diabetic retinopathy.</p></div>","PeriodicalId":73222,"journal":{"name":"Healthcare analytics (New York, N.Y.)","volume":"5 ","pages":"Article 100303"},"PeriodicalIF":0.0,"publicationDate":"2024-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772442524000054/pdfft?md5=d8486a0b7c2a66d37a79ca700f9d36fd&pid=1-s2.0-S2772442524000054-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139549042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel fractional-order stochastic epidemic model to analyze the role of media awareness in the spread of conjunctivitis 分析媒体意识在结膜炎传播中的作用的新型分数阶随机流行病模型
Healthcare analytics (New York, N.Y.) Pub Date : 2024-01-17 DOI: 10.1016/j.health.2024.100302
Shiv Mangal , Ebenezer Bonyah , Vijay Shankar Sharma , Y. Yuan
{"title":"A novel fractional-order stochastic epidemic model to analyze the role of media awareness in the spread of conjunctivitis","authors":"Shiv Mangal ,&nbsp;Ebenezer Bonyah ,&nbsp;Vijay Shankar Sharma ,&nbsp;Y. Yuan","doi":"10.1016/j.health.2024.100302","DOIUrl":"https://doi.org/10.1016/j.health.2024.100302","url":null,"abstract":"<div><p>This study introduces a novel fractional-order stochastic epidemic model to analyze the spread of conjunctivitis, a prevalent ocular infection, while accounting for the influence of media awareness on disease transmission. The model incorporates fractional derivatives to capture memory effects and non-local interactions inherent in epidemic processes, allowing for a more accurate representation of disease dynamics. The stability analysis of equilibrium points is carried out based on the basic reproduction number <span><math><msub><mrow><mi>ℛ</mi></mrow><mrow><mn>0</mn></mrow></msub></math></span> and fractional-order <span><math><mi>α</mi></math></span>. Further, the Hopf bifurcation phenomenon is discussed in this paper. Stochasticity accounts for the randomness in transmission events. The findings of this study provide insights into the complex interrelationship between disease dynamics and media influence, shedding light on the role of public awareness in mitigating or exacerbating conjunctivitis outbreaks. The implications of this work extend to public health policy formulation, highlighting the importance of targeted communication strategies in controlling and preventing the spread of conjunctivitis and similar infectious diseases.</p></div>","PeriodicalId":73222,"journal":{"name":"Healthcare analytics (New York, N.Y.)","volume":"5 ","pages":"Article 100302"},"PeriodicalIF":0.0,"publicationDate":"2024-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772442524000042/pdfft?md5=38829598f690a40a705f819fef29eef9&pid=1-s2.0-S2772442524000042-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139487305","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel machine learning approach for diagnosing diabetes with a self-explainable interface 利用可自我解释的界面诊断糖尿病的新型机器学习方法
Healthcare analytics (New York, N.Y.) Pub Date : 2024-01-17 DOI: 10.1016/j.health.2024.100301
Gangani Dharmarathne , Thilini N. Jayasinghe , Madhusha Bogahawaththa , D.P.P. Meddage , Upaka Rathnayake
{"title":"A novel machine learning approach for diagnosing diabetes with a self-explainable interface","authors":"Gangani Dharmarathne ,&nbsp;Thilini N. Jayasinghe ,&nbsp;Madhusha Bogahawaththa ,&nbsp;D.P.P. Meddage ,&nbsp;Upaka Rathnayake","doi":"10.1016/j.health.2024.100301","DOIUrl":"https://doi.org/10.1016/j.health.2024.100301","url":null,"abstract":"<div><p>This study introduces the first-ever self-explanatory interface for diagnosing diabetes patients using machine learning. We propose four classification models (Decision Tree (DT), K-nearest Neighbor (KNN), Support Vector Classification (SVC), and Extreme Gradient Boosting (XGB)) based on the publicly available diabetes dataset. To elucidate the inner workings of these models, we employed the machine learning interpretation method known as Shapley Additive Explanations (SHAP). All the models exhibited commendable accuracy in diagnosing patients with diabetes, with the XGB model showing a slight edge over the others. Utilising SHAP, we delved into the XGB model, providing in-depth insights into the reasoning behind its predictions at a granular level. Subsequently, we integrated the XGB model and SHAP’s local explanations into an interface to predict diabetes in patients. This interface serves a critical role as it diagnoses patients and offers transparent explanations for the decisions made, providing users with a heightened awareness of their current health conditions. Given the high-stakes nature of the medical field, this developed interface can be further enhanced by including more extensive clinical data, ultimately aiding medical professionals in their decision-making processes.</p></div>","PeriodicalId":73222,"journal":{"name":"Healthcare analytics (New York, N.Y.)","volume":"5 ","pages":"Article 100301"},"PeriodicalIF":0.0,"publicationDate":"2024-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772442524000030/pdfft?md5=494bc571d60d347c01d68d0c317c4288&pid=1-s2.0-S2772442524000030-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139487303","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A mathematical model for investigating the effect of media awareness programs on the spread of COVID-19 with optimal control 通过优化控制研究媒体宣传计划对 COVID-19 传播影响的数学模型
Healthcare analytics (New York, N.Y.) Pub Date : 2024-01-15 DOI: 10.1016/j.health.2024.100300
Naba Kumar Goswami , Samson Olaniyi , Sulaimon F. Abimbade , Furaha M. Chuma
{"title":"A mathematical model for investigating the effect of media awareness programs on the spread of COVID-19 with optimal control","authors":"Naba Kumar Goswami ,&nbsp;Samson Olaniyi ,&nbsp;Sulaimon F. Abimbade ,&nbsp;Furaha M. Chuma","doi":"10.1016/j.health.2024.100300","DOIUrl":"https://doi.org/10.1016/j.health.2024.100300","url":null,"abstract":"<div><p>The coronavirus pandemic is a global health crisis creating an unprecedented socio-economic catastrophe. This pandemic is the biggest challenge the world has faced since World War II and is the main turning point in the history of humanity. Media coverage can change citizens’ attention to emerging infectious diseases and consequently change individual behaviors and attitudes. This study proposes and analyzes a seven-compartmental mathematical model to investigate the impact of media coverage on the spread and control of COVID-19. The threshold condition Ro for the initial transmission of infection is achieved by the next-generation approach. Stability analysis of the proposed model on disease-free and endemic equilibria is investigated in terms of basic reproduction numbers locally and globally. The sensitivity analysis of the reproduction number is visualized to distinguish the most sensitive parameters that can be regulated to control the transmission dynamics of coronavirus disease. Moreover, the theoretical results of the deterministic model are compared using numerical simulations. The outcomes of the analysis suggest that the disease prevalence can be terminated by suitable management of quarantine/medical care. We further extend the model to the optimal control framework. It is analyzed using Pontryagin’s maximum principle to characterize preventive control, testing facility, and treatment measures for managing COVID-19 transmission.</p></div>","PeriodicalId":73222,"journal":{"name":"Healthcare analytics (New York, N.Y.)","volume":"5 ","pages":"Article 100300"},"PeriodicalIF":0.0,"publicationDate":"2024-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772442524000029/pdfft?md5=181a72d948017369ae65a88b5750c988&pid=1-s2.0-S2772442524000029-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139487304","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An evaluation of machine learning approaches for early diagnosis of autism spectrum disorder 评估自闭症谱系障碍早期诊断的机器学习方法
Healthcare analytics (New York, N.Y.) Pub Date : 2024-01-04 DOI: 10.1016/j.health.2023.100293
Rownak Ara Rasul , Promy Saha , Diponkor Bala , S.M. Rakib Ul Karim , Md. Ibrahim Abdullah , Bishwajit Saha
{"title":"An evaluation of machine learning approaches for early diagnosis of autism spectrum disorder","authors":"Rownak Ara Rasul ,&nbsp;Promy Saha ,&nbsp;Diponkor Bala ,&nbsp;S.M. Rakib Ul Karim ,&nbsp;Md. Ibrahim Abdullah ,&nbsp;Bishwajit Saha","doi":"10.1016/j.health.2023.100293","DOIUrl":"https://doi.org/10.1016/j.health.2023.100293","url":null,"abstract":"<div><p>Autistic Spectrum Disorder (ASD) is a neurological disease characterized by difficulties with social interaction, communication, and repetitive activities. While its primary origin lies in genetics, early detection is crucial, and leveraging machine learning offers a promising avenue for a faster and more cost-effective diagnosis. This study employs diverse machine learning methods to identify crucial ASD traits, aiming to enhance and automate the diagnostic process. We study eight state-of-the-art classification models to determine their effectiveness in ASD detection. We evaluate the models using accuracy, precision, recall, specificity, F1-score, area under the curve (AUC), kappa, and log loss metrics to find the best classifier for these binary datasets. Among all the classification models, for the children dataset, the SVM and LR models achieve the highest accuracy of 100% and for the adult dataset, the LR model produces the highest accuracy of 97.14%. Our proposed ANN model provides the highest accuracy of 94.24% for the new combined dataset when hyperparameters are precisely tuned for each model. As almost all classification models achieve high accuracy which utilize true labels, we become interested in delving into five popular clustering algorithms to understand model behavior in scenarios without true labels. We calculate Normalized Mutual Information (NMI), Adjusted Rand Index (ARI), and Silhouette Coefficient (SC) metrics to select the best clustering models. Our evaluation finds that spectral clustering outperforms all other benchmarking clustering models in terms of NMI and ARI metrics while demonstrating comparability to the optimal SC achieved by k-means. The implemented code is available at <span>GitHub</span><svg><path></path></svg>.</p></div>","PeriodicalId":73222,"journal":{"name":"Healthcare analytics (New York, N.Y.)","volume":"5 ","pages":"Article 100293"},"PeriodicalIF":0.0,"publicationDate":"2024-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772442523001600/pdfft?md5=e0fd6cd67baa47c33181f21a1d4a70e4&pid=1-s2.0-S2772442523001600-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139434016","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An investigation of machine learning algorithms and data augmentation techniques for diabetes diagnosis using class imbalanced BRFSS dataset 利用类不平衡 BRFSS 数据集研究用于糖尿病诊断的机器学习算法和数据增强技术
Healthcare analytics (New York, N.Y.) Pub Date : 2023-12-30 DOI: 10.1016/j.health.2023.100297
Mohammad Mihrab Chowdhury , Ragib Shahariar Ayon , Md Sakhawat Hossain
{"title":"An investigation of machine learning algorithms and data augmentation techniques for diabetes diagnosis using class imbalanced BRFSS dataset","authors":"Mohammad Mihrab Chowdhury ,&nbsp;Ragib Shahariar Ayon ,&nbsp;Md Sakhawat Hossain","doi":"10.1016/j.health.2023.100297","DOIUrl":"https://doi.org/10.1016/j.health.2023.100297","url":null,"abstract":"<div><p>Diabetes is a prevalent chronic condition that poses significant challenges to early diagnosis and identifying at-risk individuals. Machine learning plays a crucial role in diabetes detection by leveraging its ability to process large volumes of data and identify complex patterns. However, imbalanced data, where the number of diabetic cases is substantially smaller than non-diabetic cases, complicates the identification of individuals with diabetes using machine learning algorithms. This study focuses on predicting whether a person is at risk of diabetes, considering the individual’s health and socio-economic conditions while mitigating the challenges posed by imbalanced data. We employ several data augmentation techniques, such as oversampling (Synthetic Minority Over Sampling for Nominal Data, i.e.SMOTE-N), undersampling (Edited Nearest Neighbor, i.e. ENN), and hybrid sampling techniques (SMOTE-Tomek and SMOTE-ENN) on training data before applying machine learning algorithms to minimize the impact of imbalanced data. Our study sheds light on the significance of carefully utilizing data augmentation techniques without any data leakage to enhance the effectiveness of machine learning algorithms. Moreover, it offers a complete machine learning structure for healthcare practitioners, from data obtaining to machine learning prediction, enabling them to make informed decisions.</p></div>","PeriodicalId":73222,"journal":{"name":"Healthcare analytics (New York, N.Y.)","volume":"5 ","pages":"Article 100297"},"PeriodicalIF":0.0,"publicationDate":"2023-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772442523001648/pdfft?md5=cbb15d1b9b72127ef6f0b213ad40bae0&pid=1-s2.0-S2772442523001648-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139108378","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信