Puyang Zhao , Xinhui Liu , Zhiyi Yue , Qianyu Zhao , Xinzhi Liu , Yuhui Deng , Jingjin Wu
{"title":"DiGAN 突破:利用基于 GAN 的创新不平衡校正技术推进糖尿病数据分析","authors":"Puyang Zhao , Xinhui Liu , Zhiyi Yue , Qianyu Zhao , Xinzhi Liu , Yuhui Deng , Jingjin Wu","doi":"10.1016/j.cmpbup.2024.100152","DOIUrl":null,"url":null,"abstract":"<div><p>In the rapidly evolving field of medical diagnostics, the challenge of imbalanced datasets, particularly in diabetes classification, calls for innovative solutions. The study introduces DiGAN, a groundbreaking approach that leverages the power of Generative Adversarial Networks (GAN) to revolutionize diabetes data analysis. Marking a significant departure from traditional methods, DiGAN applies GANs, typically seen in image processing, to the realm of diabetes data. This novel application is complemented by integrating the unsupervised Laplacian Score for sophisticated feature selection. The pioneering approach not only surpasses the limitations of existing techniques but also sets a new benchmark in classification accuracy with a 90% weighted F1-score, achieving a remarkable improvement of over 20% compared to conventional methods. Additionally, DiGAN demonstrates superior performance over popular SMOTE-based methods in handling extremely imbalanced datasets. This research, focusing on the integrated use of Laplacian Score, GAN, and Random Forest, stands at the forefront of diabetic classification, offering a uniquely effective and innovative solution to the long-standing data imbalance issue in medical diagnostics.</p></div>","PeriodicalId":72670,"journal":{"name":"Computer methods and programs in biomedicine update","volume":"5 ","pages":"Article 100152"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666990024000193/pdfft?md5=465f3cbca9e1cb295e9d2d56ae5c71e1&pid=1-s2.0-S2666990024000193-main.pdf","citationCount":"0","resultStr":"{\"title\":\"DiGAN Breakthrough: Advancing diabetic data analysis with innovative GAN-based imbalance correction techniques\",\"authors\":\"Puyang Zhao , Xinhui Liu , Zhiyi Yue , Qianyu Zhao , Xinzhi Liu , Yuhui Deng , Jingjin Wu\",\"doi\":\"10.1016/j.cmpbup.2024.100152\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>In the rapidly evolving field of medical diagnostics, the challenge of imbalanced datasets, particularly in diabetes classification, calls for innovative solutions. The study introduces DiGAN, a groundbreaking approach that leverages the power of Generative Adversarial Networks (GAN) to revolutionize diabetes data analysis. Marking a significant departure from traditional methods, DiGAN applies GANs, typically seen in image processing, to the realm of diabetes data. This novel application is complemented by integrating the unsupervised Laplacian Score for sophisticated feature selection. The pioneering approach not only surpasses the limitations of existing techniques but also sets a new benchmark in classification accuracy with a 90% weighted F1-score, achieving a remarkable improvement of over 20% compared to conventional methods. Additionally, DiGAN demonstrates superior performance over popular SMOTE-based methods in handling extremely imbalanced datasets. This research, focusing on the integrated use of Laplacian Score, GAN, and Random Forest, stands at the forefront of diabetic classification, offering a uniquely effective and innovative solution to the long-standing data imbalance issue in medical diagnostics.</p></div>\",\"PeriodicalId\":72670,\"journal\":{\"name\":\"Computer methods and programs in biomedicine update\",\"volume\":\"5 \",\"pages\":\"Article 100152\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2666990024000193/pdfft?md5=465f3cbca9e1cb295e9d2d56ae5c71e1&pid=1-s2.0-S2666990024000193-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer methods and programs in biomedicine update\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666990024000193\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer methods and programs in biomedicine update","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666990024000193","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
在快速发展的医疗诊断领域,不平衡数据集的挑战,尤其是在糖尿病分类方面,需要创新的解决方案。这项研究介绍了 DiGAN,这是一种开创性的方法,利用生成对抗网络(GAN)的力量彻底改变糖尿病数据分析。DiGAN 与传统方法大相径庭,它将通常用于图像处理的 GAN 应用于糖尿病数据领域。这种新颖的应用还结合了无监督拉普拉斯分数(Laplacian Score),用于复杂的特征选择。这种开创性的方法不仅超越了现有技术的局限性,还在分类准确率方面树立了新的标杆,加权 F1 分数高达 90%,与传统方法相比显著提高了 20% 以上。此外,在处理极度不平衡的数据集时,DiGAN 的表现优于基于 SMOTE 的流行方法。这项研究的重点是拉普拉斯分数、GAN 和随机森林的综合使用,它站在了糖尿病分类的前沿,为医疗诊断中长期存在的数据不平衡问题提供了一种独特有效的创新解决方案。
DiGAN Breakthrough: Advancing diabetic data analysis with innovative GAN-based imbalance correction techniques
In the rapidly evolving field of medical diagnostics, the challenge of imbalanced datasets, particularly in diabetes classification, calls for innovative solutions. The study introduces DiGAN, a groundbreaking approach that leverages the power of Generative Adversarial Networks (GAN) to revolutionize diabetes data analysis. Marking a significant departure from traditional methods, DiGAN applies GANs, typically seen in image processing, to the realm of diabetes data. This novel application is complemented by integrating the unsupervised Laplacian Score for sophisticated feature selection. The pioneering approach not only surpasses the limitations of existing techniques but also sets a new benchmark in classification accuracy with a 90% weighted F1-score, achieving a remarkable improvement of over 20% compared to conventional methods. Additionally, DiGAN demonstrates superior performance over popular SMOTE-based methods in handling extremely imbalanced datasets. This research, focusing on the integrated use of Laplacian Score, GAN, and Random Forest, stands at the forefront of diabetic classification, offering a uniquely effective and innovative solution to the long-standing data imbalance issue in medical diagnostics.