{"title":"数据挖掘镰状细胞病患者的住院治疗和出院总结","authors":"Mohammed Gollapalli, A. Alfaleh","doi":"10.1109/ITIKD56332.2023.10099773","DOIUrl":null,"url":null,"abstract":"Sickle cell disease (SCD) is a hereditary blood disorder that affects certain parts of the world. This disease affects hemoglobin, causing red blood cells to change shape, such as sickle and crescent, making it difficult to supply oxygen to all of the human body's cells. Various genotypes of SCD have been discovered; the most common disorder is sickle cell anemia. This study is a continuation of our ongoing research on 191,406 clinical records of SCD patients who visited and got hospitalized over a 12-year period (between 2008 - 2020). This paper focused on conducting the retrospective analysis and then applying data mining classification algorithms on SCD patients' data based on hospitalization records, hospital visits, hospital admissions reasons, department patients were admitted to, the length of time patients were treated in the hospital, blood transfer section for S C D patients, and discharge reason for different types of S C D patients. Five distinct classification models with ten cross-validations were experimented using the Naive Bayes, J48, SVM, NN, and PART algorithms. Furthermore, parameter optimization was carried out to determine the optimal classification results of each algorithm. Naïve Bayes with an accuracy of 95.50%, was faster, correctly classified clinical cases, and provided detailed correlation results for each of the target features. Finally, we extracted knowledge clusters on hospital clinical services for SCD patients, which were then validated by medical doctors in order to better serve SCD patients visiting the hospital.","PeriodicalId":283631,"journal":{"name":"2023 International Conference on IT Innovation and Knowledge Discovery (ITIKD)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Data Mining Hospital Treatment and Discharge Summary of Sickle Cell Disease Patients\",\"authors\":\"Mohammed Gollapalli, A. Alfaleh\",\"doi\":\"10.1109/ITIKD56332.2023.10099773\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Sickle cell disease (SCD) is a hereditary blood disorder that affects certain parts of the world. This disease affects hemoglobin, causing red blood cells to change shape, such as sickle and crescent, making it difficult to supply oxygen to all of the human body's cells. Various genotypes of SCD have been discovered; the most common disorder is sickle cell anemia. This study is a continuation of our ongoing research on 191,406 clinical records of SCD patients who visited and got hospitalized over a 12-year period (between 2008 - 2020). This paper focused on conducting the retrospective analysis and then applying data mining classification algorithms on SCD patients' data based on hospitalization records, hospital visits, hospital admissions reasons, department patients were admitted to, the length of time patients were treated in the hospital, blood transfer section for S C D patients, and discharge reason for different types of S C D patients. Five distinct classification models with ten cross-validations were experimented using the Naive Bayes, J48, SVM, NN, and PART algorithms. Furthermore, parameter optimization was carried out to determine the optimal classification results of each algorithm. Naïve Bayes with an accuracy of 95.50%, was faster, correctly classified clinical cases, and provided detailed correlation results for each of the target features. Finally, we extracted knowledge clusters on hospital clinical services for SCD patients, which were then validated by medical doctors in order to better serve SCD patients visiting the hospital.\",\"PeriodicalId\":283631,\"journal\":{\"name\":\"2023 International Conference on IT Innovation and Knowledge Discovery (ITIKD)\",\"volume\":\"13 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 International Conference on IT Innovation and Knowledge Discovery (ITIKD)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ITIKD56332.2023.10099773\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on IT Innovation and Knowledge Discovery (ITIKD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITIKD56332.2023.10099773","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
镰状细胞病(SCD)是一种影响世界某些地区的遗传性血液疾病。这种疾病影响血红蛋白,导致红细胞改变形状,如镰状和新月形,使人体所有细胞难以供应氧气。SCD的各种基因型已经被发现;最常见的疾病是镰状细胞性贫血。本研究是我们正在进行的对12年(2008 - 2020年)期间就诊和住院的191,406例SCD患者临床记录的研究的延续。本文重点对SCD患者的住院记录、就诊次数、入院原因、住院科室、住院时间、S - C - D患者的转血科室、不同类型S - C - D患者的出院原因等数据进行回顾性分析,然后应用数据挖掘分类算法对SCD患者数据进行分类。使用朴素贝叶斯、J48、支持向量机、神经网络和PART算法对5种不同的分类模型进行了10次交叉验证。进一步进行参数优化,确定各算法的最优分类结果。Naïve贝叶斯准确率为95.50%,速度更快,正确分类临床病例,并提供了每个目标特征的详细相关性结果。最后,我们提取了SCD患者的医院临床服务知识聚类,然后由医生进行验证,以便更好地为就诊的SCD患者服务。
Data Mining Hospital Treatment and Discharge Summary of Sickle Cell Disease Patients
Sickle cell disease (SCD) is a hereditary blood disorder that affects certain parts of the world. This disease affects hemoglobin, causing red blood cells to change shape, such as sickle and crescent, making it difficult to supply oxygen to all of the human body's cells. Various genotypes of SCD have been discovered; the most common disorder is sickle cell anemia. This study is a continuation of our ongoing research on 191,406 clinical records of SCD patients who visited and got hospitalized over a 12-year period (between 2008 - 2020). This paper focused on conducting the retrospective analysis and then applying data mining classification algorithms on SCD patients' data based on hospitalization records, hospital visits, hospital admissions reasons, department patients were admitted to, the length of time patients were treated in the hospital, blood transfer section for S C D patients, and discharge reason for different types of S C D patients. Five distinct classification models with ten cross-validations were experimented using the Naive Bayes, J48, SVM, NN, and PART algorithms. Furthermore, parameter optimization was carried out to determine the optimal classification results of each algorithm. Naïve Bayes with an accuracy of 95.50%, was faster, correctly classified clinical cases, and provided detailed correlation results for each of the target features. Finally, we extracted knowledge clusters on hospital clinical services for SCD patients, which were then validated by medical doctors in order to better serve SCD patients visiting the hospital.