An Algorithm to Impute Missing Values in Medical Datasets to Predict the Risk of Diseases in Patients

H. V. Bhagat, Manminder Singh
{"title":"An Algorithm to Impute Missing Values in Medical Datasets to Predict the Risk of Diseases in Patients","authors":"H. V. Bhagat, Manminder Singh","doi":"10.1109/DELCON57910.2023.10127481","DOIUrl":null,"url":null,"abstract":"The advent of Internet of Things (IoT) revolutionizes the healthcare sector by altering how medical equipment and humans interact while providing healthcare solutions. Applications of IoT in healthcare are advantageous to patients, their families, physicians, hospitals, and insurance companies. Before the IoT, patients could only speak with doctors physically, over the phone, or by text. There wasn’t a practical way for medical staff or facilities to continuously assess patients' health and provide guidance. Remote monitoring in the healthcare industry is made possible by utilizing sensor devices that help physicians to deliver superlative care to their patients. Sensor devices play a crucial role in the remote monitoring of patients. However, the failure of network or sensor devices may result in data with missing values. This incomplete data cannot be utilized to diagnose a patient or to make effective predictions of the disease in early diagnosis. To ensure the effective diagnosis and predict the risk of diseases in the patient, this paper proposed a novel ReMiss (Removing Missingness in medical datasets) algorithm to impute the missing values present in incomplete medical datasets. The proposed ReMiss algorithm is a data-partitioning based missing values imputation technique that partitions the datasets into complete and incomplete subsets to predict the missing values efficiently. The adjusted coefficient of determination, RMSEs and classification accuracy are the performance metrics used to evaluate the performance of the proposed ReMiss algorithm with the existing imputation techniques. The proposed ReMiss algorithm obtained an average classification accuracy of 93.53%.","PeriodicalId":193577,"journal":{"name":"2023 2nd Edition of IEEE Delhi Section Flagship Conference (DELCON)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 2nd Edition of IEEE Delhi Section Flagship Conference (DELCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DELCON57910.2023.10127481","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The advent of Internet of Things (IoT) revolutionizes the healthcare sector by altering how medical equipment and humans interact while providing healthcare solutions. Applications of IoT in healthcare are advantageous to patients, their families, physicians, hospitals, and insurance companies. Before the IoT, patients could only speak with doctors physically, over the phone, or by text. There wasn’t a practical way for medical staff or facilities to continuously assess patients' health and provide guidance. Remote monitoring in the healthcare industry is made possible by utilizing sensor devices that help physicians to deliver superlative care to their patients. Sensor devices play a crucial role in the remote monitoring of patients. However, the failure of network or sensor devices may result in data with missing values. This incomplete data cannot be utilized to diagnose a patient or to make effective predictions of the disease in early diagnosis. To ensure the effective diagnosis and predict the risk of diseases in the patient, this paper proposed a novel ReMiss (Removing Missingness in medical datasets) algorithm to impute the missing values present in incomplete medical datasets. The proposed ReMiss algorithm is a data-partitioning based missing values imputation technique that partitions the datasets into complete and incomplete subsets to predict the missing values efficiently. The adjusted coefficient of determination, RMSEs and classification accuracy are the performance metrics used to evaluate the performance of the proposed ReMiss algorithm with the existing imputation techniques. The proposed ReMiss algorithm obtained an average classification accuracy of 93.53%.
一种基于缺失值的医疗数据集预测患者疾病风险的算法
物联网(IoT)的出现通过改变医疗设备和人类的互动方式,同时提供医疗保健解决方案,彻底改变了医疗保健行业。物联网在医疗保健领域的应用对患者、家属、医生、医院和保险公司都是有利的。在物联网之前,患者只能通过电话或短信与医生进行身体交流。医务人员或医疗机构没有一种切实可行的方法来持续评估患者的健康状况并提供指导。通过利用传感器设备,可以帮助医生向患者提供最高级的护理,从而使医疗保健行业的远程监控成为可能。传感器设备在患者远程监护中起着至关重要的作用。但是,网络或传感器设备故障可能导致数据丢失值。这种不完整的数据不能用于诊断患者或在早期诊断中对疾病做出有效的预测。为了保证有效诊断和预测患者的疾病风险,本文提出了一种新的ReMiss (removal Missingness in medical datasets)算法,对不完整医疗数据集中存在的缺失值进行估算。本文提出的ReMiss算法是一种基于数据划分的缺失值输入技术,它将数据集划分为完整子集和不完整子集,从而有效地预测缺失值。调整后的确定系数、均方根误差和分类精度是评价ReMiss算法与现有归算技术性能的性能指标。所提出的ReMiss算法的平均分类准确率为93.53%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信