{"title":"An Algorithm to Impute Missing Values in Medical Datasets to Predict the Risk of Diseases in Patients","authors":"H. V. Bhagat, Manminder Singh","doi":"10.1109/DELCON57910.2023.10127481","DOIUrl":null,"url":null,"abstract":"The advent of Internet of Things (IoT) revolutionizes the healthcare sector by altering how medical equipment and humans interact while providing healthcare solutions. Applications of IoT in healthcare are advantageous to patients, their families, physicians, hospitals, and insurance companies. Before the IoT, patients could only speak with doctors physically, over the phone, or by text. There wasn’t a practical way for medical staff or facilities to continuously assess patients' health and provide guidance. Remote monitoring in the healthcare industry is made possible by utilizing sensor devices that help physicians to deliver superlative care to their patients. Sensor devices play a crucial role in the remote monitoring of patients. However, the failure of network or sensor devices may result in data with missing values. This incomplete data cannot be utilized to diagnose a patient or to make effective predictions of the disease in early diagnosis. To ensure the effective diagnosis and predict the risk of diseases in the patient, this paper proposed a novel ReMiss (Removing Missingness in medical datasets) algorithm to impute the missing values present in incomplete medical datasets. The proposed ReMiss algorithm is a data-partitioning based missing values imputation technique that partitions the datasets into complete and incomplete subsets to predict the missing values efficiently. The adjusted coefficient of determination, RMSEs and classification accuracy are the performance metrics used to evaluate the performance of the proposed ReMiss algorithm with the existing imputation techniques. The proposed ReMiss algorithm obtained an average classification accuracy of 93.53%.","PeriodicalId":193577,"journal":{"name":"2023 2nd Edition of IEEE Delhi Section Flagship Conference (DELCON)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 2nd Edition of IEEE Delhi Section Flagship Conference (DELCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DELCON57910.2023.10127481","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The advent of Internet of Things (IoT) revolutionizes the healthcare sector by altering how medical equipment and humans interact while providing healthcare solutions. Applications of IoT in healthcare are advantageous to patients, their families, physicians, hospitals, and insurance companies. Before the IoT, patients could only speak with doctors physically, over the phone, or by text. There wasn’t a practical way for medical staff or facilities to continuously assess patients' health and provide guidance. Remote monitoring in the healthcare industry is made possible by utilizing sensor devices that help physicians to deliver superlative care to their patients. Sensor devices play a crucial role in the remote monitoring of patients. However, the failure of network or sensor devices may result in data with missing values. This incomplete data cannot be utilized to diagnose a patient or to make effective predictions of the disease in early diagnosis. To ensure the effective diagnosis and predict the risk of diseases in the patient, this paper proposed a novel ReMiss (Removing Missingness in medical datasets) algorithm to impute the missing values present in incomplete medical datasets. The proposed ReMiss algorithm is a data-partitioning based missing values imputation technique that partitions the datasets into complete and incomplete subsets to predict the missing values efficiently. The adjusted coefficient of determination, RMSEs and classification accuracy are the performance metrics used to evaluate the performance of the proposed ReMiss algorithm with the existing imputation techniques. The proposed ReMiss algorithm obtained an average classification accuracy of 93.53%.
物联网(IoT)的出现通过改变医疗设备和人类的互动方式,同时提供医疗保健解决方案,彻底改变了医疗保健行业。物联网在医疗保健领域的应用对患者、家属、医生、医院和保险公司都是有利的。在物联网之前,患者只能通过电话或短信与医生进行身体交流。医务人员或医疗机构没有一种切实可行的方法来持续评估患者的健康状况并提供指导。通过利用传感器设备,可以帮助医生向患者提供最高级的护理,从而使医疗保健行业的远程监控成为可能。传感器设备在患者远程监护中起着至关重要的作用。但是,网络或传感器设备故障可能导致数据丢失值。这种不完整的数据不能用于诊断患者或在早期诊断中对疾病做出有效的预测。为了保证有效诊断和预测患者的疾病风险,本文提出了一种新的ReMiss (removal Missingness in medical datasets)算法,对不完整医疗数据集中存在的缺失值进行估算。本文提出的ReMiss算法是一种基于数据划分的缺失值输入技术,它将数据集划分为完整子集和不完整子集,从而有效地预测缺失值。调整后的确定系数、均方根误差和分类精度是评价ReMiss算法与现有归算技术性能的性能指标。所提出的ReMiss算法的平均分类准确率为93.53%。