{"title":"Prediction of Cervical Cancer Patients' Survival Period with Machine Learning Techniques.","authors":"Intorn Chanudom, Ekkasit Tharavichitkul, Wimalin Laosiritaworn","doi":"10.4258/hir.2024.30.1.60","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>The objective of this research is to apply machine learning (ML) algorithms to predict the survival of cervical cancer patients. The aim was to address the limitations of traditional statistical methods, which often fail to provide accurate answers due to the complexity of the problem.</p><p><strong>Methods: </strong>This research employed visualization techniques for initial data understanding. Subsequently, ML algorithms were used to develop both classification and regression models for survival prediction. In the classification models, we trained the algorithms to predict the time interval between the initial diagnosis and the patient's death. The intervals were categorized as \"<6 months,\" \"6 months to 3 years,\" \"3 years to 5 years,\" and \">5 years.\" The regression model aimed to predict survival time (in months). We used attribute weights to gain insights into the model, highlighting features with a significant impact on predictions and offering valuable insights into the model's behavior and decision-making process.</p><p><strong>Results: </strong>The gradient boosting trees algorithm achieved an 81.55% accuracy in the classification model, while the random forest algorithm excelled in the regression model, with a root mean square error of 22.432. Notably, radiation doses around the affected areas significantly influenced survival duration.</p><p><strong>Conclusions: </strong>Machine learning demonstrated the ability to provide high-accuracy predictions of survival periods in both classification and regression problems. This suggests its potential use as a decision-support tool in the process of treatment planning and resource allocation for each patient.</p>","PeriodicalId":12947,"journal":{"name":"Healthcare Informatics Research","volume":"30 1","pages":"60-72"},"PeriodicalIF":2.3000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10879821/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Healthcare Informatics Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4258/hir.2024.30.1.60","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/31 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0
Abstract
Objectives: The objective of this research is to apply machine learning (ML) algorithms to predict the survival of cervical cancer patients. The aim was to address the limitations of traditional statistical methods, which often fail to provide accurate answers due to the complexity of the problem.
Methods: This research employed visualization techniques for initial data understanding. Subsequently, ML algorithms were used to develop both classification and regression models for survival prediction. In the classification models, we trained the algorithms to predict the time interval between the initial diagnosis and the patient's death. The intervals were categorized as "<6 months," "6 months to 3 years," "3 years to 5 years," and ">5 years." The regression model aimed to predict survival time (in months). We used attribute weights to gain insights into the model, highlighting features with a significant impact on predictions and offering valuable insights into the model's behavior and decision-making process.
Results: The gradient boosting trees algorithm achieved an 81.55% accuracy in the classification model, while the random forest algorithm excelled in the regression model, with a root mean square error of 22.432. Notably, radiation doses around the affected areas significantly influenced survival duration.
Conclusions: Machine learning demonstrated the ability to provide high-accuracy predictions of survival periods in both classification and regression problems. This suggests its potential use as a decision-support tool in the process of treatment planning and resource allocation for each patient.