癌症化疗患者症状升级的预先预测。

AMIA ... Annual Symposium proceedings. AMIA Symposium Pub Date : 2025-05-22 eCollection Date: 2024-01-01

Joseph Finkelstein, Aref Smiley, Christina Echeverria, Kathi Mooney

{"title":"癌症化疗患者症状升级的预先预测。","authors":"Joseph Finkelstein, Aref Smiley, Christina Echeverria, Kathi Mooney","doi":"","DOIUrl":null,"url":null,"abstract":"This study evaluates the utility of machine learning (ML) algorithms in early forecasting of total symptom score changes from daily self-reports of 339 chemotherapy patients. The dataset comprised 12 specific symptoms, with severity and distress for each symptom rated on a 1 to 10 scale, generating a \"total symptom score\" ranging from 0 to 230. To address the challenge of an unbalanced original dataset, where Class I (score change ≥ 5) and Class II (score change < 5) were unevenly represented, we created a balanced dataset specifically for model training. This process involved a stratified sampling technique to ensure equitable representation of both classes, enhancing the predictive analysis. Using the MATLAB® Classification Learner application, we investigated nine ML models, including decision trees, discriminant analysis, support vector machines (SVM), and others, each applying various classifiers. The objective was to predict the total symptom score change based on the preceding 3 to 5 days' symptom data. Models were trained on the balanced dataset to mitigate the original imbalance's impact, with comparative evaluations also conducted on the unbalanced data to assess performance differences. The analysis revealed that certain classifiers, such as SVM, delivered optimal performance on the unbalanced dataset, with an accuracy rate peaking at 82%. Yet, these models tended to frequently misclassify Class I as Class II. In contrast, the Ensemble algorithm equipped with the RUSBoost classifier demonstrated exceptional skill in accurately classifying both classes on both datasets, achieving accuracies of 59%, 59.3%, and 59.4% for data from 3, 4, and 5 days prior, respectively. Notably, these figures slightly improved to 61.16%, 58.41%, and 60.05% upon utilizing the balanced dataset for training. The deployment of a balanced dataset for model training underscores the significant potential of ML algorithms in improving symptom management for chemotherapy patients, offering a path to enhanced patient care and quality of life through targeted, personalized symptom monitoring.","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"427-432"},"PeriodicalIF":0.0000,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099323/pdf/","citationCount":"0","resultStr":"{\"title\":\"Preemptive Forecasting of Symptom Escalation in Cancer Patients Undergoing Chemotherapy.\",\"authors\":\"Joseph Finkelstein, Aref Smiley, Christina Echeverria, Kathi Mooney\",\"doi\":\"\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This study evaluates the utility of machine learning (ML) algorithms in early forecasting of total symptom score changes from daily self-reports of 339 chemotherapy patients. The dataset comprised 12 specific symptoms, with severity and distress for each symptom rated on a 1 to 10 scale, generating a \\\"total symptom score\\\" ranging from 0 to 230. To address the challenge of an unbalanced original dataset, where Class I (score change ≥ 5) and Class II (score change < 5) were unevenly represented, we created a balanced dataset specifically for model training. This process involved a stratified sampling technique to ensure equitable representation of both classes, enhancing the predictive analysis. Using the MATLAB® Classification Learner application, we investigated nine ML models, including decision trees, discriminant analysis, support vector machines (SVM), and others, each applying various classifiers. The objective was to predict the total symptom score change based on the preceding 3 to 5 days' symptom data. Models were trained on the balanced dataset to mitigate the original imbalance's impact, with comparative evaluations also conducted on the unbalanced data to assess performance differences. The analysis revealed that certain classifiers, such as SVM, delivered optimal performance on the unbalanced dataset, with an accuracy rate peaking at 82%. Yet, these models tended to frequently misclassify Class I as Class II. In contrast, the Ensemble algorithm equipped with the RUSBoost classifier demonstrated exceptional skill in accurately classifying both classes on both datasets, achieving accuracies of 59%, 59.3%, and 59.4% for data from 3, 4, and 5 days prior, respectively. Notably, these figures slightly improved to 61.16%, 58.41%, and 60.05% upon utilizing the balanced dataset for training. The deployment of a balanced dataset for model training underscores the significant potential of ML algorithms in improving symptom management for chemotherapy patients, offering a path to enhanced patient care and quality of life through targeted, personalized symptom monitoring.\",\"PeriodicalId\":72180,\"journal\":{\"name\":\"AMIA ... Annual Symposium proceedings. AMIA Symposium\",\"volume\":\"2024 \",\"pages\":\"427-432\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-05-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099323/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"AMIA ... Annual Symposium proceedings. AMIA Symposium\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"AMIA ... Annual Symposium proceedings. AMIA Symposium","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

本研究评估了机器学习（ML）算法在早期预测339名化疗患者每日自我报告的总症状评分变化中的效用。该数据集包括12种特定症状，每种症状的严重程度和痛苦程度按1到10的等级评分，产生从0到230的“总症状评分”。为了解决不平衡原始数据集的挑战，其中I类（分数变化≥5）和II类（分数变化< 5）的代表不均匀，我们创建了一个专门用于模型训练的平衡数据集。这一过程涉及分层抽样技术，以确保公平代表两个阶层，加强预测分析。使用MATLAB®Classification Learner应用程序，我们研究了九种ML模型，包括决策树、判别分析、支持向量机（SVM）等，每种模型都应用了不同的分类器。目的是根据前3 - 5天的症状数据预测总症状评分的变化。在平衡数据集上训练模型，以减轻原始不平衡的影响，并对不平衡数据进行比较评估，以评估性能差异。分析显示，某些分类器，如SVM，在不平衡数据集上提供了最佳性能，准确率达到82%。然而，这些模型往往经常将I类错误地划分为II类。相比之下，配备RUSBoost分类器的Ensemble算法在对两个数据集上的两个类进行准确分类方面表现出了出色的技能，对3天、4天和5天前的数据分别实现了59%、59.3%和59.4%的准确率。值得注意的是，在使用平衡数据集进行训练时，这些数字略有提高，分别为61.16%，58.41%和60.05%。平衡数据集用于模型训练的部署强调了ML算法在改善化疗患者症状管理方面的巨大潜力，通过有针对性的个性化症状监测提供了增强患者护理和生活质量的途径。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

本刊更多论文

Preemptive Forecasting of Symptom Escalation in Cancer Patients Undergoing Chemotherapy.

This study evaluates the utility of machine learning (ML) algorithms in early forecasting of total symptom score changes from daily self-reports of 339 chemotherapy patients. The dataset comprised 12 specific symptoms, with severity and distress for each symptom rated on a 1 to 10 scale, generating a "total symptom score" ranging from 0 to 230. To address the challenge of an unbalanced original dataset, where Class I (score change ≥ 5) and Class II (score change < 5) were unevenly represented, we created a balanced dataset specifically for model training. This process involved a stratified sampling technique to ensure equitable representation of both classes, enhancing the predictive analysis. Using the MATLAB® Classification Learner application, we investigated nine ML models, including decision trees, discriminant analysis, support vector machines (SVM), and others, each applying various classifiers. The objective was to predict the total symptom score change based on the preceding 3 to 5 days' symptom data. Models were trained on the balanced dataset to mitigate the original imbalance's impact, with comparative evaluations also conducted on the unbalanced data to assess performance differences. The analysis revealed that certain classifiers, such as SVM, delivered optimal performance on the unbalanced dataset, with an accuracy rate peaking at 82%. Yet, these models tended to frequently misclassify Class I as Class II. In contrast, the Ensemble algorithm equipped with the RUSBoost classifier demonstrated exceptional skill in accurately classifying both classes on both datasets, achieving accuracies of 59%, 59.3%, and 59.4% for data from 3, 4, and 5 days prior, respectively. Notably, these figures slightly improved to 61.16%, 58.41%, and 60.05% upon utilizing the balanced dataset for training. The deployment of a balanced dataset for model training underscores the significant potential of ML algorithms in improving symptom management for chemotherapy patients, offering a path to enhanced patient care and quality of life through targeted, personalized symptom monitoring.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

AMIA ... Annual Symposium proceedings. AMIA Symposium

自引率

0.00%

发文量