Machine Learning Approaches to Predict Symptoms in People With Cancer: Systematic Review.

IF 3.3 Q2 ONCOLOGY

JMIR Cancer Pub Date : 2024-03-19 DOI:10.2196/52322

Nahid Zeinali, Nayung Youn, Alaa Albashayreh, Weiguo Fan, Stéphanie Gilbertson White

{"title":"Machine Learning Approaches to Predict Symptoms in People With Cancer: Systematic Review.","authors":"Nahid Zeinali, Nayung Youn, Alaa Albashayreh, Weiguo Fan, Stéphanie Gilbertson White","doi":"10.2196/52322","DOIUrl":null,"url":null,"abstract":"Background: People with cancer frequently experience severe and distressing symptoms associated with cancer and its treatments. Predicting symptoms in patients with cancer continues to be a significant challenge for both clinicians and researchers. The rapid evolution of machine learning (ML) highlights the need for a current systematic review to improve cancer symptom prediction.Objective: This systematic review aims to synthesize the literature that has used ML algorithms to predict the development of cancer symptoms and to identify the predictors of these symptoms. This is essential for integrating new developments and identifying gaps in existing literature.Methods: We conducted this systematic review in accordance with the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) checklist. We conducted a systematic search of CINAHL, Embase, and PubMed for English records published from 1984 to August 11, 2023, using the following search terms: cancer, neoplasm, specific symptoms, neural networks, machine learning, specific algorithm names, and deep learning. All records that met the eligibility criteria were individually reviewed by 2 coauthors, and key findings were extracted and synthesized. We focused on studies using ML algorithms to predict cancer symptoms, excluding nonhuman research, technical reports, reviews, book chapters, conference proceedings, and inaccessible full texts.Results: A total of 42 studies were included, the majority of which were published after 2017. Most studies were conducted in North America (18/42, 43%) and Asia (16/42, 38%). The sample sizes in most studies (27/42, 64%) typically ranged from 100 to 1000 participants. The most prevalent category of algorithms was supervised ML, accounting for 39 (93%) of the 42 studies. Each of the methods-deep learning, ensemble classifiers, and unsupervised ML-constituted 3 (3%) of the 42 studies. The ML algorithms with the best performance were logistic regression (9/42, 17%), random forest (7/42, 13%), artificial neural networks (5/42, 9%), and decision trees (5/42, 9%). The most commonly included primary cancer sites were the head and neck (9/42, 22%) and breast (8/42, 19%), with 17 (41%) of the 42 studies not specifying the site. The most frequently studied symptoms were xerostomia (9/42, 14%), depression (8/42, 13%), pain (8/42, 13%), and fatigue (6/42, 10%). The significant predictors were age, gender, treatment type, treatment number, cancer site, cancer stage, chemotherapy, radiotherapy, chronic diseases, comorbidities, physical factors, and psychological factors.Conclusions: This review outlines the algorithms used for predicting symptoms in individuals with cancer. Given the diversity of symptoms people with cancer experience, analytic approaches that can handle complex and nonlinear relationships are critical. This knowledge can pave the way for crafting algorithms tailored to a specific symptom. In addition, to improve prediction precision, future research should compare cutting-edge ML strategies such as deep learning and ensemble methods with traditional statistical models.","PeriodicalId":45538,"journal":{"name":"JMIR Cancer","volume":"10 ","pages":"e52322"},"PeriodicalIF":3.3000,"publicationDate":"2024-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10988375/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR Cancer","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2196/52322","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Background: People with cancer frequently experience severe and distressing symptoms associated with cancer and its treatments. Predicting symptoms in patients with cancer continues to be a significant challenge for both clinicians and researchers. The rapid evolution of machine learning (ML) highlights the need for a current systematic review to improve cancer symptom prediction.

Objective: This systematic review aims to synthesize the literature that has used ML algorithms to predict the development of cancer symptoms and to identify the predictors of these symptoms. This is essential for integrating new developments and identifying gaps in existing literature.

Methods: We conducted this systematic review in accordance with the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) checklist. We conducted a systematic search of CINAHL, Embase, and PubMed for English records published from 1984 to August 11, 2023, using the following search terms: cancer, neoplasm, specific symptoms, neural networks, machine learning, specific algorithm names, and deep learning. All records that met the eligibility criteria were individually reviewed by 2 coauthors, and key findings were extracted and synthesized. We focused on studies using ML algorithms to predict cancer symptoms, excluding nonhuman research, technical reports, reviews, book chapters, conference proceedings, and inaccessible full texts.

Results: A total of 42 studies were included, the majority of which were published after 2017. Most studies were conducted in North America (18/42, 43%) and Asia (16/42, 38%). The sample sizes in most studies (27/42, 64%) typically ranged from 100 to 1000 participants. The most prevalent category of algorithms was supervised ML, accounting for 39 (93%) of the 42 studies. Each of the methods-deep learning, ensemble classifiers, and unsupervised ML-constituted 3 (3%) of the 42 studies. The ML algorithms with the best performance were logistic regression (9/42, 17%), random forest (7/42, 13%), artificial neural networks (5/42, 9%), and decision trees (5/42, 9%). The most commonly included primary cancer sites were the head and neck (9/42, 22%) and breast (8/42, 19%), with 17 (41%) of the 42 studies not specifying the site. The most frequently studied symptoms were xerostomia (9/42, 14%), depression (8/42, 13%), pain (8/42, 13%), and fatigue (6/42, 10%). The significant predictors were age, gender, treatment type, treatment number, cancer site, cancer stage, chemotherapy, radiotherapy, chronic diseases, comorbidities, physical factors, and psychological factors.

Conclusions: This review outlines the algorithms used for predicting symptoms in individuals with cancer. Given the diversity of symptoms people with cancer experience, analytic approaches that can handle complex and nonlinear relationships are critical. This knowledge can pave the way for crafting algorithms tailored to a specific symptom. In addition, to improve prediction precision, future research should compare cutting-edge ML strategies such as deep learning and ensemble methods with traditional statistical models.

查看原文本刊更多论文

预测癌症患者症状的机器学习方法：系统综述。

背景：癌症患者经常会出现与癌症及其治疗相关的严重而痛苦的症状。预测癌症患者的症状仍然是临床医生和研究人员面临的一项重大挑战。机器学习（ML）的快速发展凸显了当前系统综述改善癌症症状预测的必要性：本系统综述旨在综合使用 ML 算法预测癌症症状发展的文献，并确定这些症状的预测因素。这对于整合新的发展和确定现有文献中的空白点至关重要：我们按照 PRISMA（系统综述和元分析首选报告项目）清单进行了此次系统综述。我们在CINAHL、Embase和PubMed上对1984年至2023年8月11日发表的英文记录进行了系统检索，检索词包括：癌症、肿瘤、特定症状、神经网络、机器学习、特定算法名称和深度学习。所有符合资格标准的记录都由两位共同作者进行了逐一审阅，并提取和归纳了主要发现。我们重点关注使用 ML 算法预测癌症症状的研究，排除了非人类研究、技术报告、综述、书籍章节、会议论文集以及无法访问的全文：共纳入42项研究，其中大部分研究发表于2017年之后。大多数研究在北美（18/42，43%）和亚洲（16/42，38%）进行。大多数研究（27/42，64%）的样本量通常在 100 到 1000 名参与者之间。最普遍的算法类别是有监督的 ML，占 42 项研究中的 39 项（93%）。在 42 项研究中，深度学习、集合分类器和无监督 ML 各占 3 项（3%）。性能最好的 ML 算法是逻辑回归（9/42，17%）、随机森林（7/42，13%）、人工神经网络（5/42，9%）和决策树（5/42，9%）。最常见的原发癌症部位是头颈部（9/42，22%）和乳腺（8/42，19%），42 项研究中有 17 项（41%）未说明具体部位。研究中最常见的症状是口腔干燥（9/42，14%）、抑郁（8/42，13%）、疼痛（8/42，13%）和疲劳（6/42，10%）。重要的预测因素包括年龄、性别、治疗类型、治疗次数、癌症部位、癌症分期、化疗、放疗、慢性病、合并症、身体因素和心理因素：本综述概述了用于预测癌症患者症状的算法。鉴于癌症患者的症状多种多样，能够处理复杂和非线性关系的分析方法至关重要。这方面的知识可以为制定针对特定症状的算法铺平道路。此外，为了提高预测精度，未来的研究应将深度学习和集合方法等前沿的 ML 策略与传统的统计模型进行比较。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊