放疗患者每周报告的短期和长期预后预测:单患者时间序列模型与基于变压器的多患者时间序列模型

IF 6.1 3区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY
Yang Yan, Zhong Chen, Xinglei Shen, Ronald C Chen, Hao Gao
{"title":"放疗患者每周报告的短期和长期预后预测:单患者时间序列模型与基于变压器的多患者时间序列模型","authors":"Yang Yan, Zhong Chen, Xinglei Shen, Ronald C Chen, Hao Gao","doi":"10.1186/s13040-025-00464-7","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Patient-reported outcomes (PROs) are direct reports from patients on health status, symptoms, quality of life, or treatment satisfaction, offering critical insights into subjective experiences that clinical metrics may overlook. Accurately predicting personalized short- and long-term weekly PROs during radiotherapy is essential for monitoring health status, optimizing treatment efficacy, and enabling timely interventions to manage side effects.</p><p><strong>Methods: </strong>Based on the well-documented prostate cancer PRO dataset with 17 patients after pre-processing, this study evaluates single-patient time series models (i.e., vector autoregression (VAR) and VAR with incremental ground truth PRO data (VAR-Inc)) and a transformer-based multi-patient model (i.e., Temporal Fusion Transformer (TFT)) for short- and long-term weekly PRO prediction. VAR-Inc integrates follow-up PRO data to refine predictions, while TFT leverages multi-patient heterogeneous information to capture complex temporal patterns.</p><p><strong>Results: </strong>Key experimental results on prostate cancer patients demonstrate that (1) VAR-Inc demonstrated superior performance (lower MAE/RMSE) over VAR, highlighting the importance of incremental PRO updates. (2) TFT significantly outperformed both VAR models in long-term prediction, with statistical significance, by utilizing multi-patient data. (3) TFT effectively captured weekly PRO trends and variations, aligning closely with ground truth. (4) Unlike single-patient models, TFT built robust predictive frameworks by integrating cross-patient similarities and complementary patients' PRO information. VAR-Inc's performance deteriorated with missing follow-up PROs, whereas TFT remained stable, overcoming this limitation. On average, TFT outperforms VAR and VAR-Inc by achieving a lowest MAE 0.7715, while the MAE of VAR and VAR-Inc are 1.1329 and 0.8089, respectively. Furthermore, TFT is superior to VAR and VAR-Inc by achieving a lowest RMSE 0.9586, while the RMSE of VAR and VAR-Inc are 1.4817 and 1.0693, respectively.</p><p><strong>Conclusion: </strong>TFT emerges as a reliable approach for PRO prediction, excelling in long-term accuracy, trend capture, and resilience to data gaps by leveraging multi-patient information. Its ability to synthesize heterogeneous PRO data offers advantages over single-patient models, supporting personalized treatment adaptation and informed clinical decision-making. This underscores the potential of transformer-based models in enhancing PRO-driven radiotherapy management.</p>","PeriodicalId":48947,"journal":{"name":"Biodata Mining","volume":"18 1","pages":"53"},"PeriodicalIF":6.1000,"publicationDate":"2025-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12341308/pdf/","citationCount":"0","resultStr":"{\"title\":\"Short- and long-term weekly patient-reported outcomes prediction undergoing radiotherapy: single-patient time series model vs. transformer-based multi-patient time series model.\",\"authors\":\"Yang Yan, Zhong Chen, Xinglei Shen, Ronald C Chen, Hao Gao\",\"doi\":\"10.1186/s13040-025-00464-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Patient-reported outcomes (PROs) are direct reports from patients on health status, symptoms, quality of life, or treatment satisfaction, offering critical insights into subjective experiences that clinical metrics may overlook. Accurately predicting personalized short- and long-term weekly PROs during radiotherapy is essential for monitoring health status, optimizing treatment efficacy, and enabling timely interventions to manage side effects.</p><p><strong>Methods: </strong>Based on the well-documented prostate cancer PRO dataset with 17 patients after pre-processing, this study evaluates single-patient time series models (i.e., vector autoregression (VAR) and VAR with incremental ground truth PRO data (VAR-Inc)) and a transformer-based multi-patient model (i.e., Temporal Fusion Transformer (TFT)) for short- and long-term weekly PRO prediction. VAR-Inc integrates follow-up PRO data to refine predictions, while TFT leverages multi-patient heterogeneous information to capture complex temporal patterns.</p><p><strong>Results: </strong>Key experimental results on prostate cancer patients demonstrate that (1) VAR-Inc demonstrated superior performance (lower MAE/RMSE) over VAR, highlighting the importance of incremental PRO updates. (2) TFT significantly outperformed both VAR models in long-term prediction, with statistical significance, by utilizing multi-patient data. (3) TFT effectively captured weekly PRO trends and variations, aligning closely with ground truth. (4) Unlike single-patient models, TFT built robust predictive frameworks by integrating cross-patient similarities and complementary patients' PRO information. VAR-Inc's performance deteriorated with missing follow-up PROs, whereas TFT remained stable, overcoming this limitation. On average, TFT outperforms VAR and VAR-Inc by achieving a lowest MAE 0.7715, while the MAE of VAR and VAR-Inc are 1.1329 and 0.8089, respectively. Furthermore, TFT is superior to VAR and VAR-Inc by achieving a lowest RMSE 0.9586, while the RMSE of VAR and VAR-Inc are 1.4817 and 1.0693, respectively.</p><p><strong>Conclusion: </strong>TFT emerges as a reliable approach for PRO prediction, excelling in long-term accuracy, trend capture, and resilience to data gaps by leveraging multi-patient information. Its ability to synthesize heterogeneous PRO data offers advantages over single-patient models, supporting personalized treatment adaptation and informed clinical decision-making. This underscores the potential of transformer-based models in enhancing PRO-driven radiotherapy management.</p>\",\"PeriodicalId\":48947,\"journal\":{\"name\":\"Biodata Mining\",\"volume\":\"18 1\",\"pages\":\"53\"},\"PeriodicalIF\":6.1000,\"publicationDate\":\"2025-08-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12341308/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biodata Mining\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1186/s13040-025-00464-7\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MATHEMATICAL & COMPUTATIONAL BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biodata Mining","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s13040-025-00464-7","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

背景:患者报告的结果(pro)是患者对健康状况、症状、生活质量或治疗满意度的直接报告,为临床指标可能忽略的主观体验提供了重要的见解。准确预测放射治疗期间个性化的短期和长期每周PROs对于监测健康状况、优化治疗效果和及时干预以管理副作用至关重要。方法:基于经过预处理的17例前列腺癌PRO数据集,本研究评估了单患者时间序列模型(即向量自回归(VAR)和增量地真PRO数据(VAR- inc)的VAR)和基于变压器的多患者模型(即时间融合变压器(TFT))用于短期和长期每周PRO预测。VAR-Inc整合了后续的PRO数据来改进预测,而TFT利用多患者异构信息来捕获复杂的时间模式。结果:前列腺癌患者的关键实验结果表明(1)VAR- inc表现出优于VAR的性能(MAE/RMSE更低),突出了PRO增量更新的重要性。(2)在利用多患者数据进行长期预测时,TFT显著优于VAR模型,且具有统计学意义。(3) TFT有效捕获了每周PRO趋势和变化,与实际情况密切相关。(4)与单一患者模型不同,TFT通过整合跨患者相似性和补充患者PRO信息构建了鲁棒的预测框架。VAR-Inc的表现因缺少后续PROs而恶化,而TFT保持稳定,克服了这一限制。平均而言,TFT优于VAR和VAR- inc,其MAE最低为0.7715,而VAR和VAR- inc的MAE分别为1.1329和0.8089。此外,TFT优于VAR和VAR- inc, RMSE最低为0.9586,而VAR和VAR- inc的RMSE分别为1.4817和1.0693。结论:TFT是预测PRO的可靠方法,通过利用多患者信息,在长期准确性、趋势捕获和对数据缺口的弹性方面表现出色。其综合异构PRO数据的能力比单一患者模型具有优势,支持个性化治疗适应和知情的临床决策。这强调了基于变压器的模型在增强pro驱动的放射治疗管理方面的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Short- and long-term weekly patient-reported outcomes prediction undergoing radiotherapy: single-patient time series model vs. transformer-based multi-patient time series model.

Background: Patient-reported outcomes (PROs) are direct reports from patients on health status, symptoms, quality of life, or treatment satisfaction, offering critical insights into subjective experiences that clinical metrics may overlook. Accurately predicting personalized short- and long-term weekly PROs during radiotherapy is essential for monitoring health status, optimizing treatment efficacy, and enabling timely interventions to manage side effects.

Methods: Based on the well-documented prostate cancer PRO dataset with 17 patients after pre-processing, this study evaluates single-patient time series models (i.e., vector autoregression (VAR) and VAR with incremental ground truth PRO data (VAR-Inc)) and a transformer-based multi-patient model (i.e., Temporal Fusion Transformer (TFT)) for short- and long-term weekly PRO prediction. VAR-Inc integrates follow-up PRO data to refine predictions, while TFT leverages multi-patient heterogeneous information to capture complex temporal patterns.

Results: Key experimental results on prostate cancer patients demonstrate that (1) VAR-Inc demonstrated superior performance (lower MAE/RMSE) over VAR, highlighting the importance of incremental PRO updates. (2) TFT significantly outperformed both VAR models in long-term prediction, with statistical significance, by utilizing multi-patient data. (3) TFT effectively captured weekly PRO trends and variations, aligning closely with ground truth. (4) Unlike single-patient models, TFT built robust predictive frameworks by integrating cross-patient similarities and complementary patients' PRO information. VAR-Inc's performance deteriorated with missing follow-up PROs, whereas TFT remained stable, overcoming this limitation. On average, TFT outperforms VAR and VAR-Inc by achieving a lowest MAE 0.7715, while the MAE of VAR and VAR-Inc are 1.1329 and 0.8089, respectively. Furthermore, TFT is superior to VAR and VAR-Inc by achieving a lowest RMSE 0.9586, while the RMSE of VAR and VAR-Inc are 1.4817 and 1.0693, respectively.

Conclusion: TFT emerges as a reliable approach for PRO prediction, excelling in long-term accuracy, trend capture, and resilience to data gaps by leveraging multi-patient information. Its ability to synthesize heterogeneous PRO data offers advantages over single-patient models, supporting personalized treatment adaptation and informed clinical decision-making. This underscores the potential of transformer-based models in enhancing PRO-driven radiotherapy management.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Biodata Mining
Biodata Mining MATHEMATICAL & COMPUTATIONAL BIOLOGY-
CiteScore
7.90
自引率
0.00%
发文量
28
审稿时长
23 weeks
期刊介绍: BioData Mining is an open access, open peer-reviewed journal encompassing research on all aspects of data mining applied to high-dimensional biological and biomedical data, focusing on computational aspects of knowledge discovery from large-scale genetic, transcriptomic, genomic, proteomic, and metabolomic data. Topical areas include, but are not limited to: -Development, evaluation, and application of novel data mining and machine learning algorithms. -Adaptation, evaluation, and application of traditional data mining and machine learning algorithms. -Open-source software for the application of data mining and machine learning algorithms. -Design, development and integration of databases, software and web services for the storage, management, retrieval, and analysis of data from large scale studies. -Pre-processing, post-processing, modeling, and interpretation of data mining and machine learning results for biological interpretation and knowledge discovery.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信