Elucidating Celecoxib's Preventive Effect in Capecitabine-Induced Hand-Foot Syndrome Using Medical Natural Language Processing.

IF 2.8 Q2 ONCOLOGY

JCO Clinical Cancer Informatics Pub Date : 2025-08-01 Epub Date: 2025-08-12 DOI:10.1200/CCI-25-00096

Masami Tsuchiya, Yoshimasa Kawazoe, Kiminori Shimamoto, Tomohisa Seki, Shungo Imai, Hayato Kizaki, Emiko Shinohara, Shuntaro Yada, Shoko Wakamiya, Eiji Aramaki, Satoko Hori

{"title":"Elucidating Celecoxib's Preventive Effect in Capecitabine-Induced Hand-Foot Syndrome Using Medical Natural Language Processing.","authors":"Masami Tsuchiya, Yoshimasa Kawazoe, Kiminori Shimamoto, Tomohisa Seki, Shungo Imai, Hayato Kizaki, Emiko Shinohara, Shuntaro Yada, Shoko Wakamiya, Eiji Aramaki, Satoko Hori","doi":"10.1200/CCI-25-00096","DOIUrl":null,"url":null,"abstract":"Purpose: Capecitabine, an oral anticancer agent, frequently causes hand-foot syndrome (HFS), affecting patients' quality of life and treatment adherence. However, such symptomatic toxicities are often difficult to detect in structured electronic health record (EHR) data. This study primarily aimed to validate a natural language processing (NLP) approach to identifying capecitabine-induced HFS from unstructured clinical text and demonstrate its application in evaluating medication-associated adverse event trends in real-world settings.Methods: We conducted a retrospective cohort study using EHRs from the University of Tokyo Hospital (2004-2021). HFS cases were identified using the MedNERN-CR-JA NLP model. After propensity score matching, we compared capecitabine users with and without celecoxib and assessed time to HFS onset using Cox proportional hazards models. NLP-based HFS detection was validated through manual annotation of aggregated clinical notes. Negative control and sensitivity analyses ensured robustness.Results: Among 44,502 patients with cancer, 669 capecitabine users were analyzed. HFS incidence was significantly higher among capecitabine users (hazard ratio [HR], 1.93 [95% CI, 1.48 to 2.52]; P < .001) compared with nonusers. Celecoxib use showed a suggestive association with a reduced HFS risk (HR, 0.51 [95% CI, 0.24 to 1.07]; P = .073). The NLP model demonstrated high accuracy in identifying HFS, achieving a precision of 0.875, recall of 1.000, and F1 score of 0.933, based on manual annotation of patient-level clinical notes. Outcome trends remained consistent when using manually annotated HFS case labels instead of NLP-detected events, supporting the method's robustness.Conclusion: These findings demonstrate the effectiveness of NLP in detecting HFS from real-world clinical records. The application to celecoxib-HFS detection illustrates the potential utility of this approach for retrospective safety analysis. Further work is needed to evaluate generalizability across diverse clinical settings.","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2500096"},"PeriodicalIF":2.8000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12341754/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JCO Clinical Cancer Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1200/CCI-25-00096","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/8/12 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Purpose: Capecitabine, an oral anticancer agent, frequently causes hand-foot syndrome (HFS), affecting patients' quality of life and treatment adherence. However, such symptomatic toxicities are often difficult to detect in structured electronic health record (EHR) data. This study primarily aimed to validate a natural language processing (NLP) approach to identifying capecitabine-induced HFS from unstructured clinical text and demonstrate its application in evaluating medication-associated adverse event trends in real-world settings.

Methods: We conducted a retrospective cohort study using EHRs from the University of Tokyo Hospital (2004-2021). HFS cases were identified using the MedNERN-CR-JA NLP model. After propensity score matching, we compared capecitabine users with and without celecoxib and assessed time to HFS onset using Cox proportional hazards models. NLP-based HFS detection was validated through manual annotation of aggregated clinical notes. Negative control and sensitivity analyses ensured robustness.

Results: Among 44,502 patients with cancer, 669 capecitabine users were analyzed. HFS incidence was significantly higher among capecitabine users (hazard ratio [HR], 1.93 [95% CI, 1.48 to 2.52]; P < .001) compared with nonusers. Celecoxib use showed a suggestive association with a reduced HFS risk (HR, 0.51 [95% CI, 0.24 to 1.07]; P = .073). The NLP model demonstrated high accuracy in identifying HFS, achieving a precision of 0.875, recall of 1.000, and F₁ score of 0.933, based on manual annotation of patient-level clinical notes. Outcome trends remained consistent when using manually annotated HFS case labels instead of NLP-detected events, supporting the method's robustness.

Conclusion: These findings demonstrate the effectiveness of NLP in detecting HFS from real-world clinical records. The application to celecoxib-HFS detection illustrates the potential utility of this approach for retrospective safety analysis. Further work is needed to evaluate generalizability across diverse clinical settings.

查看原文本刊更多论文

利用医学自然语言处理阐明塞来昔布对卡培他滨所致手足综合征的预防作用。

目的：卡培他滨是一种口服抗癌药物，常引起手足综合征（HFS），影响患者的生活质量和治疗依从性。然而，这种症状性毒性通常难以在结构化电子健康记录（EHR）数据中检测到。本研究主要旨在验证自然语言处理（NLP）方法从非结构化临床文本中识别卡培他滨诱导的HFS，并展示其在评估现实环境中药物相关不良事件趋势方面的应用。方法：我们使用东京大学医院（2004-2021）的电子病历进行了回顾性队列研究。HFS病例采用MedNERN-CR-JA NLP模型进行鉴定。在倾向评分匹配后，我们比较了卡培他滨使用和不使用塞来昔布的使用者，并使用Cox比例风险模型评估了HFS发作的时间。基于nlp的HFS检测通过人工标注汇总的临床记录进行验证。阴性对照和敏感性分析确保了稳健性。结果：在44,502例癌症患者中，分析了669例卡培他滨使用者。卡培他滨服用者的HFS发生率显著升高(风险比[HR]， 1.93 [95% CI, 1.48 ~ 2.52]；P < 0.001)。塞来昔布的使用与HFS风险降低相关(HR, 0.51 [95% CI, 0.24 ~ 1.07]；P = .073)。NLP模型对HFS的识别准确率较高，在手工标注患者级临床笔记的基础上，准确率为0.875，召回率为1.000，F1评分为0.933。当使用手动注释的HFS病例标签而不是nlp检测到的事件时，结果趋势保持一致，支持方法的鲁棒性。结论：这些发现证明了NLP在从现实世界的临床记录中检测HFS的有效性。塞来昔布- hfs检测的应用说明了这种方法在回顾性安全性分析中的潜在效用。需要进一步的工作来评估不同临床环境下的普遍性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

JCO Clinical Cancer Informatics ONCOLOGY-

CiteScore

6.20

自引率

4.80%

发文量

190