利用医学自然语言处理阐明塞来昔布对卡培他滨所致手足综合征的预防作用。

IF 2.8 Q2 ONCOLOGY
JCO Clinical Cancer Informatics Pub Date : 2025-08-01 Epub Date: 2025-08-12 DOI:10.1200/CCI-25-00096
Masami Tsuchiya, Yoshimasa Kawazoe, Kiminori Shimamoto, Tomohisa Seki, Shungo Imai, Hayato Kizaki, Emiko Shinohara, Shuntaro Yada, Shoko Wakamiya, Eiji Aramaki, Satoko Hori
{"title":"利用医学自然语言处理阐明塞来昔布对卡培他滨所致手足综合征的预防作用。","authors":"Masami Tsuchiya, Yoshimasa Kawazoe, Kiminori Shimamoto, Tomohisa Seki, Shungo Imai, Hayato Kizaki, Emiko Shinohara, Shuntaro Yada, Shoko Wakamiya, Eiji Aramaki, Satoko Hori","doi":"10.1200/CCI-25-00096","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>Capecitabine, an oral anticancer agent, frequently causes hand-foot syndrome (HFS), affecting patients' quality of life and treatment adherence. However, such symptomatic toxicities are often difficult to detect in structured electronic health record (EHR) data. This study primarily aimed to validate a natural language processing (NLP) approach to identifying capecitabine-induced HFS from unstructured clinical text and demonstrate its application in evaluating medication-associated adverse event trends in real-world settings.</p><p><strong>Methods: </strong>We conducted a retrospective cohort study using EHRs from the University of Tokyo Hospital (2004-2021). HFS cases were identified using the MedNERN-CR-JA NLP model. After propensity score matching, we compared capecitabine users with and without celecoxib and assessed time to HFS onset using Cox proportional hazards models. NLP-based HFS detection was validated through manual annotation of aggregated clinical notes. Negative control and sensitivity analyses ensured robustness.</p><p><strong>Results: </strong>Among 44,502 patients with cancer, 669 capecitabine users were analyzed. HFS incidence was significantly higher among capecitabine users (hazard ratio [HR], 1.93 [95% CI, 1.48 to 2.52]; <i>P</i> < .001) compared with nonusers. Celecoxib use showed a suggestive association with a reduced HFS risk (HR, 0.51 [95% CI, 0.24 to 1.07]; <i>P</i> = .073). The NLP model demonstrated high accuracy in identifying HFS, achieving a precision of 0.875, recall of 1.000, and F<sub>1</sub> score of 0.933, based on manual annotation of patient-level clinical notes. Outcome trends remained consistent when using manually annotated HFS case labels instead of NLP-detected events, supporting the method's robustness.</p><p><strong>Conclusion: </strong>These findings demonstrate the effectiveness of NLP in detecting HFS from real-world clinical records. The application to celecoxib-HFS detection illustrates the potential utility of this approach for retrospective safety analysis. Further work is needed to evaluate generalizability across diverse clinical settings.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2500096"},"PeriodicalIF":2.8000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12341754/pdf/","citationCount":"0","resultStr":"{\"title\":\"Elucidating Celecoxib's Preventive Effect in Capecitabine-Induced Hand-Foot Syndrome Using Medical Natural Language Processing.\",\"authors\":\"Masami Tsuchiya, Yoshimasa Kawazoe, Kiminori Shimamoto, Tomohisa Seki, Shungo Imai, Hayato Kizaki, Emiko Shinohara, Shuntaro Yada, Shoko Wakamiya, Eiji Aramaki, Satoko Hori\",\"doi\":\"10.1200/CCI-25-00096\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Purpose: </strong>Capecitabine, an oral anticancer agent, frequently causes hand-foot syndrome (HFS), affecting patients' quality of life and treatment adherence. However, such symptomatic toxicities are often difficult to detect in structured electronic health record (EHR) data. This study primarily aimed to validate a natural language processing (NLP) approach to identifying capecitabine-induced HFS from unstructured clinical text and demonstrate its application in evaluating medication-associated adverse event trends in real-world settings.</p><p><strong>Methods: </strong>We conducted a retrospective cohort study using EHRs from the University of Tokyo Hospital (2004-2021). HFS cases were identified using the MedNERN-CR-JA NLP model. After propensity score matching, we compared capecitabine users with and without celecoxib and assessed time to HFS onset using Cox proportional hazards models. NLP-based HFS detection was validated through manual annotation of aggregated clinical notes. Negative control and sensitivity analyses ensured robustness.</p><p><strong>Results: </strong>Among 44,502 patients with cancer, 669 capecitabine users were analyzed. HFS incidence was significantly higher among capecitabine users (hazard ratio [HR], 1.93 [95% CI, 1.48 to 2.52]; <i>P</i> < .001) compared with nonusers. Celecoxib use showed a suggestive association with a reduced HFS risk (HR, 0.51 [95% CI, 0.24 to 1.07]; <i>P</i> = .073). The NLP model demonstrated high accuracy in identifying HFS, achieving a precision of 0.875, recall of 1.000, and F<sub>1</sub> score of 0.933, based on manual annotation of patient-level clinical notes. Outcome trends remained consistent when using manually annotated HFS case labels instead of NLP-detected events, supporting the method's robustness.</p><p><strong>Conclusion: </strong>These findings demonstrate the effectiveness of NLP in detecting HFS from real-world clinical records. The application to celecoxib-HFS detection illustrates the potential utility of this approach for retrospective safety analysis. Further work is needed to evaluate generalizability across diverse clinical settings.</p>\",\"PeriodicalId\":51626,\"journal\":{\"name\":\"JCO Clinical Cancer Informatics\",\"volume\":\"9 \",\"pages\":\"e2500096\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12341754/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JCO Clinical Cancer Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1200/CCI-25-00096\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/8/12 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"ONCOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JCO Clinical Cancer Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1200/CCI-25-00096","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/8/12 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

目的:卡培他滨是一种口服抗癌药物,常引起手足综合征(HFS),影响患者的生活质量和治疗依从性。然而,这种症状性毒性通常难以在结构化电子健康记录(EHR)数据中检测到。本研究主要旨在验证自然语言处理(NLP)方法从非结构化临床文本中识别卡培他滨诱导的HFS,并展示其在评估现实环境中药物相关不良事件趋势方面的应用。方法:我们使用东京大学医院(2004-2021)的电子病历进行了回顾性队列研究。HFS病例采用MedNERN-CR-JA NLP模型进行鉴定。在倾向评分匹配后,我们比较了卡培他滨使用和不使用塞来昔布的使用者,并使用Cox比例风险模型评估了HFS发作的时间。基于nlp的HFS检测通过人工标注汇总的临床记录进行验证。阴性对照和敏感性分析确保了稳健性。结果:在44,502例癌症患者中,分析了669例卡培他滨使用者。卡培他滨服用者的HFS发生率显著升高(风险比[HR], 1.93 [95% CI, 1.48 ~ 2.52];P < 0.001)。塞来昔布的使用与HFS风险降低相关(HR, 0.51 [95% CI, 0.24 ~ 1.07];P = .073)。NLP模型对HFS的识别准确率较高,在手工标注患者级临床笔记的基础上,准确率为0.875,召回率为1.000,F1评分为0.933。当使用手动注释的HFS病例标签而不是nlp检测到的事件时,结果趋势保持一致,支持方法的鲁棒性。结论:这些发现证明了NLP在从现实世界的临床记录中检测HFS的有效性。塞来昔布- hfs检测的应用说明了这种方法在回顾性安全性分析中的潜在效用。需要进一步的工作来评估不同临床环境下的普遍性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Elucidating Celecoxib's Preventive Effect in Capecitabine-Induced Hand-Foot Syndrome Using Medical Natural Language Processing.

Purpose: Capecitabine, an oral anticancer agent, frequently causes hand-foot syndrome (HFS), affecting patients' quality of life and treatment adherence. However, such symptomatic toxicities are often difficult to detect in structured electronic health record (EHR) data. This study primarily aimed to validate a natural language processing (NLP) approach to identifying capecitabine-induced HFS from unstructured clinical text and demonstrate its application in evaluating medication-associated adverse event trends in real-world settings.

Methods: We conducted a retrospective cohort study using EHRs from the University of Tokyo Hospital (2004-2021). HFS cases were identified using the MedNERN-CR-JA NLP model. After propensity score matching, we compared capecitabine users with and without celecoxib and assessed time to HFS onset using Cox proportional hazards models. NLP-based HFS detection was validated through manual annotation of aggregated clinical notes. Negative control and sensitivity analyses ensured robustness.

Results: Among 44,502 patients with cancer, 669 capecitabine users were analyzed. HFS incidence was significantly higher among capecitabine users (hazard ratio [HR], 1.93 [95% CI, 1.48 to 2.52]; P < .001) compared with nonusers. Celecoxib use showed a suggestive association with a reduced HFS risk (HR, 0.51 [95% CI, 0.24 to 1.07]; P = .073). The NLP model demonstrated high accuracy in identifying HFS, achieving a precision of 0.875, recall of 1.000, and F1 score of 0.933, based on manual annotation of patient-level clinical notes. Outcome trends remained consistent when using manually annotated HFS case labels instead of NLP-detected events, supporting the method's robustness.

Conclusion: These findings demonstrate the effectiveness of NLP in detecting HFS from real-world clinical records. The application to celecoxib-HFS detection illustrates the potential utility of this approach for retrospective safety analysis. Further work is needed to evaluate generalizability across diverse clinical settings.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
6.20
自引率
4.80%
发文量
190
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信