{"title":"基于自然语言的深度学习模型评估非肌肉浸润性膀胱癌患者接受膀胱内卡介苗-谷氨酰胺治疗的有效性。","authors":"Makito Miyake, Naohiro Yonemoto, Kanae Togo, Linghua Xu, Tomoyo Oguri, Masayuki Tanaka, Yoshiyuki Hasegawa, Yoshinobu Izawa, Kenji Araki","doi":"10.1200/CCI-24-00249","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>Collecting information on clinical outcomes (recurrence/progression) from complex treatment courses in non-muscle invasive bladder cancer (NMIBC) is challenging and time-consuming. We developed a deep learning natural language processing model to assess outcomes in patients with NMIBC using vast data from electronic health records (EHRs).</p><p><strong>Methods: </strong>This retrospective study analyzed data from Japanese adults with NMIBC who started Bacillus Calmette-Guérin (BCG) induction therapy between April 2016 and June 2022. A Bidirectional Encoder Representations from Transformers (BERT) model was trained to classify outcomes, supported by human review for past history records. The model's performance was assessed by precision, recall, and F1 scores. We compared the effectiveness of BCG therapy between completion (patients who completed therapy) and non-completion groups.</p><p><strong>Results: </strong>Of 372 patients studied, 79.3% and 20.7% were in the completion group and the non-completion group, respectively. The final BERT model achieved average F1 scores of 0.91 and 0.98 for time to recurrence (TTR), and 0.74 and 0.94 for time to progression (TTP) before and after human support, respectively. The hazard ratio for TTR in BCG completion versus non-completion groups was 0.40 (95% CI, 0.26 to 0.62) by a multivariate Cox proportional hazard model and 0.41 (95% CI, 0.26 to 0.63) by inverse probability of treatment weighting.</p><p><strong>Conclusion: </strong>The developed model could compare the clinical outcomes between treatments in patients with NMIBC using EHRs. Human support, although required, was needed in only 10% documents and was deemed feasible. The model was able to demonstrate the difference in TTR and TTP between BCG completion and non-completion groups.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2400249"},"PeriodicalIF":3.3000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12233173/pdf/","citationCount":"0","resultStr":"{\"title\":\"Deep Learning Model for Natural Language to Assess Effectiveness of Patients With Non-Muscle Invasive Bladder Cancer Receiving Intravesical Bacillus Calmette-Guérin Therapy.\",\"authors\":\"Makito Miyake, Naohiro Yonemoto, Kanae Togo, Linghua Xu, Tomoyo Oguri, Masayuki Tanaka, Yoshiyuki Hasegawa, Yoshinobu Izawa, Kenji Araki\",\"doi\":\"10.1200/CCI-24-00249\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Purpose: </strong>Collecting information on clinical outcomes (recurrence/progression) from complex treatment courses in non-muscle invasive bladder cancer (NMIBC) is challenging and time-consuming. We developed a deep learning natural language processing model to assess outcomes in patients with NMIBC using vast data from electronic health records (EHRs).</p><p><strong>Methods: </strong>This retrospective study analyzed data from Japanese adults with NMIBC who started Bacillus Calmette-Guérin (BCG) induction therapy between April 2016 and June 2022. A Bidirectional Encoder Representations from Transformers (BERT) model was trained to classify outcomes, supported by human review for past history records. The model's performance was assessed by precision, recall, and F1 scores. We compared the effectiveness of BCG therapy between completion (patients who completed therapy) and non-completion groups.</p><p><strong>Results: </strong>Of 372 patients studied, 79.3% and 20.7% were in the completion group and the non-completion group, respectively. The final BERT model achieved average F1 scores of 0.91 and 0.98 for time to recurrence (TTR), and 0.74 and 0.94 for time to progression (TTP) before and after human support, respectively. The hazard ratio for TTR in BCG completion versus non-completion groups was 0.40 (95% CI, 0.26 to 0.62) by a multivariate Cox proportional hazard model and 0.41 (95% CI, 0.26 to 0.63) by inverse probability of treatment weighting.</p><p><strong>Conclusion: </strong>The developed model could compare the clinical outcomes between treatments in patients with NMIBC using EHRs. Human support, although required, was needed in only 10% documents and was deemed feasible. The model was able to demonstrate the difference in TTR and TTP between BCG completion and non-completion groups.</p>\",\"PeriodicalId\":51626,\"journal\":{\"name\":\"JCO Clinical Cancer Informatics\",\"volume\":\"9 \",\"pages\":\"e2400249\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2025-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12233173/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JCO Clinical Cancer Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1200/CCI-24-00249\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/6/27 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"ONCOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JCO Clinical Cancer Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1200/CCI-24-00249","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/6/27 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
Deep Learning Model for Natural Language to Assess Effectiveness of Patients With Non-Muscle Invasive Bladder Cancer Receiving Intravesical Bacillus Calmette-Guérin Therapy.
Purpose: Collecting information on clinical outcomes (recurrence/progression) from complex treatment courses in non-muscle invasive bladder cancer (NMIBC) is challenging and time-consuming. We developed a deep learning natural language processing model to assess outcomes in patients with NMIBC using vast data from electronic health records (EHRs).
Methods: This retrospective study analyzed data from Japanese adults with NMIBC who started Bacillus Calmette-Guérin (BCG) induction therapy between April 2016 and June 2022. A Bidirectional Encoder Representations from Transformers (BERT) model was trained to classify outcomes, supported by human review for past history records. The model's performance was assessed by precision, recall, and F1 scores. We compared the effectiveness of BCG therapy between completion (patients who completed therapy) and non-completion groups.
Results: Of 372 patients studied, 79.3% and 20.7% were in the completion group and the non-completion group, respectively. The final BERT model achieved average F1 scores of 0.91 and 0.98 for time to recurrence (TTR), and 0.74 and 0.94 for time to progression (TTP) before and after human support, respectively. The hazard ratio for TTR in BCG completion versus non-completion groups was 0.40 (95% CI, 0.26 to 0.62) by a multivariate Cox proportional hazard model and 0.41 (95% CI, 0.26 to 0.63) by inverse probability of treatment weighting.
Conclusion: The developed model could compare the clinical outcomes between treatments in patients with NMIBC using EHRs. Human support, although required, was needed in only 10% documents and was deemed feasible. The model was able to demonstrate the difference in TTR and TTP between BCG completion and non-completion groups.