Jose Peña, Sebastián Santana, Juan Cristobal Morales, Natalie Pinto, Mariano Suárez, Carola Sánchez, Juan Carlos Opazo, Rodrigo Villarroel, Claudio Montenegro, Bruno Nervi, Richard Weber
{"title":"OncovigIA: Artificial Intelligence for Early Lung Cancer Detection and Referral in a Chilean Public Hospital.","authors":"Jose Peña, Sebastián Santana, Juan Cristobal Morales, Natalie Pinto, Mariano Suárez, Carola Sánchez, Juan Carlos Opazo, Rodrigo Villarroel, Claudio Montenegro, Bruno Nervi, Richard Weber","doi":"10.1200/CCI-25-00035","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>Lung cancer is a leading cause of death in Chile, where late-stage diagnoses and high mortality rates prevail. Here, we describe the development of <i>OncovigIA</i>, a novel digital tool powered by natural language processing that enhances the identification of potential lung cancer cases by surveilling computed tomography (CT) reports in a large public Hospital in Santiago, Chile.</p><p><strong>Materials and methods: </strong>We combined natural language processing and large language models with state-of-the-art machine learning techniques and approaches to treat unbalanced data sets and determine the best solution to implement in <i>OncovigIA</i>. Focusing on key sections of the reports and using various machine learning models, including a balanced Random Forest, the tool achieved high performance with 0.90 accuracy and 0.84 F1-score on the test set.</p><p><strong>Results: </strong>When applied to 13,326 CT chest reports from 2022, it successfully identified 377 CTs of patients with suspected lung cancer previously undetected and not managed by the multidisciplinary local lung cancer team.</p><p><strong>Conclusion: </strong>This study underscores the potential of artificial intelligence in early cancer detection and highlights the importance of its integration into local health care ecosystems. By promptly increasing the number of patients referred for specialized management, the tool <i>OncovigIA</i> offers a promising path toward improving lung cancer survival rates in Chile and beyond. Moreover, this article provides avenues for its broader implementation, extending it to other cancer types and/or health care-related texts for continuous surveillance, aiming at the early referral and treatment of cancer in low-resource settings.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2500035"},"PeriodicalIF":2.8000,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JCO Clinical Cancer Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1200/CCI-25-00035","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/10/2 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose: Lung cancer is a leading cause of death in Chile, where late-stage diagnoses and high mortality rates prevail. Here, we describe the development of OncovigIA, a novel digital tool powered by natural language processing that enhances the identification of potential lung cancer cases by surveilling computed tomography (CT) reports in a large public Hospital in Santiago, Chile.
Materials and methods: We combined natural language processing and large language models with state-of-the-art machine learning techniques and approaches to treat unbalanced data sets and determine the best solution to implement in OncovigIA. Focusing on key sections of the reports and using various machine learning models, including a balanced Random Forest, the tool achieved high performance with 0.90 accuracy and 0.84 F1-score on the test set.
Results: When applied to 13,326 CT chest reports from 2022, it successfully identified 377 CTs of patients with suspected lung cancer previously undetected and not managed by the multidisciplinary local lung cancer team.
Conclusion: This study underscores the potential of artificial intelligence in early cancer detection and highlights the importance of its integration into local health care ecosystems. By promptly increasing the number of patients referred for specialized management, the tool OncovigIA offers a promising path toward improving lung cancer survival rates in Chile and beyond. Moreover, this article provides avenues for its broader implementation, extending it to other cancer types and/or health care-related texts for continuous surveillance, aiming at the early referral and treatment of cancer in low-resource settings.