Martijn C Schut, Torec T Luik, Iacopo Vagliano, Miguel A Rios Gaona, Charles W Helsper, Kristel M van Asselt, Niek de Wit, Ameen Abu-Hanna, Henk van Weert
{"title":"Artificial Intelligence for early detection of lung cancer in General Practitioners' clinical notes.","authors":"Martijn C Schut, Torec T Luik, Iacopo Vagliano, Miguel A Rios Gaona, Charles W Helsper, Kristel M van Asselt, Niek de Wit, Ameen Abu-Hanna, Henk van Weert","doi":"10.3399/BJGP.2023.0489","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The journey of more than 80% of patients diagnosed with lung cancer starts in general practice. About 75% of patients are diagnosed in an advanced stage (3 or 4), leading to more than 80% mortality within one year at present. The long-term data in general practitioners' records might contain hidden information that could be used for earlier case-finding of patients with cancer.</p><p><strong>Aim: </strong>To develop new prediction tools that improve the risk assessment for cancer.</p><p><strong>Design and setting: </strong> Text analysis of electronic patient data using natural language processing and machine learning in general practice files of four networks in the Netherlands.</p><p><strong>Method: </strong>Files of 525,526 patients were analysed, of whom 2386 were diagnosed with lung cancer. Diagnoses were validated in the Dutch Cancer registration, and structured and free text data were used to predict diagnosis of lung cancer five months before diagnosis (four months before referral).</p><p><strong>Results: </strong>Our algorithm could facilitate earlier detection of lung cancer using routine general practice data. We established discrimination, calibration, sensitivity, and specificity under various cut-off points of the prediction five months before diagnosis. Internal validation demonstrated an area under the curve of 0.90 (CI 95%: 0.90-0.93), and 0.84 (CI: 0.83-0.85) during external validation. The desired sensitivity determines the number of patients to be referred to detect one patient with lung cancer.</p><p><strong>Conclusion: </strong> AI-based support enables earlier detection of lung cancer in general practice using readily available text in the patient files of general practitioners, but needs additional prospective clinical evaluation.</p>","PeriodicalId":55320,"journal":{"name":"British Journal of General Practice","volume":" ","pages":""},"PeriodicalIF":5.3000,"publicationDate":"2025-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"British Journal of General Practice","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3399/BJGP.2023.0489","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 0
Abstract
Background: The journey of more than 80% of patients diagnosed with lung cancer starts in general practice. About 75% of patients are diagnosed in an advanced stage (3 or 4), leading to more than 80% mortality within one year at present. The long-term data in general practitioners' records might contain hidden information that could be used for earlier case-finding of patients with cancer.
Aim: To develop new prediction tools that improve the risk assessment for cancer.
Design and setting: Text analysis of electronic patient data using natural language processing and machine learning in general practice files of four networks in the Netherlands.
Method: Files of 525,526 patients were analysed, of whom 2386 were diagnosed with lung cancer. Diagnoses were validated in the Dutch Cancer registration, and structured and free text data were used to predict diagnosis of lung cancer five months before diagnosis (four months before referral).
Results: Our algorithm could facilitate earlier detection of lung cancer using routine general practice data. We established discrimination, calibration, sensitivity, and specificity under various cut-off points of the prediction five months before diagnosis. Internal validation demonstrated an area under the curve of 0.90 (CI 95%: 0.90-0.93), and 0.84 (CI: 0.83-0.85) during external validation. The desired sensitivity determines the number of patients to be referred to detect one patient with lung cancer.
Conclusion: AI-based support enables earlier detection of lung cancer in general practice using readily available text in the patient files of general practitioners, but needs additional prospective clinical evaluation.
期刊介绍:
The British Journal of General Practice is an international journal publishing research, editorials, debate and analysis, and clinical guidance for family practitioners and primary care researchers worldwide.
BJGP began in 1953 as the ‘College of General Practitioners’ Research Newsletter’, with the ‘Journal of the College of General Practitioners’ first appearing in 1960. Following the change in status of the College, the ‘Journal of the Royal College of General Practitioners’ was launched in 1967. Three editors later, in 1990, the title was changed to the ‘British Journal of General Practice’. The journal is commonly referred to as the ''BJGP'', and is an editorially-independent publication of the Royal College of General Practitioners.