A. Panny, H. Hegde, I. Glurich, F. Scannapieco, J. Vedre, J. Vanwormer, J. Miecznikowski, A. Acharya
{"title":"使用自然语言处理(NLP)验证放射学报告中肺炎遭遇的方法学方法。","authors":"A. Panny, H. Hegde, I. Glurich, F. Scannapieco, J. Vedre, J. Vanwormer, J. Miecznikowski, A. Acharya","doi":"10.1055/a-1817-7008","DOIUrl":null,"url":null,"abstract":"INTRODUCTION\nPneumonia is caused by microbes that establish an infectious process in the lungs. The gold standard for pneumonia diagnosis is radiologist-documented pneumonia-related features in radiology notes that are captured in electronic health records in an unstructured format.\n\n\nOBJECTIVE\nThe study objective was to develop a methodological approach for assessing validity of a pneumonia diagnosis based on identifying presence or absence of key radiographic features in radiology reports with subsequent rendering of diagnostic decisions into a structured format.\n\n\nMETHODS\nA pneumonia-specific Natural Language Processing (NLP) pipeline was strategically developed applying cTAKES to validate pneumonia diagnoses following development of a pneumonia feature-specific lexicon. Radiographic reports of study-eligible subjects identified by International Classification of Diseases (ICD) codes were parsed through the NLP pipeline. Classification rules were developed to assign each pneumonia episode into one of three categories: \"positive\", \"negative\" or \"not classified: requires manual review\" based on tagged concepts that support or refute diagnostic codes.\n\n\nRESULTS\nA total of 91,998 pneumonia episodes diagnosed in 65,904 patients were retrieved retrospectively. Approximately 89% (81,707/91,998) of the total pneumonia episodes were documented by 225,893 chest x-ray reports. NLP classified and validated 33% (26,800/81,707) of pneumonia episodes classified as 'Pneumonia-positive', 19% as (15401/81,707) as 'Pneumonia-negative' and 48% (39,209/81,707) as ''episode classification pending further manual review'. NLP pipeline performance metrics included accuracy (76.3%), sensitivity (88%), and specificity (75%).\n\n\nCONCLUSION\nThe pneumonia-specific NLP pipeline exhibited good performance comparable to other pneumonia-specific NLP systems developed to date.","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":" ","pages":""},"PeriodicalIF":1.3000,"publicationDate":"2022-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A methodological approach to validate pneumonia encounters from radiology reports using Natural Language Processing (NLP).\",\"authors\":\"A. Panny, H. Hegde, I. Glurich, F. Scannapieco, J. Vedre, J. Vanwormer, J. Miecznikowski, A. Acharya\",\"doi\":\"10.1055/a-1817-7008\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"INTRODUCTION\\nPneumonia is caused by microbes that establish an infectious process in the lungs. The gold standard for pneumonia diagnosis is radiologist-documented pneumonia-related features in radiology notes that are captured in electronic health records in an unstructured format.\\n\\n\\nOBJECTIVE\\nThe study objective was to develop a methodological approach for assessing validity of a pneumonia diagnosis based on identifying presence or absence of key radiographic features in radiology reports with subsequent rendering of diagnostic decisions into a structured format.\\n\\n\\nMETHODS\\nA pneumonia-specific Natural Language Processing (NLP) pipeline was strategically developed applying cTAKES to validate pneumonia diagnoses following development of a pneumonia feature-specific lexicon. Radiographic reports of study-eligible subjects identified by International Classification of Diseases (ICD) codes were parsed through the NLP pipeline. Classification rules were developed to assign each pneumonia episode into one of three categories: \\\"positive\\\", \\\"negative\\\" or \\\"not classified: requires manual review\\\" based on tagged concepts that support or refute diagnostic codes.\\n\\n\\nRESULTS\\nA total of 91,998 pneumonia episodes diagnosed in 65,904 patients were retrieved retrospectively. Approximately 89% (81,707/91,998) of the total pneumonia episodes were documented by 225,893 chest x-ray reports. NLP classified and validated 33% (26,800/81,707) of pneumonia episodes classified as 'Pneumonia-positive', 19% as (15401/81,707) as 'Pneumonia-negative' and 48% (39,209/81,707) as ''episode classification pending further manual review'. NLP pipeline performance metrics included accuracy (76.3%), sensitivity (88%), and specificity (75%).\\n\\n\\nCONCLUSION\\nThe pneumonia-specific NLP pipeline exhibited good performance comparable to other pneumonia-specific NLP systems developed to date.\",\"PeriodicalId\":49822,\"journal\":{\"name\":\"Methods of Information in Medicine\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2022-04-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Methods of Information in Medicine\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1055/a-1817-7008\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Methods of Information in Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1055/a-1817-7008","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
A methodological approach to validate pneumonia encounters from radiology reports using Natural Language Processing (NLP).
INTRODUCTION
Pneumonia is caused by microbes that establish an infectious process in the lungs. The gold standard for pneumonia diagnosis is radiologist-documented pneumonia-related features in radiology notes that are captured in electronic health records in an unstructured format.
OBJECTIVE
The study objective was to develop a methodological approach for assessing validity of a pneumonia diagnosis based on identifying presence or absence of key radiographic features in radiology reports with subsequent rendering of diagnostic decisions into a structured format.
METHODS
A pneumonia-specific Natural Language Processing (NLP) pipeline was strategically developed applying cTAKES to validate pneumonia diagnoses following development of a pneumonia feature-specific lexicon. Radiographic reports of study-eligible subjects identified by International Classification of Diseases (ICD) codes were parsed through the NLP pipeline. Classification rules were developed to assign each pneumonia episode into one of three categories: "positive", "negative" or "not classified: requires manual review" based on tagged concepts that support or refute diagnostic codes.
RESULTS
A total of 91,998 pneumonia episodes diagnosed in 65,904 patients were retrieved retrospectively. Approximately 89% (81,707/91,998) of the total pneumonia episodes were documented by 225,893 chest x-ray reports. NLP classified and validated 33% (26,800/81,707) of pneumonia episodes classified as 'Pneumonia-positive', 19% as (15401/81,707) as 'Pneumonia-negative' and 48% (39,209/81,707) as ''episode classification pending further manual review'. NLP pipeline performance metrics included accuracy (76.3%), sensitivity (88%), and specificity (75%).
CONCLUSION
The pneumonia-specific NLP pipeline exhibited good performance comparable to other pneumonia-specific NLP systems developed to date.
期刊介绍:
Good medicine and good healthcare demand good information. Since the journal''s founding in 1962, Methods of Information in Medicine has stressed the methodology and scientific fundamentals of organizing, representing and analyzing data, information and knowledge in biomedicine and health care. Covering publications in the fields of biomedical and health informatics, medical biometry, and epidemiology, the journal publishes original papers, reviews, reports, opinion papers, editorials, and letters to the editor. From time to time, the journal publishes articles on particular focus themes as part of a journal''s issue.