Elif Baran, Melissa Lee, Steven Aviv, Jessica Weiss, Chris Pettengell, Irene Karam, Andrew Bayley, Ian Poon, Kelvin K W Chan, Ambica Parmar, Martin Smoragiewicz, Hagen Klieb, Tra Truong, Pejman Maralani, Danny J Enepekides, Kevin M Higgins, Antoine Eskander
{"title":"Oropharyngeal Cancer Staging Health Record Extraction Using Artificial Intelligence.","authors":"Elif Baran, Melissa Lee, Steven Aviv, Jessica Weiss, Chris Pettengell, Irene Karam, Andrew Bayley, Ian Poon, Kelvin K W Chan, Ambica Parmar, Martin Smoragiewicz, Hagen Klieb, Tra Truong, Pejman Maralani, Danny J Enepekides, Kevin M Higgins, Antoine Eskander","doi":"10.1001/jamaoto.2024.1201","DOIUrl":null,"url":null,"abstract":"<p><strong>Importance: </strong>Accurate, timely, and cost-effective methods for staging oropharyngeal cancers are crucial for patient prognosis and treatment decisions, but staging documentation is often inaccurate or incomplete. With the emergence of artificial intelligence in medicine, data abstraction may be associated with reduced costs but increased efficiency and accuracy of cancer staging.</p><p><strong>Objective: </strong>To evaluate an algorithm using an artificial intelligence engine capable of extracting essential information from medical records of patients with oropharyngeal cancer and assigning tumor, nodal, and metastatic stages according to American Joint Committee on Cancer eighth edition guidelines.</p><p><strong>Design, setting, and participants: </strong>This retrospective diagnostic study was conducted among a convenience sample of 806 patients with oropharyngeal squamous cell carcinoma. Medical records of patients with oropharyngeal squamous cell carcinomas who presented to a single tertiary care center between January 1, 2010, and August 1, 2020, were reviewed. A ground truth cancer stage dataset and comprehensive staging rule book consisting of 135 rules encompassing p16 status, tumor, and nodal and metastatic stage were developed. Subsequently, 4 distinct models were trained: model T (entity relationship extraction) for anatomical location and invasion state, model S (numerical extraction) for lesion size, model M (sequential classification) for metastasis detection, and a p16 model for p16 status. For validation, results were compared against ground truth established by expert reviewers, and accuracy was reported. Data were analyzed from March to November 2023.</p><p><strong>Main outcomes and measures: </strong>The accuracy of algorithm cancer stages was compared with ground truth.</p><p><strong>Results: </strong>Among 806 patients with oropharyngeal cancer (mean [SD] age, 63.6 [10.6] years; 651 males [80.8%]), 421 patients (52.2%) were positive for human papillomavirus. The artificial intelligence engine achieved accuracies of 55.9% (95% CI, 52.5%-59.3%) for tumor, 56.0% (95% CI, 52.5%-59.4%) for nodal, and 87.6% (95% CI, 85.1%-89.7%) for metastatic stages and 92.1% (95% CI, 88.5%-94.6%) for p16 status. Differentiation between localized (stages 1-2) and advanced (stages 3-4) cancers achieved 80.7% (95% CI, 77.8%-83.2%) accuracy.</p><p><strong>Conclusion and relevance: </strong>This study found that tumor and nodal staging accuracies were fair to good and excellent for metastatic stage and p16 status, with clinical relevance in assigning optimal treatment and reducing toxic effect exposures. Further model refinement and external validation with electronic health records at different institutions are necessary to improve algorithm accuracy and clinical applicability.</p>","PeriodicalId":14632,"journal":{"name":"JAMA otolaryngology-- head & neck surgery","volume":" ","pages":"1051-1057"},"PeriodicalIF":6.0000,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11099836/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JAMA otolaryngology-- head & neck surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1001/jamaoto.2024.1201","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OTORHINOLARYNGOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Importance: Accurate, timely, and cost-effective methods for staging oropharyngeal cancers are crucial for patient prognosis and treatment decisions, but staging documentation is often inaccurate or incomplete. With the emergence of artificial intelligence in medicine, data abstraction may be associated with reduced costs but increased efficiency and accuracy of cancer staging.
Objective: To evaluate an algorithm using an artificial intelligence engine capable of extracting essential information from medical records of patients with oropharyngeal cancer and assigning tumor, nodal, and metastatic stages according to American Joint Committee on Cancer eighth edition guidelines.
Design, setting, and participants: This retrospective diagnostic study was conducted among a convenience sample of 806 patients with oropharyngeal squamous cell carcinoma. Medical records of patients with oropharyngeal squamous cell carcinomas who presented to a single tertiary care center between January 1, 2010, and August 1, 2020, were reviewed. A ground truth cancer stage dataset and comprehensive staging rule book consisting of 135 rules encompassing p16 status, tumor, and nodal and metastatic stage were developed. Subsequently, 4 distinct models were trained: model T (entity relationship extraction) for anatomical location and invasion state, model S (numerical extraction) for lesion size, model M (sequential classification) for metastasis detection, and a p16 model for p16 status. For validation, results were compared against ground truth established by expert reviewers, and accuracy was reported. Data were analyzed from March to November 2023.
Main outcomes and measures: The accuracy of algorithm cancer stages was compared with ground truth.
Results: Among 806 patients with oropharyngeal cancer (mean [SD] age, 63.6 [10.6] years; 651 males [80.8%]), 421 patients (52.2%) were positive for human papillomavirus. The artificial intelligence engine achieved accuracies of 55.9% (95% CI, 52.5%-59.3%) for tumor, 56.0% (95% CI, 52.5%-59.4%) for nodal, and 87.6% (95% CI, 85.1%-89.7%) for metastatic stages and 92.1% (95% CI, 88.5%-94.6%) for p16 status. Differentiation between localized (stages 1-2) and advanced (stages 3-4) cancers achieved 80.7% (95% CI, 77.8%-83.2%) accuracy.
Conclusion and relevance: This study found that tumor and nodal staging accuracies were fair to good and excellent for metastatic stage and p16 status, with clinical relevance in assigning optimal treatment and reducing toxic effect exposures. Further model refinement and external validation with electronic health records at different institutions are necessary to improve algorithm accuracy and clinical applicability.
期刊介绍:
JAMA Otolaryngology–Head & Neck Surgery is a globally recognized and peer-reviewed medical journal dedicated to providing up-to-date information on diseases affecting the head and neck. It originated in 1925 as Archives of Otolaryngology and currently serves as the official publication for the American Head and Neck Society. As part of the prestigious JAMA Network, a collection of reputable general medical and specialty publications, it ensures the highest standards of research and expertise. Physicians and scientists worldwide rely on JAMA Otolaryngology–Head & Neck Surgery for invaluable insights in this specialized field.