Tiffany T Yu,Anne C Hoyt,Melissa M Joines,Cheryce P Fischer,Nazanin Yaghmai,James S Chalfant,Lucy Chow,Shabnam Mortazavi,Christopher D Sears,James Sayre,Joann G Elmore,William Hsu,Hannah S Milch
{"title":"Mammographic classification of interval breast cancers and artificial intelligence performance.","authors":"Tiffany T Yu,Anne C Hoyt,Melissa M Joines,Cheryce P Fischer,Nazanin Yaghmai,James S Chalfant,Lucy Chow,Shabnam Mortazavi,Christopher D Sears,James Sayre,Joann G Elmore,William Hsu,Hannah S Milch","doi":"10.1093/jnci/djaf103","DOIUrl":null,"url":null,"abstract":"BACKGROUND\r\nEuropean studies suggest artificial intelligence (AI) can reduce interval breast cancers (IBCs). However, research on IBC classification and AI's effectiveness in the U.S., particularly using digital breast tomosynthesis (DBT) and annual screening, is limited. We aimed to mammographically classify IBCs and assess AI performance using a 12-month screening interval.\r\n\r\nMETHODS\r\nFrom digital mammography (DM) and DBT screening mammograms acquired 2010-2019 at a U.S. tertiary care academic center, we identified IBCs diagnosed <12 months after a negative mammogram. At least three breast radiologists retrospectively classified IBCs as missed-reading error, minimal signs-actionable, minimal signs-non-actionable, true interval, occult, or missed-technical error. A deep-learning AI tool assigned risk scores (1-10) to the negative index screening mammograms, with scores ≥8 considered \"flagged.\" Statistical analysis evaluated associations among IBC types and AI exam scores, AI markings, and patient/tumor characteristics.\r\n\r\nRESULTS\r\nFrom 184,935 screening mammograms (65% DM, 35% DBT), we identified 148 IBCs in 148 women (mean age, 61±12 years). Of these, 26% were minimal signs-actionable; 24% occult; 22% minimal signs-non-actionable; 17% missed-reading error; 6% true interval; and 5% missed-technical error (p<.001). AI scored 131 mammograms (17 errors excluded). AI most frequently flagged exams with missed-reading errors (90%), minimal signs-actionable (89%) and minimal signs-non-actionable (72%) (p=.02). AI localized mammographically-visible types more accurately (35-68%) than non-visible types (0-50%, p=.02).\r\n\r\nCONCLUSION\r\nAI more frequently flagged and accurately localized IBC types that were mammographically visible at screening (missed or minimal signs), as compared to true interval or occult cancers.","PeriodicalId":501635,"journal":{"name":"Journal of the National Cancer Institute","volume":"108 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the National Cancer Institute","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/jnci/djaf103","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
BACKGROUND
European studies suggest artificial intelligence (AI) can reduce interval breast cancers (IBCs). However, research on IBC classification and AI's effectiveness in the U.S., particularly using digital breast tomosynthesis (DBT) and annual screening, is limited. We aimed to mammographically classify IBCs and assess AI performance using a 12-month screening interval.
METHODS
From digital mammography (DM) and DBT screening mammograms acquired 2010-2019 at a U.S. tertiary care academic center, we identified IBCs diagnosed <12 months after a negative mammogram. At least three breast radiologists retrospectively classified IBCs as missed-reading error, minimal signs-actionable, minimal signs-non-actionable, true interval, occult, or missed-technical error. A deep-learning AI tool assigned risk scores (1-10) to the negative index screening mammograms, with scores ≥8 considered "flagged." Statistical analysis evaluated associations among IBC types and AI exam scores, AI markings, and patient/tumor characteristics.
RESULTS
From 184,935 screening mammograms (65% DM, 35% DBT), we identified 148 IBCs in 148 women (mean age, 61±12 years). Of these, 26% were minimal signs-actionable; 24% occult; 22% minimal signs-non-actionable; 17% missed-reading error; 6% true interval; and 5% missed-technical error (p<.001). AI scored 131 mammograms (17 errors excluded). AI most frequently flagged exams with missed-reading errors (90%), minimal signs-actionable (89%) and minimal signs-non-actionable (72%) (p=.02). AI localized mammographically-visible types more accurately (35-68%) than non-visible types (0-50%, p=.02).
CONCLUSION
AI more frequently flagged and accurately localized IBC types that were mammographically visible at screening (missed or minimal signs), as compared to true interval or occult cancers.