Inter-reader reproducibility of a radiographic grading system for usual interstitial pneumonitis validates its use as a surrogate endpoint in clinical trials

Imaging Pub Date : 2024-03-06 DOI:10.1556/1647.2024.00190

K. Capaccione, Hong Ma, L. Luk, Mary M. Salvatore

{"title":"Inter-reader reproducibility of a radiographic grading system for usual interstitial pneumonitis validates its use as a surrogate endpoint in clinical trials","authors":"K. Capaccione, Hong Ma, L. Luk, Mary M. Salvatore","doi":"10.1556/1647.2024.00190","DOIUrl":null,"url":null,"abstract":"The primary purpose of this study was to assess the interreader reliability of a grading system for UIP based on the quantification of normal lung. This grading system considers each of the following lung regions: right upper and middle lobes, right lower lobe, left upper lobe, and left lower lobe. Each is assigned a grade based on the following: 0: 0% normal lung; 1: 1–49% normal lung; 2: 50–74% normal lung; 3: 75–89% normal lung; 4: 90–99% normal lung; 5: 100% normal lung. The secondary purpose was to compare the grades rendered by non-cardiothoracic subspecialty trained radiologists to grades established by cardiothoracic radiologists, which were considered the gold standard.Chest CT images of patients were obtained by searching the radiology record system for the terms “usual interstitial pneumonia” and “UIP”. Each case was confirmed by radiologist review; pathology was not assessed given the small fraction of cases that underwent biopsy due to the high risk of complications in patients with fibrotic lung disease. Two cardiothoracic radiologists evaluated each CT and reached a consensus grade. Two different radiologists who were not subspecialty trained in cardiothoracic radiology independently graded each case. Spearman correlation analysis was performed to compare the two reader's grades as well as each reader's grade independently to the gold standard score.Our analysis demonstrated a strongly positive statistically significant interreader correlation coefficient (RS) = 0.7062, P < 0.001. Our analysis of each reader compared to the gold standard demonstrated an Rs = 0.77559, P < 0.001 and an RS = 0.69958, P < 0.001 for readers 1 and 2, respectively, both representing statistically significant strongly positive correlations.These results demonstrate strong interreader reproducibility and show that radiologists without subspecialty training in cardiothoracic radiology render grades that correlate strongly with those given by cardiothoracic radiologists. These findings support the use of this grading system for UIP both to monitor clinical progression and as a surrogate endpoint for antifibrotic drug trials.","PeriodicalId":503851,"journal":{"name":"Imaging","volume":"31 16","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Imaging","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1556/1647.2024.00190","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The primary purpose of this study was to assess the interreader reliability of a grading system for UIP based on the quantification of normal lung. This grading system considers each of the following lung regions: right upper and middle lobes, right lower lobe, left upper lobe, and left lower lobe. Each is assigned a grade based on the following: 0: 0% normal lung; 1: 1–49% normal lung; 2: 50–74% normal lung; 3: 75–89% normal lung; 4: 90–99% normal lung; 5: 100% normal lung. The secondary purpose was to compare the grades rendered by non-cardiothoracic subspecialty trained radiologists to grades established by cardiothoracic radiologists, which were considered the gold standard.Chest CT images of patients were obtained by searching the radiology record system for the terms “usual interstitial pneumonia” and “UIP”. Each case was confirmed by radiologist review; pathology was not assessed given the small fraction of cases that underwent biopsy due to the high risk of complications in patients with fibrotic lung disease. Two cardiothoracic radiologists evaluated each CT and reached a consensus grade. Two different radiologists who were not subspecialty trained in cardiothoracic radiology independently graded each case. Spearman correlation analysis was performed to compare the two reader's grades as well as each reader's grade independently to the gold standard score.Our analysis demonstrated a strongly positive statistically significant interreader correlation coefficient (RS) = 0.7062, P < 0.001. Our analysis of each reader compared to the gold standard demonstrated an Rs = 0.77559, P < 0.001 and an RS = 0.69958, P < 0.001 for readers 1 and 2, respectively, both representing statistically significant strongly positive correlations.These results demonstrate strong interreader reproducibility and show that radiologists without subspecialty training in cardiothoracic radiology render grades that correlate strongly with those given by cardiothoracic radiologists. These findings support the use of this grading system for UIP both to monitor clinical progression and as a surrogate endpoint for antifibrotic drug trials.

查看原文本刊更多论文

常见间质性肺炎放射学分级系统的读片者间重复性验证了其在临床试验中作为替代终点的有效性

本研究的主要目的是评估基于正常肺部量化的 UIP 分级系统的读片机间可靠性。该分级系统考虑了以下每个肺部区域：右上叶和中叶、右下叶、左上叶和左下叶。每个区域根据以下标准划分等级：0：0% 正常肺；1：1-49% 正常肺；2：50-74% 正常肺；3：75-89% 正常肺；4：90-99% 正常肺；5：100% 正常肺。次要目的是将非心胸亚专业放射科医生的分级与心胸放射科医生确定的分级进行比较，后者被认为是金标准。患者的胸部 CT 图像是通过在放射科记录系统中搜索 "常见间质性肺炎 "和 "UIP "这两个术语获得的。每个病例均由放射科医生审查确认；由于纤维化肺病患者出现并发症的风险较高，只有一小部分病例进行了活检，因此没有进行病理评估。两名心胸放射科医生对每张 CT 进行评估，并达成一致的分级。两名未接受过心胸外科放射学亚专业培训的不同放射科医生对每个病例进行独立分级。我们的分析表明，读片者之间的相关系数 (RS) = 0.7062，P < 0.001。我们对每位读者与金标准的比较分析表明，读者 1 和读者 2 的 Rs = 0.77559，P < 0.001，RS = 0.69958，P < 0.001，两者均为统计学意义上的强正相关。这些结果表明读者间具有很强的可重复性，并表明未接受过心胸外科放射学亚专业培训的放射科医生给出的分级与心胸外科放射科医生给出的分级具有很强的相关性。这些研究结果支持使用这种 UIP 分级系统监测临床进展，并将其作为抗纤维化药物试验的替代终点。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Imaging

自引率

0.00%

发文量