Lucy S Kilburn, Victoria Hinder, Sikhuphukile G Mahati, Judith M Bliss
{"title":"使用常规收集的健康数据(英格兰)来确定原发性乳腺癌患者随后的疾病相关事件:乳腺癌临床试验中以医院为基础的随访的一种实际替代方法。","authors":"Lucy S Kilburn, Victoria Hinder, Sikhuphukile G Mahati, Judith M Bliss","doi":"10.1186/s13063-025-09085-1","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>With continued improvements in breast cancer (BC) outcomes and risk of recurrence occurring until at least 20 years post-diagnosis, it is important to continue to follow up clinical trial participants to characterise long-term treatment impact. Traditionally, follow-up has been via hospitals, entailing burden on patients and site staff. Using routinely collected health datasets (RCHD) as an alternative method is attractive, but historically, cancer recurrence is poorly recorded, unlike initial cancer diagnosis. Here we use data collected prospectively from large, multi-centre BC clinical trials to develop and test a procedure to identify recurrence within RCHD.</p><p><strong>Methods: </strong>Data from four trials of early breast cancer (TACT2, POETIC, IMPORT-HIGH and FAST-Forward) where recurrence data has been collected prospectively (gold standard) was linked with RCHD (incl. cancer registry and hospital episode statistics; HES) managed by NHS England. The procedure identified episodes of clinical activity within RCHD to classify each event type (local and distant recurrence, second cancers, death) separately, then combined to derive time-to-recurrence (TTR), disease-free survival (iDFS) and overall survival (OS) outcomes. Sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV) were calculated. Hazard ratios using Cox regression modelling, log-rank test p-values, and 3-year survival rates for the randomised treatments were reported separately for RCHD and trial data.</p><p><strong>Results: </strong>The final procedure used Cancer Registry diagnoses to identify initial BCs for quality control purposes and second primary cancers. Deaths were identified via death dates and cause. Distant recurrence was identified predominantly by direct indicators of metastases (e.g. ICD10 codes C77X-79X). Local recurrence was identified via relevant surgeries' OPCS4 codes. For TTR, iDFS and OS, agreement between study and RCHD events was reasonable. Specificity was good across all endpoints (range: 97.9-99.9% for three training datasets combined), as was NPV (range: 95.2-99.6%). Sensitivity and PPV were more variable, with sensitivity ranging between 72.9 and 97.2% and PPV ranging between 82.6 and 99.5%. Values were similar when considering the test dataset. Survival estimates for TTR, iDFS and OS were similar between study and RCHD data.</p><p><strong>Conclusion: </strong>It is possible, with reasonable accuracy, to identify cancer recurrences using RCHD in the place of hospital-based data collection after the point of primary analysis.</p>","PeriodicalId":23333,"journal":{"name":"Trials","volume":"26 1","pages":"359"},"PeriodicalIF":2.0000,"publicationDate":"2025-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12465273/pdf/","citationCount":"0","resultStr":"{\"title\":\"Use of routinely collected health data (England) to identify subsequent disease-related events in patients with primary breast cancer: a practical alternative to hospital-based follow-up for breast cancer clinical trials.\",\"authors\":\"Lucy S Kilburn, Victoria Hinder, Sikhuphukile G Mahati, Judith M Bliss\",\"doi\":\"10.1186/s13063-025-09085-1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>With continued improvements in breast cancer (BC) outcomes and risk of recurrence occurring until at least 20 years post-diagnosis, it is important to continue to follow up clinical trial participants to characterise long-term treatment impact. Traditionally, follow-up has been via hospitals, entailing burden on patients and site staff. Using routinely collected health datasets (RCHD) as an alternative method is attractive, but historically, cancer recurrence is poorly recorded, unlike initial cancer diagnosis. Here we use data collected prospectively from large, multi-centre BC clinical trials to develop and test a procedure to identify recurrence within RCHD.</p><p><strong>Methods: </strong>Data from four trials of early breast cancer (TACT2, POETIC, IMPORT-HIGH and FAST-Forward) where recurrence data has been collected prospectively (gold standard) was linked with RCHD (incl. cancer registry and hospital episode statistics; HES) managed by NHS England. The procedure identified episodes of clinical activity within RCHD to classify each event type (local and distant recurrence, second cancers, death) separately, then combined to derive time-to-recurrence (TTR), disease-free survival (iDFS) and overall survival (OS) outcomes. Sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV) were calculated. Hazard ratios using Cox regression modelling, log-rank test p-values, and 3-year survival rates for the randomised treatments were reported separately for RCHD and trial data.</p><p><strong>Results: </strong>The final procedure used Cancer Registry diagnoses to identify initial BCs for quality control purposes and second primary cancers. Deaths were identified via death dates and cause. Distant recurrence was identified predominantly by direct indicators of metastases (e.g. ICD10 codes C77X-79X). Local recurrence was identified via relevant surgeries' OPCS4 codes. For TTR, iDFS and OS, agreement between study and RCHD events was reasonable. Specificity was good across all endpoints (range: 97.9-99.9% for three training datasets combined), as was NPV (range: 95.2-99.6%). Sensitivity and PPV were more variable, with sensitivity ranging between 72.9 and 97.2% and PPV ranging between 82.6 and 99.5%. Values were similar when considering the test dataset. Survival estimates for TTR, iDFS and OS were similar between study and RCHD data.</p><p><strong>Conclusion: </strong>It is possible, with reasonable accuracy, to identify cancer recurrences using RCHD in the place of hospital-based data collection after the point of primary analysis.</p>\",\"PeriodicalId\":23333,\"journal\":{\"name\":\"Trials\",\"volume\":\"26 1\",\"pages\":\"359\"},\"PeriodicalIF\":2.0000,\"publicationDate\":\"2025-09-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12465273/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Trials\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s13063-025-09085-1\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MEDICINE, RESEARCH & EXPERIMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Trials","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s13063-025-09085-1","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
Use of routinely collected health data (England) to identify subsequent disease-related events in patients with primary breast cancer: a practical alternative to hospital-based follow-up for breast cancer clinical trials.
Background: With continued improvements in breast cancer (BC) outcomes and risk of recurrence occurring until at least 20 years post-diagnosis, it is important to continue to follow up clinical trial participants to characterise long-term treatment impact. Traditionally, follow-up has been via hospitals, entailing burden on patients and site staff. Using routinely collected health datasets (RCHD) as an alternative method is attractive, but historically, cancer recurrence is poorly recorded, unlike initial cancer diagnosis. Here we use data collected prospectively from large, multi-centre BC clinical trials to develop and test a procedure to identify recurrence within RCHD.
Methods: Data from four trials of early breast cancer (TACT2, POETIC, IMPORT-HIGH and FAST-Forward) where recurrence data has been collected prospectively (gold standard) was linked with RCHD (incl. cancer registry and hospital episode statistics; HES) managed by NHS England. The procedure identified episodes of clinical activity within RCHD to classify each event type (local and distant recurrence, second cancers, death) separately, then combined to derive time-to-recurrence (TTR), disease-free survival (iDFS) and overall survival (OS) outcomes. Sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV) were calculated. Hazard ratios using Cox regression modelling, log-rank test p-values, and 3-year survival rates for the randomised treatments were reported separately for RCHD and trial data.
Results: The final procedure used Cancer Registry diagnoses to identify initial BCs for quality control purposes and second primary cancers. Deaths were identified via death dates and cause. Distant recurrence was identified predominantly by direct indicators of metastases (e.g. ICD10 codes C77X-79X). Local recurrence was identified via relevant surgeries' OPCS4 codes. For TTR, iDFS and OS, agreement between study and RCHD events was reasonable. Specificity was good across all endpoints (range: 97.9-99.9% for three training datasets combined), as was NPV (range: 95.2-99.6%). Sensitivity and PPV were more variable, with sensitivity ranging between 72.9 and 97.2% and PPV ranging between 82.6 and 99.5%. Values were similar when considering the test dataset. Survival estimates for TTR, iDFS and OS were similar between study and RCHD data.
Conclusion: It is possible, with reasonable accuracy, to identify cancer recurrences using RCHD in the place of hospital-based data collection after the point of primary analysis.
期刊介绍:
Trials is an open access, peer-reviewed, online journal that will encompass all aspects of the performance and findings of randomized controlled trials. Trials will experiment with, and then refine, innovative approaches to improving communication about trials. We are keen to move beyond publishing traditional trial results articles (although these will be included). We believe this represents an exciting opportunity to advance the science and reporting of trials. Prior to 2006, Trials was published as Current Controlled Trials in Cardiovascular Medicine (CCTCVM). All published CCTCVM articles are available via the Trials website and citations to CCTCVM article URLs will continue to be supported.