Jenna Bhimani, K. O'Connell, I. Ergas, Marilyn J. Foley, Grace B Gallagher, Jennifer J Griggs, Narre Heon, Tatjana Kolevska, Yuriy Kotsurovskyy, Candyce H Kroenke, Cecile A Laurent, Raymond Liu, Kanichi G Nakata, Sonia Persaud, Donna R Rivera, Janise M. Roh, Sara M. Tabatabai, Emily Valice, Erin J A Bowles, Elisa V Bandera, Lawrence H Kushi, Elizabeth D Kantor
{"title":"Methodology for Using Real-World Data From Electronic Health Records to Assess Chemotherapy Administration in Women With Breast Cancer.","authors":"Jenna Bhimani, K. O'Connell, I. Ergas, Marilyn J. Foley, Grace B Gallagher, Jennifer J Griggs, Narre Heon, Tatjana Kolevska, Yuriy Kotsurovskyy, Candyce H Kroenke, Cecile A Laurent, Raymond Liu, Kanichi G Nakata, Sonia Persaud, Donna R Rivera, Janise M. Roh, Sara M. Tabatabai, Emily Valice, Erin J A Bowles, Elisa V Bandera, Lawrence H Kushi, Elizabeth D Kantor","doi":"10.1200/CCI.23.00209","DOIUrl":null,"url":null,"abstract":"PURPOSE\nIdentification of patients' intended chemotherapy regimens is critical to most research questions conducted in the real-world setting of cancer care. Yet, these data are not routinely available in electronic health records (EHRs) at the specificity required to address these questions. We developed a methodology to identify patients' intended regimens from EHR data in the Optimal Breast Cancer Chemotherapy Dosing (OBCD) study.\n\n\nMETHODS\nIn women older than 18 years, diagnosed with primary stage I-IIIA breast cancer at Kaiser Permanente Northern California (2006-2019), we categorized participants into 24 drug combinations described in National Comprehensive Cancer Network guidelines for breast cancer treatment. Participants were categorized into 50 guideline chemotherapy administration schedules within these combinations using an iterative algorithm process, followed by chart abstraction where necessary. We also identified patients intended to receive nonguideline administration schedules within guideline drug combinations and nonguideline drug combinations. This process was adapted at Kaiser Permanente Washington using abstracted data (2004-2015).\n\n\nRESULTS\nIn the OBCD cohort, 13,231 women received adjuvant or neoadjuvant chemotherapy, of whom 10,213 (77%) had their intended regimen identified via the algorithm, 2,416 (18%) had their intended regimen identified via abstraction, and 602 (4.5%) could not be identified. Across guideline drug combinations, 111 nonguideline dosing schedules were used, alongside 61 nonguideline drug combinations. A number of factors were associated with requiring abstraction for regimen determination, including: decreasing neighborhood household income, earlier diagnosis year, later stage, nodal status, and human epidermal growth factor receptor 2 (HER2)+ status.\n\n\nCONCLUSION\nWe describe the challenges and approaches to operationalize complex, real-world data to identify intended chemotherapy regimens in large, observational studies. This methodology can improve efficiency of use of large-scale clinical data in real-world populations, helping answer critical questions to improve care delivery and patient outcomes.","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":null,"pages":null},"PeriodicalIF":3.3000,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JCO Clinical Cancer Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1200/CCI.23.00209","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
PURPOSE
Identification of patients' intended chemotherapy regimens is critical to most research questions conducted in the real-world setting of cancer care. Yet, these data are not routinely available in electronic health records (EHRs) at the specificity required to address these questions. We developed a methodology to identify patients' intended regimens from EHR data in the Optimal Breast Cancer Chemotherapy Dosing (OBCD) study.
METHODS
In women older than 18 years, diagnosed with primary stage I-IIIA breast cancer at Kaiser Permanente Northern California (2006-2019), we categorized participants into 24 drug combinations described in National Comprehensive Cancer Network guidelines for breast cancer treatment. Participants were categorized into 50 guideline chemotherapy administration schedules within these combinations using an iterative algorithm process, followed by chart abstraction where necessary. We also identified patients intended to receive nonguideline administration schedules within guideline drug combinations and nonguideline drug combinations. This process was adapted at Kaiser Permanente Washington using abstracted data (2004-2015).
RESULTS
In the OBCD cohort, 13,231 women received adjuvant or neoadjuvant chemotherapy, of whom 10,213 (77%) had their intended regimen identified via the algorithm, 2,416 (18%) had their intended regimen identified via abstraction, and 602 (4.5%) could not be identified. Across guideline drug combinations, 111 nonguideline dosing schedules were used, alongside 61 nonguideline drug combinations. A number of factors were associated with requiring abstraction for regimen determination, including: decreasing neighborhood household income, earlier diagnosis year, later stage, nodal status, and human epidermal growth factor receptor 2 (HER2)+ status.
CONCLUSION
We describe the challenges and approaches to operationalize complex, real-world data to identify intended chemotherapy regimens in large, observational studies. This methodology can improve efficiency of use of large-scale clinical data in real-world populations, helping answer critical questions to improve care delivery and patient outcomes.