骨髓纤维化定量分析凸显骨髓纤维化的异质性并增强组织学评估：津芬曲辛αII期临床研究的启示

IF 7.6 2区医学 Q1 HEMATOLOGY

HemaSphere Pub Date : 2024-06-13 DOI:10.1002/hem3.105

Hosuk Ryou, Korsuk Sirinukunwattana, Ruby Wood, Alan Aberdeen, Jens Rittscher, Olga K. Weinberg, Robert Hasserjian, Olga Pozdnyakova, Frank Peale, Brian Higgins, Pontus Lundberg, Kerstin Trunzer, Claire N. Harrison, Daniel Royston

{"title":"骨髓纤维化定量分析凸显骨髓纤维化的异质性并增强组织学评估：津芬曲辛αII期临床研究的启示","authors":"Hosuk Ryou, Korsuk Sirinukunwattana, Ruby Wood, Alan Aberdeen, Jens Rittscher, Olga K. Weinberg, Robert Hasserjian, Olga Pozdnyakova, Frank Peale, Brian Higgins, Pontus Lundberg, Kerstin Trunzer, Claire N. Harrison, Daniel Royston","doi":"10.1002/hem3.105","DOIUrl":null,"url":null,"abstract":"Accurate assessment of bone marrow fibrosis is central to the diagnosis and assessment of patients with myeloproliferative neoplasms (MPNs).1-3 However, European consensus criteria for fibrosis are subjective, only semiquantitative, and cannot fully capture sample fibrosis heterogeneity.4-6 In response, we have recently demonstrated the potential of machine learning to improve the detection and quantitation of marrow fibrosis in MPN using routinely prepared bone marrow trephine (BMT) samples.7 Such approaches can support accurate MPN classification/risk stratification and provide quantitative analysis of fibrosis heterogeneity, with the potential to support clinical trial teams in the evaluation of current and novel antifibrotic therapies.6 Here, we report evidence of such utility in the context of stage 2 of a phase II study of zinpentraxin alfa in patients diagnosed with primary or secondary myelofibrosis (MF) [ClinicalTrials.gov identifier: NCT01981850]. The primary trial endpoint was bone marrow response (≥1 grade reduction from baseline fibrosis at any timepoint). Secondary endpoints included effects on disease-related anemia, thrombocytopenia, and constitutional symptoms.Zinpentraxin alfa (ZPN; previously PRM-151) is a recombinant form of human pentraxin-2 (PTX2; also known as serum amyloid P component or SAP), a circulating endogenous regulator of the inflammatory response to tissue damage and a natural inhibitor of fibrosis.8-10 In the open-label stage 1 of this phase 2 study, ZPN showed evidence of clinical activity and tolerable safety as monotherapy or in combination with ruxolitinib in patients with primary MF, post-polycythemia vera (PV) MF, or post-essential thrombocythemia (ET) MF.11 A subsequent randomized dose-ranging study (stage 2) evaluated the efficacy and safety of three different doses of ZPN as monotherapy in patients with IPSS intermediate-1, intermediate-2, and high-risk primary MF, post-PV MF, or post-ET MF who were anemic or thrombocytopenic and ineligible for, intolerant of, or had an inadequate prior response to ruxolitinib.12 Patients were randomized to receive 0.3, 3.0, or 10.0 mg/kg ZPN on Days 1, 3, and 5 of cycle 1 and every 4 weeks thereafter for up to nine cycles. Reticulin-stained BMTs from three timepoints (screening, cycle 4 [C4D1], and cycle 9 [C9D29]) were analyzed for a subset of patients enrolled in the stage 2 study for whom digital scanned images were available at all three timepoints (50/97) (Figure 1A,B). Prior manual assessment of marrow fibrosis had been performed as part of a blinded, independent central review by three expert hematopathologists. Quantitative assessment of fibrosis using Continuous Indexing of Fibrosis (CIF) was performed by automated analyses as previously described.7 Briefly, CIF analysis employs a ranking convolutional neural network (CNN) trained on images of reticulin-stained BMT slides to score image tiles for fibrosis severity. These tiles cover the analyzable marrow tissue and are used to generate fibrosis severity maps with output image scores (CIF scores) normalized between 0 and 1. Three sets of features relating to analyzed tiles are extracted from each sample: average tile CIF score, tile score distribution, and heterogeneity of CIF score. Visualization of these outputs into two-dimensional space is performed using principal component analysis (PCA) (Figure 1C).A total of 142/157 (90.4%) BMT samples obtained from 50 patients at three timepoints were evaluable. Overall, there was a moderate correlation between the average sample CIF score and the manually assigned fibrosis grade for all samples (Spearman's rho = 0.39) (Figure 2A). However, there was a marked overlap in the distribution of CIF scores across fibrosis grades, most notably for samples assigned to grades MF-2 and MF-3. Approximately, 38% (n = 16) of MF-2 samples fell within the interquartile range of CIF distribution observed in MF-3, and around 48% of MF-3 (n = 45) samples fell within the interquartile range observed in MF-2. This result is in keeping with the recognized challenge of accurately distinguishing between these MF grade categories, although both are consistent with a diagnosis of overt myelofibrosis. Notably, several samples manually assessed as MF-2 had average CIF scores similar to or lower than those graded as MF-0 or MF-1. On review, we suspected this may reflect sample fibrosis heterogeneity; some samples with low average fibrosis (low average sample CIF score) were correctly classified as MF-2 on the basis that ≥30% of the tissue contained more severe fibrosis (high regional CIF score). To investigate this further, we compared the ZPN trial samples taken at screening with an independent cohort of newly diagnosed and untreated MPNs in which PCA was used to combine average tile CIF score, tile score distribution, and heterogeneity of CIF score (Figure 2D). Plotting the ZPN screening samples onto this PCA of MPN “disease space” revealed that while most samples demonstrated such combined fibrosis features typical of primary or secondary myelofibrosis, several displayed features more typically seen in ET, pre-PMF, or PV.Having identified marked variation in both the fibrosis features at screening and average CIF scores of manually assigned MF grades for all samples, we assessed changes in fibrosis from screening to C4 and C9. This revealed an improvement in the average CIF score in 16 of 42 patients (38%) (Figure 3A). Notably, improvements in average CIF score by C9 appeared to be most marked in patients with higher CIF scores at screening, although no obvious ZPN dose-dependent effect was observed. The overall improvement in CIF score was similar to that of manually assessed fibrosis in which 15 of 41 patients (37%) had an improvement of at least one MF grade at either C4 or C9. However, there was notable discordance between manual and quantitative CIF fibrosis assessment for individual cases (Figure 2B,C), with only 6 of 41 cases (15%) demonstrating both an improvement in CIF score and manual MF grade.Next, we sought to correlate changes in CIF score with the secondary trial endpoints. We observed no significant association between a change in average sample CIF score and changes in disease-related anemia, thrombocytopenia, or constitutional symptoms (data not shown). However, we observed a trend toward an association between improving CIF score and best overall response as per modified International Working Group-Myeloproliferative Neoplasms Research and Treatment (IWG-MRT) criteria, with marrows from patients experiencing clinical improvement more likely to have a corresponding improvement in CIF score between screening and C9 (Figure 3B). Finally, logistic regression analysis was used to estimate the association of the treatment group, baseline anemia, or thrombocytopenia and average CIF score at screening with the reduction in average CIF score. This revealed that a higher average CIF score at screening was significantly associated with CIF score reduction for the 42 patients for whom samples were available for all three trial timepoints (Wald's test p < 0.01) (Figure 3C).Our analysis is the first to demonstrate the utility of AI-driven quantitative fibrosis analysis in a multicenter clinical trial of patients with myelofibrosis. Although CIF-based analysis is not designed to specify an MF grade, it provides an objective measure of fibrosis severity and heterogeneity within BMTs, which is beyond conventional manual grading criteria. Moreover, it enables objective comparison across sequential samples from individual patients and allows accurate comparison within trial cohorts. Our results raise important concerns over the subjectivity of conventional fibrosis assessment in myelofibrosis, with marked overlap in CIF scores seen between and within manually assigned MF grades, and poor concordance between manually assessed and CIF-determined fibrosis improvement. Unexpectedly, there was a marked variation in average CIF score at screening in a trial recruiting patients with high-risk primary or secondary MF, supported by our demonstration of striking cohort heterogeneity when compared to a separate cohort of MPN. Indeed, 39% (19/49) of the screening samples analyzed in this study demonstrated fibrotic features (average severity and heterogeneity) more typical of MPNs other than primary or secondary myelofibrosis (i.e., ET, PV, and pre-PMF). However, it should be noted that most patients recruited to this trial had high-risk disease (39/50 with IPSS Int-2/high risk) and 39/50 patients had received prior JAK2 inhibition. By contrast, our previously analyzed cohort of MPN included only newly diagnosed patients with no significant pretreatment. It remains unclear to what extent the inclusion of MPN patient samples with longstanding disease and/or significant pretreatment will influence our existing description of bone marrow fibrosis state in ongoing studies. Notwithstanding this caveat, our analysis suggests that variation in manual fibrosis assessment could adversely influence the accuracy and consistency of trials aiming to evaluate therapeutics targeting MF, and alternative methods for quantifying and defining fibrosis changes following therapy are indicated. This is particularly important given recent work questioning the role of marrow fibrosis assessment in evaluating outcomes in JAK inhibitor-naïve patients treated with momelotinib or ruxolitinib, particularly as the authors relied upon local fibrosis grading with no central review.13 Although we could not demonstrate evidence for a significant association between CIF score improvement and the secondary clinical endpoints, we had access to WSI from only 50 of the 97 recruited patients. This reflects challenges in collecting such data as part of post hoc analytical studies and highlights the value of including such analysis in the study protocols of future clinical trials looking to evaluate bone marrow morphological response. Our observation of a trend toward an association between improving CIF score and the best overall response as per IWG-MRT criteria warrants further evaluation of quantitative fibrosis analysis as a surrogate for clinical response in MPN trials aiming to stabilize or reverse marrow fibrosis.14Conception and design: Daniel Royston, Kerstin Trunzer, Korsuk Sirinukunwattana, Hosuk Ryou, Alan Aberdeen & Jens Rittscher. Collection and assembly of data: Kerstin Trunzer, Frank Peale, Brian Higgins, Pontus Lundberg, Claire N. Harrison, Olga K. Weinberg, Robert Hasserjian & Olga Pozdnyakova. Data analysis and interpretation: Hosuk Ryou, Korsuk Sirinukunwattana, Ruby Wood, Alan Aberdeen & Daniel Royston. Manuscript writing: Daniel Royston, Korsuk Sirinukunwattana, Kerstin Trunzer, Pontus Lundberg & Alan Aberdeen. Final approval of manuscript: All authors.Korsuk Sirinukunwattana, Alan Aberdeen, and Jens Rittscher are cofounders and equity holders of Ground Truth Labs Ltd. Daniel Royston provides consulting services to Ground Truth Labs Ltd. and Johnson & Johnson. Kerstin Trunzer and Pontus Lundberg are employees of F. Hoffmann-La Roche and have stock ownership. Brian Higgins is an employee of F. Hoffmann-La Roche and Genentech and has stock ownership. Frank Peale is an employee of Genentech and has stock ownership. Claire N. Harrison has received consulting fees from AbbVie, AOP, BMS, Constellation Pharmaceuticals, CTI BioPharma, Galecto, GSK, Karyopharm, Keros, MorphoSys, Novartis, Promedior, and Roche; honoraria from AbbVie, BMS, GSK, and Novartis; has advisory roles for Galecto and Keros; has received support from Novartis for attending meetings; and has a leadership or fiduciary role with the European Hematology Association and MPN Voice; and is an Editor of HemaSphere. The remaining authors declare no conflict of interest.This study was supported by F. Hoffmann-La Roche, Ltd.; Blood Cancer UK, Grant/Award Number: 23012; Cancer Research UK, Grant/Award Number: EDDPJT-May23/100034; EPSRC-funded Seebibyte programme (EP/M013774/1); and Ludwig Institute for Cancer Research, Oxford Branch.","PeriodicalId":12982,"journal":{"name":"HemaSphere","volume":"8 6","pages":""},"PeriodicalIF":7.6000,"publicationDate":"2024-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/hem3.105","citationCount":"0","resultStr":"{\"title\":\"Quantitative analysis of bone marrow fibrosis highlights heterogeneity in myelofibrosis and augments histological assessment: An Insight from a phase II clinical study of zinpentraxin alfa\",\"authors\":\"Hosuk Ryou, Korsuk Sirinukunwattana, Ruby Wood, Alan Aberdeen, Jens Rittscher, Olga K. Weinberg, Robert Hasserjian, Olga Pozdnyakova, Frank Peale, Brian Higgins, Pontus Lundberg, Kerstin Trunzer, Claire N. Harrison, Daniel Royston\",\"doi\":\"10.1002/hem3.105\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Accurate assessment of bone marrow fibrosis is central to the diagnosis and assessment of patients with myeloproliferative neoplasms (MPNs).1-3 However, European consensus criteria for fibrosis are subjective, only semiquantitative, and cannot fully capture sample fibrosis heterogeneity.4-6 In response, we have recently demonstrated the potential of machine learning to improve the detection and quantitation of marrow fibrosis in MPN using routinely prepared bone marrow trephine (BMT) samples.7 Such approaches can support accurate MPN classification/risk stratification and provide quantitative analysis of fibrosis heterogeneity, with the potential to support clinical trial teams in the evaluation of current and novel antifibrotic therapies.6 Here, we report evidence of such utility in the context of stage 2 of a phase II study of zinpentraxin alfa in patients diagnosed with primary or secondary myelofibrosis (MF) [ClinicalTrials.gov identifier: NCT01981850]. The primary trial endpoint was bone marrow response (≥1 grade reduction from baseline fibrosis at any timepoint). Secondary endpoints included effects on disease-related anemia, thrombocytopenia, and constitutional symptoms.Zinpentraxin alfa (ZPN; previously PRM-151) is a recombinant form of human pentraxin-2 (PTX2; also known as serum amyloid P component or SAP), a circulating endogenous regulator of the inflammatory response to tissue damage and a natural inhibitor of fibrosis.8-10 In the open-label stage 1 of this phase 2 study, ZPN showed evidence of clinical activity and tolerable safety as monotherapy or in combination with ruxolitinib in patients with primary MF, post-polycythemia vera (PV) MF, or post-essential thrombocythemia (ET) MF.11 A subsequent randomized dose-ranging study (stage 2) evaluated the efficacy and safety of three different doses of ZPN as monotherapy in patients with IPSS intermediate-1, intermediate-2, and high-risk primary MF, post-PV MF, or post-ET MF who were anemic or thrombocytopenic and ineligible for, intolerant of, or had an inadequate prior response to ruxolitinib.12 Patients were randomized to receive 0.3, 3.0, or 10.0 mg/kg ZPN on Days 1, 3, and 5 of cycle 1 and every 4 weeks thereafter for up to nine cycles. Reticulin-stained BMTs from three timepoints (screening, cycle 4 [C4D1], and cycle 9 [C9D29]) were analyzed for a subset of patients enrolled in the stage 2 study for whom digital scanned images were available at all three timepoints (50/97) (Figure 1A,B). Prior manual assessment of marrow fibrosis had been performed as part of a blinded, independent central review by three expert hematopathologists. Quantitative assessment of fibrosis using Continuous Indexing of Fibrosis (CIF) was performed by automated analyses as previously described.7 Briefly, CIF analysis employs a ranking convolutional neural network (CNN) trained on images of reticulin-stained BMT slides to score image tiles for fibrosis severity. These tiles cover the analyzable marrow tissue and are used to generate fibrosis severity maps with output image scores (CIF scores) normalized between 0 and 1. Three sets of features relating to analyzed tiles are extracted from each sample: average tile CIF score, tile score distribution, and heterogeneity of CIF score. Visualization of these outputs into two-dimensional space is performed using principal component analysis (PCA) (Figure 1C).A total of 142/157 (90.4%) BMT samples obtained from 50 patients at three timepoints were evaluable. Overall, there was a moderate correlation between the average sample CIF score and the manually assigned fibrosis grade for all samples (Spearman's rho = 0.39) (Figure 2A). However, there was a marked overlap in the distribution of CIF scores across fibrosis grades, most notably for samples assigned to grades MF-2 and MF-3. Approximately, 38% (n = 16) of MF-2 samples fell within the interquartile range of CIF distribution observed in MF-3, and around 48% of MF-3 (n = 45) samples fell within the interquartile range observed in MF-2. This result is in keeping with the recognized challenge of accurately distinguishing between these MF grade categories, although both are consistent with a diagnosis of overt myelofibrosis. Notably, several samples manually assessed as MF-2 had average CIF scores similar to or lower than those graded as MF-0 or MF-1. On review, we suspected this may reflect sample fibrosis heterogeneity; some samples with low average fibrosis (low average sample CIF score) were correctly classified as MF-2 on the basis that ≥30% of the tissue contained more severe fibrosis (high regional CIF score). To investigate this further, we compared the ZPN trial samples taken at screening with an independent cohort of newly diagnosed and untreated MPNs in which PCA was used to combine average tile CIF score, tile score distribution, and heterogeneity of CIF score (Figure 2D). Plotting the ZPN screening samples onto this PCA of MPN “disease space” revealed that while most samples demonstrated such combined fibrosis features typical of primary or secondary myelofibrosis, several displayed features more typically seen in ET, pre-PMF, or PV.Having identified marked variation in both the fibrosis features at screening and average CIF scores of manually assigned MF grades for all samples, we assessed changes in fibrosis from screening to C4 and C9. This revealed an improvement in the average CIF score in 16 of 42 patients (38%) (Figure 3A). Notably, improvements in average CIF score by C9 appeared to be most marked in patients with higher CIF scores at screening, although no obvious ZPN dose-dependent effect was observed. The overall improvement in CIF score was similar to that of manually assessed fibrosis in which 15 of 41 patients (37%) had an improvement of at least one MF grade at either C4 or C9. However, there was notable discordance between manual and quantitative CIF fibrosis assessment for individual cases (Figure 2B,C), with only 6 of 41 cases (15%) demonstrating both an improvement in CIF score and manual MF grade.Next, we sought to correlate changes in CIF score with the secondary trial endpoints. We observed no significant association between a change in average sample CIF score and changes in disease-related anemia, thrombocytopenia, or constitutional symptoms (data not shown). However, we observed a trend toward an association between improving CIF score and best overall response as per modified International Working Group-Myeloproliferative Neoplasms Research and Treatment (IWG-MRT) criteria, with marrows from patients experiencing clinical improvement more likely to have a corresponding improvement in CIF score between screening and C9 (Figure 3B). Finally, logistic regression analysis was used to estimate the association of the treatment group, baseline anemia, or thrombocytopenia and average CIF score at screening with the reduction in average CIF score. This revealed that a higher average CIF score at screening was significantly associated with CIF score reduction for the 42 patients for whom samples were available for all three trial timepoints (Wald's test p < 0.01) (Figure 3C).Our analysis is the first to demonstrate the utility of AI-driven quantitative fibrosis analysis in a multicenter clinical trial of patients with myelofibrosis. Although CIF-based analysis is not designed to specify an MF grade, it provides an objective measure of fibrosis severity and heterogeneity within BMTs, which is beyond conventional manual grading criteria. Moreover, it enables objective comparison across sequential samples from individual patients and allows accurate comparison within trial cohorts. Our results raise important concerns over the subjectivity of conventional fibrosis assessment in myelofibrosis, with marked overlap in CIF scores seen between and within manually assigned MF grades, and poor concordance between manually assessed and CIF-determined fibrosis improvement. Unexpectedly, there was a marked variation in average CIF score at screening in a trial recruiting patients with high-risk primary or secondary MF, supported by our demonstration of striking cohort heterogeneity when compared to a separate cohort of MPN. Indeed, 39% (19/49) of the screening samples analyzed in this study demonstrated fibrotic features (average severity and heterogeneity) more typical of MPNs other than primary or secondary myelofibrosis (i.e., ET, PV, and pre-PMF). However, it should be noted that most patients recruited to this trial had high-risk disease (39/50 with IPSS Int-2/high risk) and 39/50 patients had received prior JAK2 inhibition. By contrast, our previously analyzed cohort of MPN included only newly diagnosed patients with no significant pretreatment. It remains unclear to what extent the inclusion of MPN patient samples with longstanding disease and/or significant pretreatment will influence our existing description of bone marrow fibrosis state in ongoing studies. Notwithstanding this caveat, our analysis suggests that variation in manual fibrosis assessment could adversely influence the accuracy and consistency of trials aiming to evaluate therapeutics targeting MF, and alternative methods for quantifying and defining fibrosis changes following therapy are indicated. This is particularly important given recent work questioning the role of marrow fibrosis assessment in evaluating outcomes in JAK inhibitor-naïve patients treated with momelotinib or ruxolitinib, particularly as the authors relied upon local fibrosis grading with no central review.13 Although we could not demonstrate evidence for a significant association between CIF score improvement and the secondary clinical endpoints, we had access to WSI from only 50 of the 97 recruited patients. This reflects challenges in collecting such data as part of post hoc analytical studies and highlights the value of including such analysis in the study protocols of future clinical trials looking to evaluate bone marrow morphological response. Our observation of a trend toward an association between improving CIF score and the best overall response as per IWG-MRT criteria warrants further evaluation of quantitative fibrosis analysis as a surrogate for clinical response in MPN trials aiming to stabilize or reverse marrow fibrosis.14Conception and design: Daniel Royston, Kerstin Trunzer, Korsuk Sirinukunwattana, Hosuk Ryou, Alan Aberdeen & Jens Rittscher. Collection and assembly of data: Kerstin Trunzer, Frank Peale, Brian Higgins, Pontus Lundberg, Claire N. Harrison, Olga K. Weinberg, Robert Hasserjian & Olga Pozdnyakova. Data analysis and interpretation: Hosuk Ryou, Korsuk Sirinukunwattana, Ruby Wood, Alan Aberdeen & Daniel Royston. Manuscript writing: Daniel Royston, Korsuk Sirinukunwattana, Kerstin Trunzer, Pontus Lundberg & Alan Aberdeen. Final approval of manuscript: All authors.Korsuk Sirinukunwattana, Alan Aberdeen, and Jens Rittscher are cofounders and equity holders of Ground Truth Labs Ltd. Daniel Royston provides consulting services to Ground Truth Labs Ltd. and Johnson & Johnson. Kerstin Trunzer and Pontus Lundberg are employees of F. Hoffmann-La Roche and have stock ownership. Brian Higgins is an employee of F. Hoffmann-La Roche and Genentech and has stock ownership. Frank Peale is an employee of Genentech and has stock ownership. Claire N. Harrison has received consulting fees from AbbVie, AOP, BMS, Constellation Pharmaceuticals, CTI BioPharma, Galecto, GSK, Karyopharm, Keros, MorphoSys, Novartis, Promedior, and Roche; honoraria from AbbVie, BMS, GSK, and Novartis; has advisory roles for Galecto and Keros; has received support from Novartis for attending meetings; and has a leadership or fiduciary role with the European Hematology Association and MPN Voice; and is an Editor of HemaSphere. The remaining authors declare no conflict of interest.This study was supported by F. Hoffmann-La Roche, Ltd.; Blood Cancer UK, Grant/Award Number: 23012; Cancer Research UK, Grant/Award Number: EDDPJT-May23/100034; EPSRC-funded Seebibyte programme (EP/M013774/1); and Ludwig Institute for Cancer Research, Oxford Branch.\",\"PeriodicalId\":12982,\"journal\":{\"name\":\"HemaSphere\",\"volume\":\"8 6\",\"pages\":\"\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2024-06-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1002/hem3.105\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"HemaSphere\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/hem3.105\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"HEMATOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"HemaSphere","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/hem3.105","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEMATOLOGY","Score":null,"Total":0}

引用次数: 0

摘要

将 ZPN 筛选样本绘制到 MPN "疾病空间 "的 PCA 上显示，虽然大多数样本显示了原发性或继发性骨髓纤维化的典型合并纤维化特征，但也有一些样本显示了更典型的 ET、前骨髓纤维化或 PV 特征。结果显示，42 例患者中有 16 例（38%）的平均 CIF 评分有所提高（图 3A）。值得注意的是，尽管没有观察到明显的 ZPN 剂量依赖效应，但在筛查时 CIF 评分较高的患者中，C9 时平均 CIF 评分的改善似乎最为明显。CIF 评分的总体改善情况与人工评估纤维化的情况相似，41 名患者中有 15 名（37%）在 C4 或 C9 时至少改善了一个 MF 等级。然而，手动和定量 CIF 纤维化评估在个别病例中存在明显的不一致（图 2B、C），41 例病例中只有 6 例（15%）同时显示 CIF 评分和手动 MF 分级均有改善。我们观察到，平均样本 CIF 评分的变化与疾病相关的贫血、血小板减少或体质症状的变化之间没有明显关联（数据未显示）。不过，我们观察到 CIF 评分的提高与根据国际骨髓增生性肿瘤研究和治疗工作组（IWG-MRT）修订标准得出的最佳总体反应之间存在关联趋势，在筛查至 C9 期间，临床症状有所改善的患者的骨髓更有可能获得 CIF 评分的相应提高（图 3B）。最后，利用逻辑回归分析估计了治疗组、基线贫血或血小板减少以及筛查时平均 CIF 评分与平均 CIF 评分降低的关系。我们的分析首次证明了人工智能驱动的纤维化定量分析在骨髓纤维化患者多中心临床试验中的实用性。虽然基于 CIF 的分析不是为了明确骨髓纤维化的分级，但它能客观衡量骨髓纤维化的严重程度和 BMT 内的异质性，这超出了传统的人工分级标准。此外，它还能对单个患者的连续样本进行客观比较，并在试验队列中进行准确比较。我们的结果引起了人们对骨髓纤维化常规纤维化评估主观性的关注，在人工分配的骨髓纤维化分级之间和内部，CIF评分有明显的重叠，人工评估和CIF确定的纤维化改善之间的一致性很差。出乎意料的是，在一项招募高风险原发性或继发性 MF 患者的试验中，筛查时的平均 CIF 评分存在明显差异，这与我们对 MPN 的单独队列进行比较时发现的显著队列异质性相吻合。事实上，在本研究分析的筛查样本中，39%（19/49）的样本表现出比原发性或继发性骨髓纤维化（即 ET、PV 和前骨髓纤维化）更典型的 MPN 纤维化特征（平均严重程度和异质性）。不过，需要注意的是，该试验招募的大多数患者都患有高风险疾病（39/50 患者的 IPSS 为 Int-2/高风险），39/50 患者曾接受过 JAK2 抑制剂治疗。相比之下，我们之前分析的骨髓增生性疾病患者队列只包括新诊断的患者，且未进行过重要的预处理。目前仍不清楚纳入长期患病和/或接受过重要治疗的 MPN 患者样本会在多大程度上影响我们正在进行的研究中对骨髓纤维化状态的现有描述。尽管有这一注意事项，但我们的分析表明，人工纤维化评估的差异可能会对旨在评估针对骨髓纤维化疗法的试验的准确性和一致性产生不利影响，因此需要采用其他方法来量化和定义治疗后的纤维化变化。这一点尤为重要，因为最近有研究质疑骨髓纤维化评估在评估接受莫美洛替尼或鲁索利替尼治疗的 JAK 抑制剂无效患者的预后中的作用，特别是作者依赖于局部纤维化分级，而没有进行中央审查。13 虽然我们无法证明 CIF 评分改善与次要临床终点之间存在显著关联，但我们只能获得 97 例招募患者中 50 例的 WSI。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Quantitative analysis of bone marrow fibrosis highlights heterogeneity in myelofibrosis and augments histological assessment: An Insight from a phase II clinical study of zinpentraxin alfa

查看原文本刊更多论文

Quantitative analysis of bone marrow fibrosis highlights heterogeneity in myelofibrosis and augments histological assessment: An Insight from a phase II clinical study of zinpentraxin alfa

Accurate assessment of bone marrow fibrosis is central to the diagnosis and assessment of patients with myeloproliferative neoplasms (MPNs).^1-3 However, European consensus criteria for fibrosis are subjective, only semiquantitative, and cannot fully capture sample fibrosis heterogeneity.^4-6 In response, we have recently demonstrated the potential of machine learning to improve the detection and quantitation of marrow fibrosis in MPN using routinely prepared bone marrow trephine (BMT) samples.⁷ Such approaches can support accurate MPN classification/risk stratification and provide quantitative analysis of fibrosis heterogeneity, with the potential to support clinical trial teams in the evaluation of current and novel antifibrotic therapies.⁶ Here, we report evidence of such utility in the context of stage 2 of a phase II study of zinpentraxin alfa in patients diagnosed with primary or secondary myelofibrosis (MF) [ClinicalTrials.gov identifier: NCT01981850]. The primary trial endpoint was bone marrow response (≥1 grade reduction from baseline fibrosis at any timepoint). Secondary endpoints included effects on disease-related anemia, thrombocytopenia, and constitutional symptoms.

Zinpentraxin alfa (ZPN; previously PRM-151) is a recombinant form of human pentraxin-2 (PTX2; also known as serum amyloid P component or SAP), a circulating endogenous regulator of the inflammatory response to tissue damage and a natural inhibitor of fibrosis.^8-10 In the open-label stage 1 of this phase 2 study, ZPN showed evidence of clinical activity and tolerable safety as monotherapy or in combination with ruxolitinib in patients with primary MF, post-polycythemia vera (PV) MF, or post-essential thrombocythemia (ET) MF.¹¹ A subsequent randomized dose-ranging study (stage 2) evaluated the efficacy and safety of three different doses of ZPN as monotherapy in patients with IPSS intermediate-1, intermediate-2, and high-risk primary MF, post-PV MF, or post-ET MF who were anemic or thrombocytopenic and ineligible for, intolerant of, or had an inadequate prior response to ruxolitinib.¹² Patients were randomized to receive 0.3, 3.0, or 10.0 mg/kg ZPN on Days 1, 3, and 5 of cycle 1 and every 4 weeks thereafter for up to nine cycles. Reticulin-stained BMTs from three timepoints (screening, cycle 4 [C4D1], and cycle 9 [C9D29]) were analyzed for a subset of patients enrolled in the stage 2 study for whom digital scanned images were available at all three timepoints (50/97) (Figure 1A,B). Prior manual assessment of marrow fibrosis had been performed as part of a blinded, independent central review by three expert hematopathologists. Quantitative assessment of fibrosis using Continuous Indexing of Fibrosis (CIF) was performed by automated analyses as previously described.⁷ Briefly, CIF analysis employs a ranking convolutional neural network (CNN) trained on images of reticulin-stained BMT slides to score image tiles for fibrosis severity. These tiles cover the analyzable marrow tissue and are used to generate fibrosis severity maps with output image scores (CIF scores) normalized between 0 and 1. Three sets of features relating to analyzed tiles are extracted from each sample: average tile CIF score, tile score distribution, and heterogeneity of CIF score. Visualization of these outputs into two-dimensional space is performed using principal component analysis (PCA) (Figure 1C).

A total of 142/157 (90.4%) BMT samples obtained from 50 patients at three timepoints were evaluable. Overall, there was a moderate correlation between the average sample CIF score and the manually assigned fibrosis grade for all samples (Spearman's rho = 0.39) (Figure 2A). However, there was a marked overlap in the distribution of CIF scores across fibrosis grades, most notably for samples assigned to grades MF-2 and MF-3. Approximately, 38% (n = 16) of MF-2 samples fell within the interquartile range of CIF distribution observed in MF-3, and around 48% of MF-3 (n = 45) samples fell within the interquartile range observed in MF-2. This result is in keeping with the recognized challenge of accurately distinguishing between these MF grade categories, although both are consistent with a diagnosis of overt myelofibrosis. Notably, several samples manually assessed as MF-2 had average CIF scores similar to or lower than those graded as MF-0 or MF-1. On review, we suspected this may reflect sample fibrosis heterogeneity; some samples with low average fibrosis (low average sample CIF score) were correctly classified as MF-2 on the basis that ≥30% of the tissue contained more severe fibrosis (high regional CIF score). To investigate this further, we compared the ZPN trial samples taken at screening with an independent cohort of newly diagnosed and untreated MPNs in which PCA was used to combine average tile CIF score, tile score distribution, and heterogeneity of CIF score (Figure 2D). Plotting the ZPN screening samples onto this PCA of MPN “disease space” revealed that while most samples demonstrated such combined fibrosis features typical of primary or secondary myelofibrosis, several displayed features more typically seen in ET, pre-PMF, or PV.

Having identified marked variation in both the fibrosis features at screening and average CIF scores of manually assigned MF grades for all samples, we assessed changes in fibrosis from screening to C4 and C9. This revealed an improvement in the average CIF score in 16 of 42 patients (38%) (Figure 3A). Notably, improvements in average CIF score by C9 appeared to be most marked in patients with higher CIF scores at screening, although no obvious ZPN dose-dependent effect was observed. The overall improvement in CIF score was similar to that of manually assessed fibrosis in which 15 of 41 patients (37%) had an improvement of at least one MF grade at either C4 or C9. However, there was notable discordance between manual and quantitative CIF fibrosis assessment for individual cases (Figure 2B,C), with only 6 of 41 cases (15%) demonstrating both an improvement in CIF score and manual MF grade.

Next, we sought to correlate changes in CIF score with the secondary trial endpoints. We observed no significant association between a change in average sample CIF score and changes in disease-related anemia, thrombocytopenia, or constitutional symptoms (data not shown). However, we observed a trend toward an association between improving CIF score and best overall response as per modified International Working Group-Myeloproliferative Neoplasms Research and Treatment (IWG-MRT) criteria, with marrows from patients experiencing clinical improvement more likely to have a corresponding improvement in CIF score between screening and C9 (Figure 3B). Finally, logistic regression analysis was used to estimate the association of the treatment group, baseline anemia, or thrombocytopenia and average CIF score at screening with the reduction in average CIF score. This revealed that a higher average CIF score at screening was significantly associated with CIF score reduction for the 42 patients for whom samples were available for all three trial timepoints (Wald's test p < 0.01) (Figure 3C).

Our analysis is the first to demonstrate the utility of AI-driven quantitative fibrosis analysis in a multicenter clinical trial of patients with myelofibrosis. Although CIF-based analysis is not designed to specify an MF grade, it provides an objective measure of fibrosis severity and heterogeneity within BMTs, which is beyond conventional manual grading criteria. Moreover, it enables objective comparison across sequential samples from individual patients and allows accurate comparison within trial cohorts. Our results raise important concerns over the subjectivity of conventional fibrosis assessment in myelofibrosis, with marked overlap in CIF scores seen between and within manually assigned MF grades, and poor concordance between manually assessed and CIF-determined fibrosis improvement. Unexpectedly, there was a marked variation in average CIF score at screening in a trial recruiting patients with high-risk primary or secondary MF, supported by our demonstration of striking cohort heterogeneity when compared to a separate cohort of MPN. Indeed, 39% (19/49) of the screening samples analyzed in this study demonstrated fibrotic features (average severity and heterogeneity) more typical of MPNs other than primary or secondary myelofibrosis (i.e., ET, PV, and pre-PMF). However, it should be noted that most patients recruited to this trial had high-risk disease (39/50 with IPSS Int-2/high risk) and 39/50 patients had received prior JAK2 inhibition. By contrast, our previously analyzed cohort of MPN included only newly diagnosed patients with no significant pretreatment. It remains unclear to what extent the inclusion of MPN patient samples with longstanding disease and/or significant pretreatment will influence our existing description of bone marrow fibrosis state in ongoing studies. Notwithstanding this caveat, our analysis suggests that variation in manual fibrosis assessment could adversely influence the accuracy and consistency of trials aiming to evaluate therapeutics targeting MF, and alternative methods for quantifying and defining fibrosis changes following therapy are indicated. This is particularly important given recent work questioning the role of marrow fibrosis assessment in evaluating outcomes in JAK inhibitor-naïve patients treated with momelotinib or ruxolitinib, particularly as the authors relied upon local fibrosis grading with no central review.¹³ Although we could not demonstrate evidence for a significant association between CIF score improvement and the secondary clinical endpoints, we had access to WSI from only 50 of the 97 recruited patients. This reflects challenges in collecting such data as part of post hoc analytical studies and highlights the value of including such analysis in the study protocols of future clinical trials looking to evaluate bone marrow morphological response. Our observation of a trend toward an association between improving CIF score and the best overall response as per IWG-MRT criteria warrants further evaluation of quantitative fibrosis analysis as a surrogate for clinical response in MPN trials aiming to stabilize or reverse marrow fibrosis.¹⁴

Conception and design: Daniel Royston, Kerstin Trunzer, Korsuk Sirinukunwattana, Hosuk Ryou, Alan Aberdeen & Jens Rittscher. Collection and assembly of data: Kerstin Trunzer, Frank Peale, Brian Higgins, Pontus Lundberg, Claire N. Harrison, Olga K. Weinberg, Robert Hasserjian & Olga Pozdnyakova. Data analysis and interpretation: Hosuk Ryou, Korsuk Sirinukunwattana, Ruby Wood, Alan Aberdeen & Daniel Royston. Manuscript writing: Daniel Royston, Korsuk Sirinukunwattana, Kerstin Trunzer, Pontus Lundberg & Alan Aberdeen. Final approval of manuscript: All authors.

Korsuk Sirinukunwattana, Alan Aberdeen, and Jens Rittscher are cofounders and equity holders of Ground Truth Labs Ltd. Daniel Royston provides consulting services to Ground Truth Labs Ltd. and Johnson & Johnson. Kerstin Trunzer and Pontus Lundberg are employees of F. Hoffmann-La Roche and have stock ownership. Brian Higgins is an employee of F. Hoffmann-La Roche and Genentech and has stock ownership. Frank Peale is an employee of Genentech and has stock ownership. Claire N. Harrison has received consulting fees from AbbVie, AOP, BMS, Constellation Pharmaceuticals, CTI BioPharma, Galecto, GSK, Karyopharm, Keros, MorphoSys, Novartis, Promedior, and Roche; honoraria from AbbVie, BMS, GSK, and Novartis; has advisory roles for Galecto and Keros; has received support from Novartis for attending meetings; and has a leadership or fiduciary role with the European Hematology Association and MPN Voice; and is an Editor of HemaSphere. The remaining authors declare no conflict of interest.

This study was supported by F. Hoffmann-La Roche, Ltd.; Blood Cancer UK, Grant/Award Number: 23012; Cancer Research UK, Grant/Award Number: EDDPJT-May23/100034; EPSRC-funded Seebibyte programme (EP/M013774/1); and Ludwig Institute for Cancer Research, Oxford Branch.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

HemaSphere Medicine-Hematology

CiteScore

6.10

自引率

4.50%

发文量

2776

审稿时长

7 weeks

期刊介绍： HemaSphere, as a publication, is dedicated to disseminating the outcomes of profoundly pertinent basic, translational, and clinical research endeavors within the field of hematology. The journal actively seeks robust studies that unveil novel discoveries with significant ramifications for hematology. In addition to original research, HemaSphere features review articles and guideline articles that furnish lucid synopses and discussions of emerging developments, along with recommendations for patient care. Positioned as the foremost resource in hematology, HemaSphere augments its offerings with specialized sections like HemaTopics and HemaPolicy. These segments engender insightful dialogues covering a spectrum of hematology-related topics, including digestible summaries of pivotal articles, updates on new therapies, deliberations on European policy matters, and other noteworthy news items within the field. Steering the course of HemaSphere are Editor in Chief Jan Cools and Deputy Editor in Chief Claire Harrison, alongside the guidance of an esteemed Editorial Board comprising international luminaries in both research and clinical realms, each representing diverse areas of hematologic expertise.