Zofia Gross, Richard Veyrat-Masson, Béatrice Grange, Sarah Huet, Aurélie Verney, Alexandra Traverse-Glehen, Philippe Ruminy, Lucile Baseggio
{"title":"诊断外周血中的慢性 B 细胞淋巴增生性疾病 = 机器学习如何帮助解读流式细胞仪数据","authors":"Zofia Gross, Richard Veyrat-Masson, Béatrice Grange, Sarah Huet, Aurélie Verney, Alexandra Traverse-Glehen, Philippe Ruminy, Lucile Baseggio","doi":"10.1002/hon.3245","DOIUrl":null,"url":null,"abstract":"<p>Flow cytometry (FCM) has become a method of choice for immunologic characterization of chronic lymphoproliferative disease (CLPD). To reduce the potential subjectivities of FCM data interpretation, we developed a machine learning random forest algorithm (RF) allowing unsupervised analysis. This assay relies on 16 parameters obtained from our FCM screening panel, routinely used in the exploration of peripheral blood (PB) samples (mean fluorescence intensity values (MFI) of CD19, CD45, CD5, CD20, CD200, CD23, HLA-DR, CD10 in CD19-gated B cells, ratio of kappa/Lambda, and different ratios of MFI B-cells/T-cells [CD20, CD200, CD23]). The RF algorithm was trained and validated on a large cohort of more than 300 annotated different CLPD cases (chronic B-cell leukemia, mantle cell lymphoma, marginal zone lymphoma, follicular lymphoma, splenic red pulp lymphoma, hairy cell leukemia) and non-tumoral selected from PB samples. The RF algorithm was able to differentiate tumoral from non-tumoral B-cells in all cases and to propose a correct CLPD classification in more than 90% of cases. In conclusion the RF algorithm could be proposed as an interesting help to FCM data interpretation allowing a first B-cells CLPD diagnostic hypothesis and/or to guide the management of complementary analysis (additional immunologic markers and genetic).</p>","PeriodicalId":12882,"journal":{"name":"Hematological Oncology","volume":null,"pages":null},"PeriodicalIF":3.3000,"publicationDate":"2024-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Diagnosis of chronic B-cell lymphoproliferative disease in peripheral blood = how machine learning may help to the interpretation of flow cytometry data\",\"authors\":\"Zofia Gross, Richard Veyrat-Masson, Béatrice Grange, Sarah Huet, Aurélie Verney, Alexandra Traverse-Glehen, Philippe Ruminy, Lucile Baseggio\",\"doi\":\"10.1002/hon.3245\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Flow cytometry (FCM) has become a method of choice for immunologic characterization of chronic lymphoproliferative disease (CLPD). To reduce the potential subjectivities of FCM data interpretation, we developed a machine learning random forest algorithm (RF) allowing unsupervised analysis. This assay relies on 16 parameters obtained from our FCM screening panel, routinely used in the exploration of peripheral blood (PB) samples (mean fluorescence intensity values (MFI) of CD19, CD45, CD5, CD20, CD200, CD23, HLA-DR, CD10 in CD19-gated B cells, ratio of kappa/Lambda, and different ratios of MFI B-cells/T-cells [CD20, CD200, CD23]). The RF algorithm was trained and validated on a large cohort of more than 300 annotated different CLPD cases (chronic B-cell leukemia, mantle cell lymphoma, marginal zone lymphoma, follicular lymphoma, splenic red pulp lymphoma, hairy cell leukemia) and non-tumoral selected from PB samples. The RF algorithm was able to differentiate tumoral from non-tumoral B-cells in all cases and to propose a correct CLPD classification in more than 90% of cases. In conclusion the RF algorithm could be proposed as an interesting help to FCM data interpretation allowing a first B-cells CLPD diagnostic hypothesis and/or to guide the management of complementary analysis (additional immunologic markers and genetic).</p>\",\"PeriodicalId\":12882,\"journal\":{\"name\":\"Hematological Oncology\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2024-01-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Hematological Oncology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/hon.3245\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"HEMATOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Hematological Oncology","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/hon.3245","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"HEMATOLOGY","Score":null,"Total":0}
引用次数: 0
摘要
流式细胞术(FCM)已成为慢性淋巴细胞增生性疾病(CLPD)免疫学特征描述的首选方法。为了减少 FCM 数据解读的潜在主观性,我们开发了一种机器学习随机森林算法 (RF),允许进行无监督分析。这种检测方法依赖于从我们的 FCM 筛选面板中获得的 16 个参数,这些参数通常用于外周血(PB)样本的检测(CD19、CD45、CD5、CD20、CD200、CD23、HLA-DR、CD19 门控 B 细胞中的 CD10 的平均荧光强度值 (MFI)、kappa/Lambda 比率以及 MFI B 细胞/T 细胞 [CD20、CD200、CD23] 的不同比率)。RF 算法是在一大批 300 多个注释不同的 CLPD 病例(慢性 B 细胞白血病、套细胞淋巴瘤、边缘区淋巴瘤、滤泡淋巴瘤、脾红髓淋巴瘤、毛细胞白血病)和从 PB 样本中筛选出的非肿瘤病例上训练和验证的。射频算法能够区分所有病例中的肿瘤性和非肿瘤性 B 细胞,并对 90% 以上的病例提出了正确的 CLPD 分类。总之,RF 算法可以作为 FCM 数据解读的一个有趣的帮助,允许提出第一个 B 细胞 CLPD 诊断假设和/或指导补充分析(其他免疫标记物和基因)的管理。
Diagnosis of chronic B-cell lymphoproliferative disease in peripheral blood = how machine learning may help to the interpretation of flow cytometry data
Flow cytometry (FCM) has become a method of choice for immunologic characterization of chronic lymphoproliferative disease (CLPD). To reduce the potential subjectivities of FCM data interpretation, we developed a machine learning random forest algorithm (RF) allowing unsupervised analysis. This assay relies on 16 parameters obtained from our FCM screening panel, routinely used in the exploration of peripheral blood (PB) samples (mean fluorescence intensity values (MFI) of CD19, CD45, CD5, CD20, CD200, CD23, HLA-DR, CD10 in CD19-gated B cells, ratio of kappa/Lambda, and different ratios of MFI B-cells/T-cells [CD20, CD200, CD23]). The RF algorithm was trained and validated on a large cohort of more than 300 annotated different CLPD cases (chronic B-cell leukemia, mantle cell lymphoma, marginal zone lymphoma, follicular lymphoma, splenic red pulp lymphoma, hairy cell leukemia) and non-tumoral selected from PB samples. The RF algorithm was able to differentiate tumoral from non-tumoral B-cells in all cases and to propose a correct CLPD classification in more than 90% of cases. In conclusion the RF algorithm could be proposed as an interesting help to FCM data interpretation allowing a first B-cells CLPD diagnostic hypothesis and/or to guide the management of complementary analysis (additional immunologic markers and genetic).
期刊介绍:
Hematological Oncology considers for publication articles dealing with experimental and clinical aspects of neoplastic diseases of the hemopoietic and lymphoid systems and relevant related matters. Translational studies applying basic science to clinical issues are particularly welcomed. Manuscripts dealing with the following areas are encouraged:
-Clinical practice and management of hematological neoplasia, including: acute and chronic leukemias, malignant lymphomas, myeloproliferative disorders
-Diagnostic investigations, including imaging and laboratory assays
-Epidemiology, pathology and pathobiology of hematological neoplasia of hematological diseases
-Therapeutic issues including Phase 1, 2 or 3 trials as well as allogeneic and autologous stem cell transplantation studies
-Aspects of the cell biology, molecular biology, molecular genetics and cytogenetics of normal or diseased hematopoeisis and lymphopoiesis, including stem cells and cytokines and other regulatory systems.
Concise, topical review material is welcomed, especially if it makes new concepts and ideas accessible to a wider community. Proposals for review material may be discussed with the Editor-in-Chief. Collections of case material and case reports will be considered only if they have broader scientific or clinical relevance.