Lea D. Schlieben , Maria Giulia Carta , Evgeny A. Moskalev , Robert Stöhr , Markus Metzler , Manuel Besendörfer , Norbert Meidenbauer , Sabine Semrau , Rolf Janka , Robert Grützmann , Stefan Wiemann , Arndt Hartmann , Abbas Agaimy , Florian Haller , Fulvia Ferrazzi
{"title":"使用靶向 RNA 测序对蓝圆细胞小肉瘤进行机器学习辅助诊断","authors":"Lea D. Schlieben , Maria Giulia Carta , Evgeny A. Moskalev , Robert Stöhr , Markus Metzler , Manuel Besendörfer , Norbert Meidenbauer , Sabine Semrau , Rolf Janka , Robert Grützmann , Stefan Wiemann , Arndt Hartmann , Abbas Agaimy , Florian Haller , Fulvia Ferrazzi","doi":"10.1016/j.jmoldx.2024.02.002","DOIUrl":null,"url":null,"abstract":"<div><p>Small blue round cell sarcomas (SBRCSs) are a heterogeneous group of tumors with overlapping morphologic features but markedly varying prognosis. They are characterized by distinct chromosomal alterations, particularly rearrangements leading to gene fusions, whose detection currently represents the most reliable diagnostic marker. Ewing sarcomas are the most common SBRCSs, defined by gene fusions involving <em>EWSR1</em> and transcription factors of the ETS family, and the most frequent non–<em>EWSR1</em>-rearranged SBRCSs harbor a <em>CIC</em> rearrangement. Unfortunately, currently the identification of <em>CIC</em>::<em>DUX4</em> translocation events, the most common <em>CIC</em> rearrangement, is challenging. Here, we present a machine-learning approach to support SBRCS diagnosis that relies on gene expression profiles measured via targeted sequencing. The analyses on a curated cohort of 69 soft-tissue tumors showed markedly distinct expression patterns for SBRCS subgroups. A random forest classifier trained on Ewing sarcoma and <em>CIC</em>-rearranged cases predicted probabilities of being <em>CIC</em>-rearranged >0.9 for <em>CIC</em>-rearranged–like sarcomas and <0.6 for other SBRCSs. Testing on a retrospective cohort of 1335 routine diagnostic cases identified 15 candidate <em>CIC</em>-rearranged tumors with a probability >0.75, all of which were supported by expert histopathologic reassessment. Furthermore, the multigene random forest classifier appeared advantageous over using high <em>ETV4</em> expression alone, previously proposed as a surrogate to identify <em>CIC</em> rearrangement. Taken together, the expression-based classifier can offer valuable support for SBRCS pathologic diagnosis.</p></div>","PeriodicalId":50128,"journal":{"name":"Journal of Molecular Diagnostics","volume":"26 5","pages":"Pages 387-398"},"PeriodicalIF":3.4000,"publicationDate":"2024-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1525157824000461/pdfft?md5=1bdd20bb1eef2f4aac2f889d2553cfd5&pid=1-s2.0-S1525157824000461-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Machine Learning–Supported Diagnosis of Small Blue Round Cell Sarcomas Using Targeted RNA Sequencing\",\"authors\":\"Lea D. Schlieben , Maria Giulia Carta , Evgeny A. Moskalev , Robert Stöhr , Markus Metzler , Manuel Besendörfer , Norbert Meidenbauer , Sabine Semrau , Rolf Janka , Robert Grützmann , Stefan Wiemann , Arndt Hartmann , Abbas Agaimy , Florian Haller , Fulvia Ferrazzi\",\"doi\":\"10.1016/j.jmoldx.2024.02.002\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Small blue round cell sarcomas (SBRCSs) are a heterogeneous group of tumors with overlapping morphologic features but markedly varying prognosis. They are characterized by distinct chromosomal alterations, particularly rearrangements leading to gene fusions, whose detection currently represents the most reliable diagnostic marker. Ewing sarcomas are the most common SBRCSs, defined by gene fusions involving <em>EWSR1</em> and transcription factors of the ETS family, and the most frequent non–<em>EWSR1</em>-rearranged SBRCSs harbor a <em>CIC</em> rearrangement. Unfortunately, currently the identification of <em>CIC</em>::<em>DUX4</em> translocation events, the most common <em>CIC</em> rearrangement, is challenging. Here, we present a machine-learning approach to support SBRCS diagnosis that relies on gene expression profiles measured via targeted sequencing. The analyses on a curated cohort of 69 soft-tissue tumors showed markedly distinct expression patterns for SBRCS subgroups. A random forest classifier trained on Ewing sarcoma and <em>CIC</em>-rearranged cases predicted probabilities of being <em>CIC</em>-rearranged >0.9 for <em>CIC</em>-rearranged–like sarcomas and <0.6 for other SBRCSs. Testing on a retrospective cohort of 1335 routine diagnostic cases identified 15 candidate <em>CIC</em>-rearranged tumors with a probability >0.75, all of which were supported by expert histopathologic reassessment. Furthermore, the multigene random forest classifier appeared advantageous over using high <em>ETV4</em> expression alone, previously proposed as a surrogate to identify <em>CIC</em> rearrangement. Taken together, the expression-based classifier can offer valuable support for SBRCS pathologic diagnosis.</p></div>\",\"PeriodicalId\":50128,\"journal\":{\"name\":\"Journal of Molecular Diagnostics\",\"volume\":\"26 5\",\"pages\":\"Pages 387-398\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2024-02-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S1525157824000461/pdfft?md5=1bdd20bb1eef2f4aac2f889d2553cfd5&pid=1-s2.0-S1525157824000461-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Molecular Diagnostics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1525157824000461\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PATHOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Molecular Diagnostics","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1525157824000461","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PATHOLOGY","Score":null,"Total":0}
Machine Learning–Supported Diagnosis of Small Blue Round Cell Sarcomas Using Targeted RNA Sequencing
Small blue round cell sarcomas (SBRCSs) are a heterogeneous group of tumors with overlapping morphologic features but markedly varying prognosis. They are characterized by distinct chromosomal alterations, particularly rearrangements leading to gene fusions, whose detection currently represents the most reliable diagnostic marker. Ewing sarcomas are the most common SBRCSs, defined by gene fusions involving EWSR1 and transcription factors of the ETS family, and the most frequent non–EWSR1-rearranged SBRCSs harbor a CIC rearrangement. Unfortunately, currently the identification of CIC::DUX4 translocation events, the most common CIC rearrangement, is challenging. Here, we present a machine-learning approach to support SBRCS diagnosis that relies on gene expression profiles measured via targeted sequencing. The analyses on a curated cohort of 69 soft-tissue tumors showed markedly distinct expression patterns for SBRCS subgroups. A random forest classifier trained on Ewing sarcoma and CIC-rearranged cases predicted probabilities of being CIC-rearranged >0.9 for CIC-rearranged–like sarcomas and <0.6 for other SBRCSs. Testing on a retrospective cohort of 1335 routine diagnostic cases identified 15 candidate CIC-rearranged tumors with a probability >0.75, all of which were supported by expert histopathologic reassessment. Furthermore, the multigene random forest classifier appeared advantageous over using high ETV4 expression alone, previously proposed as a surrogate to identify CIC rearrangement. Taken together, the expression-based classifier can offer valuable support for SBRCS pathologic diagnosis.
期刊介绍:
The Journal of Molecular Diagnostics, the official publication of the Association for Molecular Pathology (AMP), co-owned by the American Society for Investigative Pathology (ASIP), seeks to publish high quality original papers on scientific advances in the translation and validation of molecular discoveries in medicine into the clinical diagnostic setting, and the description and application of technological advances in the field of molecular diagnostic medicine. The editors welcome for review articles that contain: novel discoveries or clinicopathologic correlations including studies in oncology, infectious diseases, inherited diseases, predisposition to disease, clinical informatics, or the description of polymorphisms linked to disease states or normal variations; the application of diagnostic methodologies in clinical trials; or the development of new or improved molecular methods which may be applied to diagnosis or monitoring of disease or disease predisposition.