A strong internal promoter drives massive expression of YEATS-domain devoid MLLT3 transcripts in HSC and most lethal AML

IF 20.1 1区 医学 Q1 ONCOLOGY
Chloé Bessière, Ahmed Zamani, Romain Pfeifer, Sandra Dailhau, Camille Marchet, Benoit Guibert, Anthony Boureux, Raïssa Silva Da Silva, Nicolas Gilbert, Thérèse Commes, Fabienne Meggetto, Christian Touriol, Christian Récher, Marina Bousquet, Stéphane Pyronnet
{"title":"A strong internal promoter drives massive expression of YEATS-domain devoid MLLT3 transcripts in HSC and most lethal AML","authors":"Chloé Bessière,&nbsp;Ahmed Zamani,&nbsp;Romain Pfeifer,&nbsp;Sandra Dailhau,&nbsp;Camille Marchet,&nbsp;Benoit Guibert,&nbsp;Anthony Boureux,&nbsp;Raïssa Silva Da Silva,&nbsp;Nicolas Gilbert,&nbsp;Thérèse Commes,&nbsp;Fabienne Meggetto,&nbsp;Christian Touriol,&nbsp;Christian Récher,&nbsp;Marina Bousquet,&nbsp;Stéphane Pyronnet","doi":"10.1002/cac2.12650","DOIUrl":null,"url":null,"abstract":"<p>The AF9 (protein AF9) transcription factor, encoded by <i>MLLT3</i> (<i>mixed-lineage leukemia translocated to 3</i>) on chromosome 9, functions as a chromatin reader. Through its N-terminal YEATS (Yaf9, ENL, AF9, Taf14, and Sas5) protein domain, it interacts with acetylated [<span>1</span>] or crotonylated [<span>2</span>] histone H3, as well as with the PAF1 (RNA polymerase II-associated factor 1 homolog) and P-TEFb (positive transcription elongation factor b) components of the super elongation complex (SEC). AF9 also interacts through its poly-serine domain (Poly-Ser) with the TFIID (Transcription factor II D) subunit of the RNA polymerase II (RNApol II) complex. In addition, its C-terminal transactivation domain, AHD (nuclear anchorage protein1 homology domain), binds other SEC components, such as AFF1 and AFF4 (ALF transcription elongation factor 1 or 4), as well as transcription regulators CBX8 (chromobox 8), DOT1L (disruptor of telomeric silencing 1 like), and BCOR (B cell lymphoma 6 corepressor), as reviewed by Kabra &amp; Bushweller [<span>3</span>] (Figure 1A). Thus, MLLT3 is an integral part of the SEC, which is essential for optimizing the catalytic activity of RNApol II transcription at specific genome loci.</p><p>Several studies have indicated that <i>MLLT3</i> is highly and specifically expressed in hematopoietic stem cells (HSCs), but it is rapidly and significantly downregulated during normal differentiation or immediately after HSCs are placed in <i>ex vivo</i> culture. In both scenarios, this shutdown parallels the rapid loss of stemness. Consistently, ectopic expression of <i>MLLT3</i> significantly prolongs self-renewal capacity of HSCs, suggesting that <i>MLLT3</i> is a crucial factor for HSC maintenance [<span>4</span>].</p><p>Based on standard quantification of RNA-sequencing reads mapping to the <i>MLLT3</i> locus, we first confirmed that, compared to the <i>MLLT1</i> paralogue used as an internal control, <i>MLLT3</i> expression was significantly higher in CD34<sup>+</sup> cells than in mature lymphocytes, granulocytes, or monocytes from healthy samples of the Leucegene dataset (Leucegene-NH, detailed in Supplementary Information) (Figure 1B, left panel). To refine this observation, made in CD34<sup>+</sup> cells containing a mixture of progenitors but only a few HSCs, we repeated the analysis in HSCs and various stages of progenitor cells sorted form healthy donors (IUCT-NH, detailed in Supplementary Information). The data clearly confirmed that <i>MLLT3</i> is highly expressed in HSCs but rapidly declines as differentiation proceeds (Figure 1B, right panel).</p><p>However, closer examination using a k-mer approach (described in Materials and Methods in Supplementary Information), which visualized RNA-sequencing read alignment along the 11 exons (E1-E11) of the reference <i>MLLT3</i> transcript, revealed an unexpected profile. Strikingly, the substantial <i>MLLT3</i> expression detected in HSCs was driven by a sharp and pronounced increase in reads starting precisely at the first nucleotide of exon E6 (Figure 1C). This unexpected profile was absent when examining <i>MLLT1</i> expression in the hematopoietic lineage (Supplementary Figure S1). These findings suggest the expression of one or more 5’ end shortened <i>MLLT3</i> transcripts arising from a hematologic lineage-specific internal promoter located in <i>MLLT3</i> intron 5.</p><p>A new set of specific and successive k-mers covering the entire <i>MLLT3</i> intron 5 revealed the presence of two novel segments retained in poly(A)<sup>+</sup> RNAs. Apart from the 5’ end of the first segment, which lacked a clearly defined starting point consistent with a probable transcription start site, consensus donor and acceptor splice sites flanked these two segments. These unexpected exons were designated as exon E6a and exon E6b/b’ (Figure 1D). Additional k-mer analyses confirmed that these exons were spliced to exon E6, resulting in three possible splice variants: E6a-E6b-E6, E6a-E6b'-E6, and E6a-E6, with the E6a-E6 variant being the most predominant (Figure 1E and Supplementary Figure S2). This assortment of novel CD34<sup>+</sup>-specific alternative exons was further validated using standard Sanger sequencing following RT-PCR amplification with specific primers (Supplementary Figure S3).</p><p>Next, exploration of public CHIP-Seq (chromatin immunoprecipitation followed by sequencing) and CAGE (mRNA 5’ cap analysis of gene expression) datasets revealed the existence of an alternative P2 promoter in addition to the canonical P1 promoter. Associated with an active promoter H3K4me3 mark and a CAGE peak, this P2 promoter, found exclusively in immature CD34<sup>+</sup> cells but absent in CD14<sup>+</sup> monocytes, was predicted to drive the expression of transcripts beginning with exon E6a (Supplementary Figure S4). Transcript-specific k-mer quantifications confirmed that the exceptionally high overall level of <i>MLLT3</i> observed in HSCs was primarily due to these shorter transcripts starting with exon E6a. We have collectively named these shorter transcripts <i>s-MLLT3</i>, in contrast to the reference full-length <i>MLLT3</i> transcript, referred to as <i>l-MLLT3</i> (Figure 1F).</p><p>These findings revealed the existence of an HSC-specific internal promoter (P2) that drives the expression of shorter <i>MLLT3</i> transcripts (<i>s-MLLT3</i> mRNAs, Figure 1G). To evaluate the translational potential of these <i>s-MLLT3</i> transcripts, we cloned a C-terminal myc-tagged version of the predominant E6a-E6 variant into an expression vector (Figure 1H, top), and assessed its protein expression capacity in the HEK (human embryonic kidney) cell line by western blotting. Compared with a similar vector encoding <i>l-MLLT3</i>, the E6a-E6 <i>s-MLLT3</i> transcript produced shorter s-MLLT3 proteins, (Figure 1H, bottom left). A western blotting analysis of endogenous proteins confirmed the existence of these shorter forms in CD34<sup>+</sup> cells but not in the K-562 leukemic cell line, which served as a low-expressing control (Figure 1H, bottom middle). Isoform-specific RT-qPCR quantification further corroborated that, compared to CD34<sup>+</sup> cells and full-length <i>l-MLLT3</i>, expression of the three shorter forms (E6a-E6b, E6a-E6b’, and E6a-E6) was very low in K-562 cells (Figure 1H, bottom right). These western blot results identified at least two distinct s-MLLT3 proteins, likely arising from alternative translation initiation codons (AUG2 and AUG3) located in exon 7, producing proteins that retain the C-terminal AHD transactivation domain but lack the YEATS and Poly-Ser domains (Supplementary Figure S5).</p><p>Interestingly, the shorter <i>MLLT3</i> alternative transcripts initiate within intron 5, which is also a frequent site of chromosomal translocation in acute myeloid leukemia (AML). Specifically, <i>MLLT3</i> intron 5 is the primary site of the t(9;11) chromosomal translocation, leading to fusion with <i>KMT2A</i> (lysine (K) methyl transferase 2A) [<span>5</span>]. KMT2A is a chromatin writer that deposits epigenetic marks indicating active transcription at specific loci, particularly the <i>HOX</i> genes required for hematopoiesis [<span>6</span>]. The resulting <i>KMT2A-MLLT3</i> fusion transcript encodes a chimeric protein composing the N-terminal third of KMT2A fused to the C-terminal portion of MLLT3, which lacks the YEATS domain but retains the AHD transactivation domain (Supplementary Figure S6). To investigate global <i>MLLT3</i> expression in AML, we analyzed two RNA-Seq datasets: IUCT-AML [<span>7</span>] and Beat-AML [<span>8</span>] (detailed in Supplementary Information). Compared with <i>MLLT1</i>, the expression range of <i>MLLT3</i> was broader, showing a &gt; 120-fold amplitude across samples (Figure 1I). Isoform- specific k-mers targeting alternative (E6a-E6 + E6b/b'-E6 for <i>s-MLLT3</i>) and canonical (E5-E6 for <i>l-MLLT3</i>) exon-exon junctions revealed no correlation between <i>s-MLLT3</i> and <i>l-MLLT3</i> expressions. Approximately 20% of samples exhibited high <i>s-MLLT3</i> levels, whereas in other samples, <i>s-MLLT3</i> was undetectable despite <i>l-MLLT3</i> expression (Supplementary Figure S7). This finding suggests differential regulation of the two promoters and/or the resulting transcripts in AML.</p><p>We next investigated whether the ∼20% of AML samples with high <i>s-MLLT3</i> expression represented a distinct clinical entity. Samples which were either <i>MECOM</i><sup>+</sup> (myelodysplasia syndrome 1 and EVI1 complex locus) or <i>GATA2-MECOM</i> (Supplementary Figure S8), and those with mutant <i>RUNX1</i> (runt-related transcription factor 1) and/or <i>TP53</i> (tumor protein p53) showed higher levels of <i>l-MLLT3</i>, with even greater levels of <i>s-MLLT3</i> (Supplementary Figure S9, left and middle). No correlation with the <i>KMT2A-MLLT3</i> translocation was observed. Conversely, <i>NPM1</i> (nucleophosmin 1)-mutated samples exhibited very low levels of both transcripts (Supplementary Figure S9, left and middle). Notably, elevated <i>s-MLLT3</i> or <i>l-MLLT3</i> levels were associated with an adverse ELN2017 (European LeukemiaNet 2017) score (Supplementary Figure S9, right). Median expression-based group separation revealed that patients with the highest overall <i>MLLT3</i> expression had worse survival outcomes (Figure 1J, left). However, given the lack of correlation between <i>s-MLLT3</i> and <i>l-MLLT3</i> expression in AML samples, we assessed their independent impacts on survival. Isoform-specific k-mers showed that <i>s-MLLT3</i> (but not <i>l-MLLT3</i>) expression significantly influenced poor patient survival (Figure 1J, middle and right panels).</p><p>In conclusion, these findings demonstrate the existence of an internal promoter within the <i>MLLT3</i> locus, driving expression of 5’-end-shortened transcripts encoding an AF9 protein lacking the YEAST chromatin reader domain. These alternative transcripts are highly expressed in HSCs and in ∼20% of AML patients with the worst survival outcomes. These results suggest that the role of the <i>MLLT3</i> locus in HSCs and in AML should be re-evaluated, considering the expression of this YEATS-domain-devoid AF9 transcription factor.</p><p>Stéphane Pyronnet wrote the manuscript. Chloé Bessière, Ahmed Zamani, Sandra Dailhau, Christian Récher, Marina Bousquet, and Stéphane Pyronnet contributed to the study design, conception, and data analysis. Ahmed Zamani, Romain Pfeifer, and Marina Bousquet designed and performed biological experiments. Chloé Bessière, Sandra Dailhau, Camille Marchet, Benoit Guibert, Anthony Boureux, Raïssa Silva Da Silva, Nicolas Gilbert, and Thérèse Commes developed the k-mer-based bioinformatics tools. Fabienne Meggetto, Christian Touriol, and Marina Bousquet provided comments on and contributed to editing the manuscript.</p><p>Christian Récher declares a consulting or advisory role with Abbvie, Amgen, Astellas, BMS, Boehringer, Jazz Pharmaceuticals, and Servier, and has received research funding from Abbvie, Amgen, Astellas, BMS, Iqvia, and Jazz Pharmaceuticals. All other authors declare no conflict of interest.</p><p>This work was funded by INSERM, Institut Universitaire du Cancer-Toulouse (IUCT), Labex Toucan, Fondation Leucémie Espoir, Ligue Régionale Contre le Cancer, Fondation ARC, Association Laurette Fugain, Agence Nationale de la Recherche (ANR-18-CE45-0020 Transipedia) and (ANR-22-CE45-0007 full-RNA). Chloé Bessière was supported by Fondation de France, Ahmed Zamani and Raïssa Silva Da Silva by Ligue Nationale Contre le Cancer, Romain Pfeifer by Ministère de l'Enseignement Supérieur et de la Recherche and Fondation ARC, Sandra Dailhau by Ministère de la Santé and Institut National du Cancer (INCA, PRT-K-2022-184CircOma).</p><p>In accordance with French law, each anonymous volunteer donor or patient was informed, and the HIMIP collection has been declared to the Ministère de l'Enseignement Supérieur et de la Recherche (DC 2008-307). A transfer agreement was obtained (AC 2008-129) after approval by the local ethical committee, Comité de Protection des Personnes Sud-Ouest et Outremer II, and the local Research Ethics Committee of the Etablissement Français du Sang (Toulouse, France, agreement #21PLER2021-007). Clinical and biological annotations have also been declared to the Comité National Informatique et Libertés (CNIL). This study was conducted in accordance with the Declaration of Helsinki.</p>","PeriodicalId":9495,"journal":{"name":"Cancer Communications","volume":"45 3","pages":"380-385"},"PeriodicalIF":20.1000,"publicationDate":"2025-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cac2.12650","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cancer Communications","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cac2.12650","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

The AF9 (protein AF9) transcription factor, encoded by MLLT3 (mixed-lineage leukemia translocated to 3) on chromosome 9, functions as a chromatin reader. Through its N-terminal YEATS (Yaf9, ENL, AF9, Taf14, and Sas5) protein domain, it interacts with acetylated [1] or crotonylated [2] histone H3, as well as with the PAF1 (RNA polymerase II-associated factor 1 homolog) and P-TEFb (positive transcription elongation factor b) components of the super elongation complex (SEC). AF9 also interacts through its poly-serine domain (Poly-Ser) with the TFIID (Transcription factor II D) subunit of the RNA polymerase II (RNApol II) complex. In addition, its C-terminal transactivation domain, AHD (nuclear anchorage protein1 homology domain), binds other SEC components, such as AFF1 and AFF4 (ALF transcription elongation factor 1 or 4), as well as transcription regulators CBX8 (chromobox 8), DOT1L (disruptor of telomeric silencing 1 like), and BCOR (B cell lymphoma 6 corepressor), as reviewed by Kabra & Bushweller [3] (Figure 1A). Thus, MLLT3 is an integral part of the SEC, which is essential for optimizing the catalytic activity of RNApol II transcription at specific genome loci.

Several studies have indicated that MLLT3 is highly and specifically expressed in hematopoietic stem cells (HSCs), but it is rapidly and significantly downregulated during normal differentiation or immediately after HSCs are placed in ex vivo culture. In both scenarios, this shutdown parallels the rapid loss of stemness. Consistently, ectopic expression of MLLT3 significantly prolongs self-renewal capacity of HSCs, suggesting that MLLT3 is a crucial factor for HSC maintenance [4].

Based on standard quantification of RNA-sequencing reads mapping to the MLLT3 locus, we first confirmed that, compared to the MLLT1 paralogue used as an internal control, MLLT3 expression was significantly higher in CD34+ cells than in mature lymphocytes, granulocytes, or monocytes from healthy samples of the Leucegene dataset (Leucegene-NH, detailed in Supplementary Information) (Figure 1B, left panel). To refine this observation, made in CD34+ cells containing a mixture of progenitors but only a few HSCs, we repeated the analysis in HSCs and various stages of progenitor cells sorted form healthy donors (IUCT-NH, detailed in Supplementary Information). The data clearly confirmed that MLLT3 is highly expressed in HSCs but rapidly declines as differentiation proceeds (Figure 1B, right panel).

However, closer examination using a k-mer approach (described in Materials and Methods in Supplementary Information), which visualized RNA-sequencing read alignment along the 11 exons (E1-E11) of the reference MLLT3 transcript, revealed an unexpected profile. Strikingly, the substantial MLLT3 expression detected in HSCs was driven by a sharp and pronounced increase in reads starting precisely at the first nucleotide of exon E6 (Figure 1C). This unexpected profile was absent when examining MLLT1 expression in the hematopoietic lineage (Supplementary Figure S1). These findings suggest the expression of one or more 5’ end shortened MLLT3 transcripts arising from a hematologic lineage-specific internal promoter located in MLLT3 intron 5.

A new set of specific and successive k-mers covering the entire MLLT3 intron 5 revealed the presence of two novel segments retained in poly(A)+ RNAs. Apart from the 5’ end of the first segment, which lacked a clearly defined starting point consistent with a probable transcription start site, consensus donor and acceptor splice sites flanked these two segments. These unexpected exons were designated as exon E6a and exon E6b/b’ (Figure 1D). Additional k-mer analyses confirmed that these exons were spliced to exon E6, resulting in three possible splice variants: E6a-E6b-E6, E6a-E6b'-E6, and E6a-E6, with the E6a-E6 variant being the most predominant (Figure 1E and Supplementary Figure S2). This assortment of novel CD34+-specific alternative exons was further validated using standard Sanger sequencing following RT-PCR amplification with specific primers (Supplementary Figure S3).

Next, exploration of public CHIP-Seq (chromatin immunoprecipitation followed by sequencing) and CAGE (mRNA 5’ cap analysis of gene expression) datasets revealed the existence of an alternative P2 promoter in addition to the canonical P1 promoter. Associated with an active promoter H3K4me3 mark and a CAGE peak, this P2 promoter, found exclusively in immature CD34+ cells but absent in CD14+ monocytes, was predicted to drive the expression of transcripts beginning with exon E6a (Supplementary Figure S4). Transcript-specific k-mer quantifications confirmed that the exceptionally high overall level of MLLT3 observed in HSCs was primarily due to these shorter transcripts starting with exon E6a. We have collectively named these shorter transcripts s-MLLT3, in contrast to the reference full-length MLLT3 transcript, referred to as l-MLLT3 (Figure 1F).

These findings revealed the existence of an HSC-specific internal promoter (P2) that drives the expression of shorter MLLT3 transcripts (s-MLLT3 mRNAs, Figure 1G). To evaluate the translational potential of these s-MLLT3 transcripts, we cloned a C-terminal myc-tagged version of the predominant E6a-E6 variant into an expression vector (Figure 1H, top), and assessed its protein expression capacity in the HEK (human embryonic kidney) cell line by western blotting. Compared with a similar vector encoding l-MLLT3, the E6a-E6 s-MLLT3 transcript produced shorter s-MLLT3 proteins, (Figure 1H, bottom left). A western blotting analysis of endogenous proteins confirmed the existence of these shorter forms in CD34+ cells but not in the K-562 leukemic cell line, which served as a low-expressing control (Figure 1H, bottom middle). Isoform-specific RT-qPCR quantification further corroborated that, compared to CD34+ cells and full-length l-MLLT3, expression of the three shorter forms (E6a-E6b, E6a-E6b’, and E6a-E6) was very low in K-562 cells (Figure 1H, bottom right). These western blot results identified at least two distinct s-MLLT3 proteins, likely arising from alternative translation initiation codons (AUG2 and AUG3) located in exon 7, producing proteins that retain the C-terminal AHD transactivation domain but lack the YEATS and Poly-Ser domains (Supplementary Figure S5).

Interestingly, the shorter MLLT3 alternative transcripts initiate within intron 5, which is also a frequent site of chromosomal translocation in acute myeloid leukemia (AML). Specifically, MLLT3 intron 5 is the primary site of the t(9;11) chromosomal translocation, leading to fusion with KMT2A (lysine (K) methyl transferase 2A) [5]. KMT2A is a chromatin writer that deposits epigenetic marks indicating active transcription at specific loci, particularly the HOX genes required for hematopoiesis [6]. The resulting KMT2A-MLLT3 fusion transcript encodes a chimeric protein composing the N-terminal third of KMT2A fused to the C-terminal portion of MLLT3, which lacks the YEATS domain but retains the AHD transactivation domain (Supplementary Figure S6). To investigate global MLLT3 expression in AML, we analyzed two RNA-Seq datasets: IUCT-AML [7] and Beat-AML [8] (detailed in Supplementary Information). Compared with MLLT1, the expression range of MLLT3 was broader, showing a > 120-fold amplitude across samples (Figure 1I). Isoform- specific k-mers targeting alternative (E6a-E6 + E6b/b'-E6 for s-MLLT3) and canonical (E5-E6 for l-MLLT3) exon-exon junctions revealed no correlation between s-MLLT3 and l-MLLT3 expressions. Approximately 20% of samples exhibited high s-MLLT3 levels, whereas in other samples, s-MLLT3 was undetectable despite l-MLLT3 expression (Supplementary Figure S7). This finding suggests differential regulation of the two promoters and/or the resulting transcripts in AML.

We next investigated whether the ∼20% of AML samples with high s-MLLT3 expression represented a distinct clinical entity. Samples which were either MECOM+ (myelodysplasia syndrome 1 and EVI1 complex locus) or GATA2-MECOM (Supplementary Figure S8), and those with mutant RUNX1 (runt-related transcription factor 1) and/or TP53 (tumor protein p53) showed higher levels of l-MLLT3, with even greater levels of s-MLLT3 (Supplementary Figure S9, left and middle). No correlation with the KMT2A-MLLT3 translocation was observed. Conversely, NPM1 (nucleophosmin 1)-mutated samples exhibited very low levels of both transcripts (Supplementary Figure S9, left and middle). Notably, elevated s-MLLT3 or l-MLLT3 levels were associated with an adverse ELN2017 (European LeukemiaNet 2017) score (Supplementary Figure S9, right). Median expression-based group separation revealed that patients with the highest overall MLLT3 expression had worse survival outcomes (Figure 1J, left). However, given the lack of correlation between s-MLLT3 and l-MLLT3 expression in AML samples, we assessed their independent impacts on survival. Isoform-specific k-mers showed that s-MLLT3 (but not l-MLLT3) expression significantly influenced poor patient survival (Figure 1J, middle and right panels).

In conclusion, these findings demonstrate the existence of an internal promoter within the MLLT3 locus, driving expression of 5’-end-shortened transcripts encoding an AF9 protein lacking the YEAST chromatin reader domain. These alternative transcripts are highly expressed in HSCs and in ∼20% of AML patients with the worst survival outcomes. These results suggest that the role of the MLLT3 locus in HSCs and in AML should be re-evaluated, considering the expression of this YEATS-domain-devoid AF9 transcription factor.

Stéphane Pyronnet wrote the manuscript. Chloé Bessière, Ahmed Zamani, Sandra Dailhau, Christian Récher, Marina Bousquet, and Stéphane Pyronnet contributed to the study design, conception, and data analysis. Ahmed Zamani, Romain Pfeifer, and Marina Bousquet designed and performed biological experiments. Chloé Bessière, Sandra Dailhau, Camille Marchet, Benoit Guibert, Anthony Boureux, Raïssa Silva Da Silva, Nicolas Gilbert, and Thérèse Commes developed the k-mer-based bioinformatics tools. Fabienne Meggetto, Christian Touriol, and Marina Bousquet provided comments on and contributed to editing the manuscript.

Christian Récher declares a consulting or advisory role with Abbvie, Amgen, Astellas, BMS, Boehringer, Jazz Pharmaceuticals, and Servier, and has received research funding from Abbvie, Amgen, Astellas, BMS, Iqvia, and Jazz Pharmaceuticals. All other authors declare no conflict of interest.

This work was funded by INSERM, Institut Universitaire du Cancer-Toulouse (IUCT), Labex Toucan, Fondation Leucémie Espoir, Ligue Régionale Contre le Cancer, Fondation ARC, Association Laurette Fugain, Agence Nationale de la Recherche (ANR-18-CE45-0020 Transipedia) and (ANR-22-CE45-0007 full-RNA). Chloé Bessière was supported by Fondation de France, Ahmed Zamani and Raïssa Silva Da Silva by Ligue Nationale Contre le Cancer, Romain Pfeifer by Ministère de l'Enseignement Supérieur et de la Recherche and Fondation ARC, Sandra Dailhau by Ministère de la Santé and Institut National du Cancer (INCA, PRT-K-2022-184CircOma).

In accordance with French law, each anonymous volunteer donor or patient was informed, and the HIMIP collection has been declared to the Ministère de l'Enseignement Supérieur et de la Recherche (DC 2008-307). A transfer agreement was obtained (AC 2008-129) after approval by the local ethical committee, Comité de Protection des Personnes Sud-Ouest et Outremer II, and the local Research Ethics Committee of the Etablissement Français du Sang (Toulouse, France, agreement #21PLER2021-007). Clinical and biological annotations have also been declared to the Comité National Informatique et Libertés (CNIL). This study was conducted in accordance with the Declaration of Helsinki.

Abstract Image

在HSC和大多数致死性AML中,一个强大的内部启动子驱动yeats结构域缺失的MLLT3转录本的大量表达。
AF9(蛋白AF9)转录因子由9号染色体上的MLLT3(混合谱系白血病易位3)编码,起染色质读取器的作用。通过其n端YEATS (Yaf9、ENL、AF9、Taf14和Sas5)蛋白结构域,它可以与乙酰化[1]或巴酰基化[2]组蛋白H3,以及超延伸复合物(SEC)的PAF1 (RNA聚合酶ii相关因子1同源物)和P-TEFb(正转录延伸因子b)组分相互作用。AF9还通过其多丝氨酸结构域(Poly-Ser)与RNA聚合酶II (RNApol II)复合物的TFIID(转录因子IID)亚基相互作用。此外,它的c端反激活结构域AHD(核锚定蛋白1同源结构域)结合其他SEC成分,如AFF1和AFF4 (ALF转录延伸因子1或4),以及转录调节因子CBX8 (chromobox 8), DOT1L(端粒沉默样物的中断物)和BCOR (B细胞淋巴瘤6共抑制因子),如Kabra &amp;Bushweller[3](图1A)。因此,MLLT3是SEC的一个组成部分,对于优化特定基因组位点上RNApol II转录的催化活性至关重要。多项研究表明,MLLT3在造血干细胞(hematopoietic stem cells, hsc)中高度特异性表达,但在正常分化过程中或hsc体外培养后立即迅速显著下调。在这两种情况下,这种关闭与干性的迅速丧失是平行的。与此一致,MLLT3的异位表达显著延长了HSC的自我更新能力,这表明MLLT3是HSC维持的关键因素。基于MLLT3位点的rna测序reads的标准定量,我们首先证实,与作为内部对照的MLLT1旁系谱相比,MLLT3在CD34+细胞中的表达明显高于来自Leucegene数据集(Leucegene- nh,详见补充信息)的健康样本中的成熟淋巴细胞、粒细胞或单核细胞(图1B,左面板)。为了完善这一观察结果,我们在含有混合祖细胞但只有少数造血干细胞的CD34+细胞中重复了造血干细胞和从健康供体中分选的不同阶段的祖细胞的分析(IUCT-NH,详见补充信息)。数据清楚地证实,MLLT3在hsc中高度表达,但随着分化的进行迅速下降(图1B,右图)。然而,使用k-mer方法(在补充信息中的材料和方法中描述)对参考MLLT3转录本的11个外显子(E1-E11)的rna测序读取比对进行更仔细的检查,揭示了一个意想不到的特征。引人注目的是,在hsc中检测到的大量MLLT3表达是由精确地从外显子E6的第一个核苷酸开始的读取量急剧和显著增加所驱动的(图1C)。当检测造血谱系中的MLLT1表达时,这种意想不到的特征是不存在的(补充图S1)。这些发现表明,位于MLLT3内含子5的血液学谱系特异性内部启动子表达了一个或多个5 '端缩短的MLLT3转录本。一组新的覆盖整个MLLT3内含子5的特异性连续k-mers揭示了poly(A)+ rna中保留的两个新片段的存在。除了第一个片段的5 '端缺乏明确定义的起始点与可能的转录起始点一致之外,共识的供体和受体剪接位点位于这两个片段的两侧。这些意想不到的外显子被命名为外显子E6a和E6b/b’(图1D)。额外的k-mer分析证实,这些外显子被剪接到外显子E6上,导致三种可能的剪接变体:E6a-E6b-E6、E6a-E6b'-E6和E6a-E6,其中E6a-E6变体是最主要的(图1E和补充图S2)。这种新的CD34+特异性外显子的分类使用标准Sanger测序进一步验证,随后使用特定引物进行RT-PCR扩增(补充图S3)。接下来,对公共CHIP-Seq(染色质免疫沉淀后测序)和CAGE(基因表达的mRNA 5 '帽分析)数据集的探索发现,除了典型的P1启动子外,还存在另一个P2启动子。这个P2启动子与活性启动子H3K4me3标记和CAGE峰相关,仅在未成熟的CD34+细胞中发现,但在CD14+单核细胞中不存在,预计会驱动以外显子E6a开始的转录本的表达(补充图S4)。转录特异性k-mer定量证实,hsc中观察到的MLLT3总体水平异常高,主要是由于这些从E6a外显子开始的较短转录本。我们将这些较短的转录本统称为s-MLLT3,而将参考全长的MLLT3转录本称为l-MLLT3(图1F)。 这些发现揭示了hsc特异性内部启动子(P2)的存在,该启动子驱动较短的MLLT3转录物(s-MLLT3 mrna,图1G)的表达。为了评估这些s-MLLT3转录本的翻译潜力,我们将主要的E6a-E6变体的c端myc标记版本克隆到表达载体中(图1H,顶部),并通过western blotting评估其在HEK(人胚胎肾)细胞系中的蛋白表达能力。与编码l-MLLT3的类似载体相比,E6a-E6 s-MLLT3转录物产生了更短的s-MLLT3蛋白(图1H,左下)。内源性蛋白的western blotting分析证实了这些较短形式在CD34+细胞中存在,但在作为低表达对照的K-562白血病细胞系中不存在(图1H,中下)。同种异构体特异性RT-qPCR定量进一步证实,与CD34+细胞和全长l-MLLT3相比,三种较短形式(E6a-E6b, E6a-E6b '和E6a-E6)在K-562细胞中的表达非常低(图1H,右下)。这些western blot结果鉴定了至少两种不同的s-MLLT3蛋白,可能是由位于外显子7的替代翻译起始密码子(AUG2和AUG3)引起的,产生保留c端AHD反活化结构域但缺乏YEATS和Poly-Ser结构域的蛋白(补充图S5)。有趣的是,较短的MLLT3替代转录本在5内含子内启动,这也是急性髓性白血病(AML)中染色体易位的常见位点。具体来说,MLLT3内含子5是t(9;11)染色体易位的主要位点,导致与KMT2A(赖氨酸(K)甲基转移酶2A)[5]融合。KMT2A是一个染色质写入者,它在特定位点上沉积表观遗传标记,表明活性转录,特别是造血所需的HOX基因[6]。由此产生的KMT2A-MLLT3融合转录物编码了一个嵌合蛋白,该蛋白组成了KMT2A的n端三分之一与MLLT3的c端部分融合,该蛋白缺乏YEATS结构域,但保留了AHD反激活结构域(补充图S6)。为了研究AML中的全局MLLT3表达,我们分析了两个RNA-Seq数据集:IUCT-AML[7]和Beat-AML[8](详见补充信息)。与MLLT1相比,MLLT3的表达范围更广,呈现出&gt;120倍的振幅跨样本(图1I)。针对备选(s-MLLT3的E6a-E6 + E6b/b'- e6)和典型(l-MLLT3的E5-E6)外显子连接的异构体特异性k-mers的表达与s-MLLT3和l-MLLT3之间没有相关性。大约20%的样品显示出高水平的s-MLLT3,而在其他样品中,尽管l-MLLT3表达,但s-MLLT3无法检测到(补充图S7)。这一发现表明,在AML中,这两个启动子和/或由此产生的转录本存在差异调控。接下来,我们研究了约20%高s-MLLT3表达的AML样本是否代表一种独特的临床实体。MECOM+(骨髓增生异常综合征1和EVI1复合体位点)或GATA2-MECOM(补充图S8),以及RUNX1(矮子相关转录因子1)和/或TP53(肿瘤蛋白p53)突变的样本显示更高水平的l-MLLT3,甚至更高水平的s-MLLT3(补充图S9,左和中)。与KMT2A-MLLT3易位无相关性。相反,NPM1(核磷蛋白1)突变的样品显示出非常低的两种转录本水平(补充图S9,左和中)。值得注意的是,s-MLLT3或l-MLLT3水平升高与不良的ELN2017 (European LeukemiaNet 2017)评分相关(补充图S9,右)。基于中位数表达的组分离显示,总体MLLT3表达最高的患者生存结果更差(图1J,左)。然而,鉴于AML样本中s-MLLT3和l-MLLT3表达之间缺乏相关性,我们评估了它们对生存的独立影响。同种异构体特异性k-mers显示,s-MLLT3(而非l-MLLT3)表达显著影响不良患者的生存(图1J,中间和右侧面板)。总之,这些发现证明了在MLLT3位点内存在一个内部启动子,驱动编码缺乏酵母染色质解读域的AF9蛋白的5 '端缩短转录本的表达。这些替代转录本在造血干细胞和约20%的急性髓系白血病患者中高度表达,具有最差的生存结果。这些结果表明,考虑到这种缺乏yeats结构域的AF9转录因子的表达,应该重新评估MLLT3位点在hsc和AML中的作用。斯特文森·皮罗尼特写了这份手稿。chlorebessi<e:1>, Ahmed Zamani, Sandra Dailhau, Christian r<s:1>, Marina Bousquet和st<s:1> Pyronnet对研究的设计,构思和数据分析做出了贡献。Ahmed Zamani, Romain Pfeifer和Marina Bousquet设计并进行了生物实验。 chloe bessi<e:1> re、Sandra Dailhau、Camille Marchet、Benoit Guibert、Anthony Boureux、Raïssa Silva Da Silva、Nicolas Gilbert和thvangirise comes开发了基于k-mer的生物信息学工具。Fabienne Meggetto, Christian Touriol和Marina Bousquet对手稿进行了评论和编辑。Christian rcher在艾伯维(Abbvie)、安进(Amgen)、安斯泰来(Astellas)、BMS、勃林格(Boehringer)、爵士制药(Jazz Pharmaceuticals)和施维雅(Servier)担任顾问或顾问职务,并获得了艾伯维(Abbvie)、安进(Amgen)、安斯泰来(Astellas)、BMS、Iqvia和爵士制药(Jazz Pharmaceuticals)的研究资助。所有其他作者声明无利益冲突。这项工作由INSERM、图卢兹癌症大学研究所(IUCT)、Labex Toucan、leucespoir基金会、Ligue ronale contrle Cancer、ARC基金会、Laurette Fugain协会、法国国家研究机构(ANR-18-CE45-0020 Transipedia)和(ANR-22-CE45-0007 all - rna)资助。chlo<s:1> bessi<e:1> re得到法国基金会的支持,Ahmed Zamani和Raïssa Silva Da Silva得到法国国家癌症防治联盟的支持,Romain Pfeifer得到法国最高医学会和国家癌症研究所的支持,Sandra Dailhau得到法国医学会和国家癌症研究所(INCA, PRT-K-2022-184CircOma)的支持。根据法国法律,每一位匿名志愿捐献者或病人都得到了通知,并且已向卫生部申报了hiip收集情况(DC 2008-307)。经当地伦理委员会、西南及外省人身保护委员会(comit<e:1> de Protection des Personnes sudd - ouest et Outremer II)和当地法兰西高等教育研究伦理委员会(图卢兹,法国,协议#21PLER2021-007)批准后,获得了转让协议(AC 2008-129)。临床和生物学注释也已向国家信息和自由委员会(CNIL)申报。这项研究是根据《赫尔辛基宣言》进行的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Cancer Communications
Cancer Communications Biochemistry, Genetics and Molecular Biology-Cancer Research
CiteScore
25.50
自引率
4.30%
发文量
153
审稿时长
4 weeks
期刊介绍: Cancer Communications is an open access, peer-reviewed online journal that encompasses basic, clinical, and translational cancer research. The journal welcomes submissions concerning clinical trials, epidemiology, molecular and cellular biology, and genetics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信