{"title":"Chasing the Ghost Proteome in the Dark matter.","authors":"Tristan Cardon, Isabelle Fournier, Michel Salzet","doi":"10.1016/j.mcpro.2025.101076","DOIUrl":null,"url":null,"abstract":"<p><p>Emerging evidence shows that translation from non-canonical open reading frames (ORFs) produces a diverse set of biologically active proteins. These ORFs reside in 5' and 3' untranslated regions, long non-coding RNAs, overlapping frames within annotated genes (dual coding), pseudogenes, and can initiate at non-AUG start codons. The resulting products, variously termed microproteins, small proteins (smPROTs), small ORF-encoded peptides (SEPs), and alternative proteins (AltProts), modulate fundamental cellular processes, including metabolic flux and epigenetic regulation. We consolidate these entities under the umbrella of the ghost proteome, a functional proteome arising from the genome's presumed \"dark matter.\" This concept is distinct from the dark proteome, which refers to regions of canonical proteins lacking structural, functional, or experimental annotation and is not necessarily derived from non-canonical loci. Recognizing the ghost proteome expands the boundary of what is considered protein-coding, demands harmonized nomenclature and database integration, and motivates systematic discovery and functional characterization. By reframing sequences once dismissed as non-coding or \"junk,\" the ghost proteome compels a re-evaluation of genome annotation and reveals new opportunities to interrogate biology and disease.</p>","PeriodicalId":18712,"journal":{"name":"Molecular & Cellular Proteomics","volume":" ","pages":"101076"},"PeriodicalIF":5.5000,"publicationDate":"2025-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular & Cellular Proteomics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.mcpro.2025.101076","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Emerging evidence shows that translation from non-canonical open reading frames (ORFs) produces a diverse set of biologically active proteins. These ORFs reside in 5' and 3' untranslated regions, long non-coding RNAs, overlapping frames within annotated genes (dual coding), pseudogenes, and can initiate at non-AUG start codons. The resulting products, variously termed microproteins, small proteins (smPROTs), small ORF-encoded peptides (SEPs), and alternative proteins (AltProts), modulate fundamental cellular processes, including metabolic flux and epigenetic regulation. We consolidate these entities under the umbrella of the ghost proteome, a functional proteome arising from the genome's presumed "dark matter." This concept is distinct from the dark proteome, which refers to regions of canonical proteins lacking structural, functional, or experimental annotation and is not necessarily derived from non-canonical loci. Recognizing the ghost proteome expands the boundary of what is considered protein-coding, demands harmonized nomenclature and database integration, and motivates systematic discovery and functional characterization. By reframing sequences once dismissed as non-coding or "junk," the ghost proteome compels a re-evaluation of genome annotation and reveals new opportunities to interrogate biology and disease.
期刊介绍:
The mission of MCP is to foster the development and applications of proteomics in both basic and translational research. MCP will publish manuscripts that report significant new biological or clinical discoveries underpinned by proteomic observations across all kingdoms of life. Manuscripts must define the biological roles played by the proteins investigated or their mechanisms of action.
The journal also emphasizes articles that describe innovative new computational methods and technological advancements that will enable future discoveries. Manuscripts describing such approaches do not have to include a solution to a biological problem, but must demonstrate that the technology works as described, is reproducible and is appropriate to uncover yet unknown protein/proteome function or properties using relevant model systems or publicly available data.
Scope:
-Fundamental studies in biology, including integrative "omics" studies, that provide mechanistic insights
-Novel experimental and computational technologies
-Proteogenomic data integration and analysis that enable greater understanding of physiology and disease processes
-Pathway and network analyses of signaling that focus on the roles of post-translational modifications
-Studies of proteome dynamics and quality controls, and their roles in disease
-Studies of evolutionary processes effecting proteome dynamics, quality and regulation
-Chemical proteomics, including mechanisms of drug action
-Proteomics of the immune system and antigen presentation/recognition
-Microbiome proteomics, host-microbe and host-pathogen interactions, and their roles in health and disease
-Clinical and translational studies of human diseases
-Metabolomics to understand functional connections between genes, proteins and phenotypes