Design of diverse, functional mitochondrial targeting sequences across eukaryotic organisms using variational autoencoder

Aashutosh Girish Boob, Shih-I Tan, Airah Zaidi, Nilmani Singh, Xueyi Xue, Shuaizhen Zhou, Teresa A Martin, Li-Qing Chen, Huimin Zhao
{"title":"Design of diverse, functional mitochondrial targeting sequences across eukaryotic organisms using variational autoencoder","authors":"Aashutosh Girish Boob, Shih-I Tan, Airah Zaidi, Nilmani Singh, Xueyi Xue, Shuaizhen Zhou, Teresa A Martin, Li-Qing Chen, Huimin Zhao","doi":"10.1101/2024.08.28.610205","DOIUrl":null,"url":null,"abstract":"Mitochondria play a key role in energy production and cellular metabolism, making them a promising target for metabolic engineering and disease treatment. However, despite the known influence of passenger proteins on localization efficiency, only a few protein-localization tags have been characterized for mitochondrial targeting. To address this limitation, we exploited Variational Autoencoder (VAE), an unsupervised deep learning framework, to design novel mitochondrial targeting sequences (MTSs). In silico analysis revealed that a high fraction of generated peptides are functional and possess features important for mitochondrial targeting. Additionally, we devised a sampling scheme to indirectly address biases arising from the differences in mitochondrial protein import machinery and characterized artificial MTSs in four eukaryotic organisms. These sequences displayed significant diversity, sharing less than 60% sequence identity with MTSs in the UniProt database. Moreover, we trained a separate VAE and employed latent space interpolation to design dual targeting sequences capable of targeting both mitochondria and chloroplasts, shedding light on their evolutionary origins. As a proof-of-concept, we demonstrate the application of these artificial MTSs in increasing titers of 3-hydroxypropionic acid through pathway compartmentalization and improving 5-aminolevulinate synthase delivery by 1.62-fold and 4.76-fold, respectively. Overall, our work not only demonstrates the potential of generative artificial intelligence in designing novel, functional mitochondrial targeting sequences but also highlights their utility in engineering mitochondria for both fundamental research and practical applications in biology.","PeriodicalId":501408,"journal":{"name":"bioRxiv - Synthetic Biology","volume":"53 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"bioRxiv - Synthetic Biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.08.28.610205","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Mitochondria play a key role in energy production and cellular metabolism, making them a promising target for metabolic engineering and disease treatment. However, despite the known influence of passenger proteins on localization efficiency, only a few protein-localization tags have been characterized for mitochondrial targeting. To address this limitation, we exploited Variational Autoencoder (VAE), an unsupervised deep learning framework, to design novel mitochondrial targeting sequences (MTSs). In silico analysis revealed that a high fraction of generated peptides are functional and possess features important for mitochondrial targeting. Additionally, we devised a sampling scheme to indirectly address biases arising from the differences in mitochondrial protein import machinery and characterized artificial MTSs in four eukaryotic organisms. These sequences displayed significant diversity, sharing less than 60% sequence identity with MTSs in the UniProt database. Moreover, we trained a separate VAE and employed latent space interpolation to design dual targeting sequences capable of targeting both mitochondria and chloroplasts, shedding light on their evolutionary origins. As a proof-of-concept, we demonstrate the application of these artificial MTSs in increasing titers of 3-hydroxypropionic acid through pathway compartmentalization and improving 5-aminolevulinate synthase delivery by 1.62-fold and 4.76-fold, respectively. Overall, our work not only demonstrates the potential of generative artificial intelligence in designing novel, functional mitochondrial targeting sequences but also highlights their utility in engineering mitochondria for both fundamental research and practical applications in biology.
利用变异自动编码器设计真核生物中多样化的功能性线粒体靶向序列
线粒体在能量产生和细胞新陈代谢中发挥着关键作用,使其成为代谢工程和疾病治疗的理想靶标。然而,尽管客体蛋白对定位效率的影响众所周知,但只有少数蛋白质定位标记被鉴定为线粒体靶标。为了解决这一局限性,我们利用无监督深度学习框架变异自动编码器(VAE)设计了新型线粒体靶向序列(MTS)。硅学分析表明,生成的多肽中有很大一部分具有功能性,并具备线粒体靶向的重要特征。此外,我们还设计了一种取样方案,间接解决了线粒体蛋白质导入机制差异所产生的偏差,并对四种真核生物的人工 MTS 进行了表征。这些序列显示出显著的多样性,与 UniProt 数据库中的 MTS 的序列相同度低于 60%。此外,我们还训练了一个单独的VAE,并采用潜空间插值法设计了能够同时靶向线粒体和叶绿体的双靶向序列,从而揭示了它们的进化起源。作为概念验证,我们展示了这些人工 MTS 的应用,它们通过途径区隔提高了 3-hydroxypropionic acid 的滴度,并将 5-aminolevulinate 合成酶的输送分别提高了 1.62 倍和 4.76 倍。总之,我们的工作不仅证明了生成性人工智能在设计新颖、功能性线粒体靶向序列方面的潜力,还强调了它们在线粒体工程学基础研究和生物学实际应用方面的实用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信