Nature Machine Intelligence: Latest Articles

Integrated structure prediction of protein–protein docking with experimental restraints using ColabDock
IF 18.8, Zone 1, Computer Science
Nature Machine Intelligence Pub Date : 2024-08-05 DOI: 10.1038/s42256-024-00873-z
Shihao Feng, Zhenyu Chen, Chengwei Zhang, Yuhao Xie, Sergey Ovchinnikov, Yi Qin Gao, Sirui Liu
Abstract: Protein complex structure prediction plays important roles in various applications, such as drug discovery and antibody design. However, due to limited prediction accuracy, there are frequent inconsistencies between the predictions and the experiments. Here we present ColabDock, a general framework adapting deep learning structure prediction models to integrate experimental restraints of different forms and sources without further large-scale retraining or fine tuning. With a generation–prediction architecture and trained ranking model, ColabDock outperforms HADDOCK and ClusPro using AlphaFold2 as the structure prediction model, not only in complex structure predictions with simulated residue and surface restraints but also in those assisted by nuclear magnetic resonance chemical shift perturbation as well as covalent labelling. It also assists antibody–antigen interface prediction with emulated interface scan restraints, which could be obtained by experiments such as deep mutational scanning. As a unified framework, we hope that ColabDock can help to bridge the gap between experimental and computational protein science.
Despite rapid developments in predicting the complex structures of proteins, there are still inconsistencies between predictions and experiments. Feng et al. developed ColabDock, a general framework for deep learning models that integrates various experimental restraints and improves complex interface prediction, including antibody–antigen interactions.
Volume 6, issue 8, pages 924-935.
Citations: 0
Author Correction: A 5′ UTR language model for decoding untranslated regions of mRNA and function predictions
IF 18.8, Zone 1, Computer Science
Nature Machine Intelligence Pub Date : 2024-08-02 DOI: 10.1038/s42256-024-00890-y
Yanyi Chu, Dan Yu, Yupeng Li, Kaixuan Huang, Yue Shen, Le Cong, Jason Zhang, Mengdi Wang
Volume 6, issue 8, page 988.
Open-access PDF: https://www.nature.com/articles/s42256-024-00890-y.pdf
Citations: 0
Deep learning prediction of glycopeptide tandem mass spectra powers glycoproteomics
IF 18.8, Zone 1, Computer Science
Nature Machine Intelligence Pub Date : 2024-07-30 DOI: 10.1038/s42256-024-00875-x
Yu Zong, Yuxin Wang, Xipeng Qiu, Xuanjing Huang, Liang Qiao
Abstract: Protein glycosylation, a post-translational modification of proteins by glycans, plays an important role in numerous physiological and pathological cellular functions. Glycoproteomics, the study of protein glycosylation on a proteome-wide scale, utilizes liquid chromatography coupled with tandem mass spectrometry (MS/MS) to get combinational information on glycosylation site, glycosylation level and glycan structure. However, current database searching methods for glycoproteomics often struggle with glycan structure determination due to the limited occurrence of structure-determining ions. Although spectral searching methods can leverage fragment intensity to facilitate the structure identification of glycopeptides, their application is hindered by difficulties in spectral library construction. In this work, we present DeepGP, a hybrid deep learning framework based on transformer and graph neural networks, for the prediction of MS/MS spectra and retention time of glycopeptides. Two graph neural network modules are employed to capture the branched glycan structure and predict glycan ion intensity, respectively. Additionally, a pretraining strategy is implemented to alleviate the insufficiency of glycoproteomics data. Testing on multiple biological datasets, DeepGP accurately predicts MS/MS spectra and retention time of glycopeptides, closely aligning with the experimental results. Comprehensive benchmarking of DeepGP on synthetic and biological datasets validates its effectiveness in distinguishing similar glycans. Based on various decoy methods, DeepGP in combination with database searching can increase glycopeptide detection sensitivity. We anticipate that DeepGP can inspire extensive future work in glycoproteomics.
Glycosylation, a prevalent type of post-translational modification of proteins by glycan molecules, plays a major role in the proteome. Zong et al. present DeepGP, a hybrid deep learning framework based on transformer and graph neural network architectures that accurately predicts tandem mass spectra and retention times of glycopeptides, providing information on glycosylation and glycan structure.
Volume 6, issue 8, pages 950-961.
Citations: 0
Advanced AI assistants that act on our behalf may not be ethically or legally feasible
IF 18.8, Zone 1, Computer Science
Nature Machine Intelligence Pub Date : 2024-07-29 DOI: 10.1038/s42256-024-00877-9
Silvia Milano, Sven Nyholm
Volume 6, issue 8, pages 846-847.
Citations: 0
A question of trust for AI research in medicine
IF 18.8, Zone 1, Computer Science
Nature Machine Intelligence Pub Date : 2024-07-24 DOI: 10.1038/s42256-024-00880-0
Abstract: Medical research is one of the most impactful areas for machine learning applications, but access to large and diverse health datasets is needed for models to be useful. Winning trust from patients by demonstrating that data are handled securely and effectively is key.
Volume 6, issue 7, page 739.
Open-access PDF: https://www.nature.com/articles/s42256-024-00880-0.pdf
Citations: 0
DNA language model GROVER learns sequence context in the human genome
IF 18.8, Zone 1, Computer Science
Nature Machine Intelligence Pub Date : 2024-07-23 DOI: 10.1038/s42256-024-00872-0
Melissa Sanabria, Jonas Hirsch, Pierre M. Joubert, Anna R. Poetsch
Abstract: Deep-learning models that learn a sense of language on DNA have achieved a high level of performance on genome biological tasks. Genome sequences follow rules similar to natural language but are distinct in the absence of a concept of words. We established byte-pair encoding on the human genome and trained a foundation language model called GROVER (Genome Rules Obtained Via Extracted Representations) with the vocabulary selected via a custom task, next-k-mer prediction. The defined dictionary of tokens in the human genome carries best the information content for GROVER. Analysing learned representations, we observed that trained token embeddings primarily encode information related to frequency, sequence content and length. Some tokens are primarily localized in repeats, whereas the majority widely distribute over the genome. GROVER also learns context and lexical ambiguity. Average trained embeddings of genomic regions relate to functional genomics annotation and thus indicate learning of these structures purely from the contextual relationships of tokens. This highlights the extent of information content encoded by the sequence that can be grasped by GROVER. On fine-tuning tasks addressing genome biology with questions of genome element identification and protein–DNA binding, GROVER exceeds other models’ performance. GROVER learns sequence context, a sense for structure and language rules. Extracting this knowledge can be used to compose a grammar book for the code of life.
Genomes can be modelled with language approaches by treating nucleotide bases A, C, G and T like text, but there is no natural concept of what the words would be and whether there is even a ‘language’ to be learned this way. Sanabria et al. have developed a language model called GROVER that learns with a ‘vocabulary’ of genome sequences with byte-pair encoding, a method from text compression, and shows good performance on genome biological tasks.
Volume 6, issue 8, pages 911-923.
Open-access PDF: https://www.nature.com/articles/s42256-024-00872-0.pdf
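The GROVER abstract above mentions byte-pair encoding applied to nucleotide sequences. As a rough illustration of that general technique only (not the authors' implementation), the sketch below learns merges on a toy DNA string. The function name `learn_bpe`, the tie-breaking rule and the toy input are assumptions made for this example; GROVER's actual vocabulary selection via next-k-mer prediction is not shown.

```python
from collections import Counter


def learn_bpe(sequence, num_merges):
    """Learn byte-pair-encoding merges on one nucleotide string (illustrative sketch).

    Returns (tokens, merges): the tokenized sequence and the ordered merge list.
    """
    tokens = list(sequence)  # start from single nucleotides
    merges = []
    for _ in range(num_merges):
        # Count every adjacent token pair in the current tokenization.
        pairs = Counter(zip(tokens, tokens[1:]))
        if not pairs:
            break
        # Pick the most frequent pair; ties are broken lexicographically (an assumption).
        best = min(pairs, key=lambda p: (-pairs[p], p))
        merges.append(best)
        # Rewrite the sequence, fusing each non-overlapping occurrence of the pair.
        merged, i = [], 0
        while i < len(tokens):
            if i + 1 < len(tokens) and (tokens[i], tokens[i + 1]) == best:
                merged.append(tokens[i] + tokens[i + 1])
                i += 2
            else:
                merged.append(tokens[i])
                i += 1
        tokens = merged
    return tokens, merges
```

For example, `learn_bpe("ACGACGACG", 2)` first merges `A`+`C` and then `AC`+`G`, leaving three `ACG` tokens. Each merge adds one entry to the vocabulary, so the number of merges controls vocabulary size.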
Citations: 0
Partial-convolution-implemented generative adversarial network for global oceanic data assimilation
IF 18.8, Zone 1, Computer Science
Nature Machine Intelligence Pub Date : 2024-07-22 DOI: 10.1038/s42256-024-00867-x
Yoo-Geun Ham, Yong-Sik Joo, Jeong-Hwan Kim, Jeong-Gil Lee
Abstract: The oceanic data assimilation (DA) system has been developed to optimally combine numerical-model predictions with actual measurements from the ocean to create the best estimates of current ocean conditions and their uncertainties, improving our ability to forecast and understand the global climate variations. We developed DeepDA, a global oceanic DA system using deep learning, by integrating a partial convolutional neural network and a generative adversarial network. Partial convolution serves as an observation operator, mapping irregular observational data onto gridded fields, while generative adversarial network incorporates observational information from previous time frames. Our observing system simulation experiments, using simulated observations for the DA, revealed that DeepDA markedly reduces analysis error of the oceanic temperature, outperforming both background and observed values. DeepDA’s real-case global temperature reanalysis spanning from 1981 to 2020 accurately reconstructs observed global climatological temperature fields, along with their seasonal cycles, major oceanic temperature variabilities and global warming trend. Developed solely with a long-term control simulation, DeepDA lowers technical hurdles in creating global ocean reanalysis datasets using multiple numerical models’ physical constraints, thereby diminishing systematic uncertainties in estimating global oceanic states over decades with these models.
Data assimilation (DA) techniques are commonly used to assess global Earth system variability but require considerable computational resources and struggle to handle sparse observational data. Ham and colleagues introduce a partial convolution and generative adversarial network-based global oceanic DA system and successfully reconstruct the observed global temperature in a real case study with smaller computational costs than traditional DA systems.
Volume 6, issue 7, pages 834-843.
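The DeepDA abstract above describes partial convolution as a way to map irregular observations onto a grid. The sketch below shows the generic partial-convolution operation as it is commonly defined in the image inpainting literature: only observed cells contribute, and the result is rescaled by the fraction of the window that was observed. It is a single-channel, bias-free, pure-Python illustration; the function name `partial_conv2d` and the toy inputs are assumptions, and how DeepDA configures its layers is not stated in the abstract.

```python
def partial_conv2d(field, mask, kernel):
    """Single-channel partial convolution without padding (illustrative sketch).

    field  : 2D list of values (ignored wherever mask is 0)
    mask   : 2D list, 1 where an observation exists, 0 elsewhere
    kernel : 2D list of weights
    Returns (output, updated_mask) as 2D lists.
    """
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(field) - kh + 1
    out_w = len(field[0]) - kw + 1
    out = [[0.0] * out_w for _ in range(out_h)]
    new_mask = [[0] * out_w for _ in range(out_h)]
    for i in range(out_h):
        for j in range(out_w):
            weighted_sum = 0.0
            valid = 0
            for di in range(kh):
                for dj in range(kw):
                    if mask[i + di][j + dj]:
                        weighted_sum += kernel[di][dj] * field[i + di][j + dj]
                        valid += 1
            if valid:
                # Rescale by window size / number of observed cells in the window.
                out[i][j] = weighted_sum * (kh * kw) / valid
                # A window with at least one observation counts as filled.
                new_mask[i][j] = 1
    return out, new_mask
```

With a 3×3 averaging kernel and a constant field of 2.0 observed at only three cells, the output is approximately 2.0: the rescaling compensates for the missing cells, which is what lets the operation handle sparse observations.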
Citations: 0
A transformer-based weakly supervised computational pathology method for clinical-grade diagnosis and molecular marker discovery of gliomas
IF 18.8, Zone 1, Computer Science
Nature Machine Intelligence Pub Date : 2024-07-18 DOI: 10.1038/s42256-024-00868-w
Rui Jiang, Xiaoxu Yin, Pengshuai Yang, Lingchao Cheng, Juan Hu, Jiao Yang, Ying Wang, Xiaodan Fu, Li Shang, Liling Li, Wei Lin, Huan Zhou, Fufeng Chen, Xuegong Zhang, Zhongliang Hu, Hairong Lv
Abstract: The complex diagnostic criteria for gliomas pose great challenges for making accurate diagnoses with computational pathology methods. There are no in-depth analyses of the accuracy, reliability and auxiliary capability of present approaches from a clinical perspective. Previous studies have overlooked the exploration of molecular and morphological correlations. To overcome these limitations, we propose ROAM, a multiple-instance learning model based on large regions of interest and a pyramid transformer. ROAM enlarges regions of interest to facilitate the consideration of tissue contexts. It utilizes the pyramid transformer to model both intrascale and interscale correlations of morphological features and leverages class-specific multiple-instance learning based on attention to extract slide-level visual representations that can be used to diagnose gliomas. Through comprehensive experiments on both in-house and external glioma datasets, we demonstrate that ROAM can automatically capture key morphological features consistent with the experience of pathologists and thus provide accurate, reliable and adaptable clinical-grade diagnoses of gliomas. Moreover, ROAM has clinical value for auxiliary diagnoses and could pave the way for the study of molecular and morphological correlations.
ROAM, based on large regions of interest and a pyramid transformer, can automatically capture key morphological features consistent with the experience of pathologists to provide accurate, reliable and adaptable clinical-grade diagnoses of gliomas while advancing the discovery of molecular and morphological markers related to glioma diagnosis.
Volume 6, issue 8, pages 876-891.
Citations: 0
Automated construction of cognitive maps with visual predictive coding
IF 18.8, Zone 1, Computer Science
Nature Machine Intelligence Pub Date : 2024-07-18 DOI: 10.1038/s42256-024-00863-1
James Gornet, Matt Thomson
Abstract: Humans construct internal cognitive maps of their environment directly from sensory inputs without access to a system of explicit coordinates or distance measurements. Although machine learning algorithms like simultaneous localization and mapping utilize specialized inference procedures to identify visual features and construct spatial maps from visual and odometry data, the general nature of cognitive maps in the brain suggests a unified mapping algorithmic strategy that can generalize to auditory, tactile and linguistic inputs. Here we demonstrate that predictive coding provides a natural and versatile neural network algorithm for constructing spatial maps using sensory data. We introduce a framework in which an agent navigates a virtual environment while engaging in visual predictive coding using a self-attention-equipped convolutional neural network. While learning a next-image prediction task, the agent automatically constructs an internal representation of the environment that quantitatively reflects spatial distances. The internal map enables the agent to pinpoint its location relative to landmarks using only visual information. The predictive coding network generates a vectorized encoding of the environment that supports vector navigation, where individual latent space units delineate localized, overlapping neighbourhoods in the environment. Broadly, our work introduces predictive coding as a unified algorithmic framework for constructing cognitive maps that can naturally extend to the mapping of auditory, sensorimotor and linguistic inputs.
Constructing spatial maps from sensory inputs is challenging in both neuroscience and artificial intelligence. Gornet and Thomson show that as an agent navigates an environment, a self-attention neural network using predictive coding can recover the environment’s map in its latent space.
Volume 6, issue 7, pages 820-833.
Open-access PDF: https://www.nature.com/articles/s42256-024-00863-1.pdf
Citations: 0
The need for reproducible research in soft robotics
IF 18.8, Zone 1, Computer Science
Nature Machine Intelligence Pub Date : 2024-07-17 DOI: 10.1038/s42256-024-00869-9
Robert Baines, Dylan Shah, Jeremy Marvel, Jennifer Case, Andrew Spielberg
Volume 6, issue 7, pages 740-741.
Citations: 0