Victoria A. Higman, Eliza Płoskoń, Gary S. Thompson, Geerten W. Vuister
{"title":"Perspective: on the importance of extensive, high-quality and reliable deposition of biomolecular NMR data in the age of artificial intelligence","authors":"Victoria A. Higman, Eliza Płoskoń, Gary S. Thompson, Geerten W. Vuister","doi":"10.1007/s10858-024-00451-w","DOIUrl":null,"url":null,"abstract":"<div><p>Artificial intelligence (AI) models are revolutionising scientific data analysis but are reliant on large training data sets. While artificial training data can be used in the context of NMR processing and data analysis methods, relating NMR parameters back to protein sequence and structure requires experimental data. In this perspective we examine what the biological NMR community needs to do, in order to store and share its data better so that we can make effective use of AI methods to further our understanding of biological molecules. We argue, first, that the community should be depositing much more of its experimental data. In particular, we should be depositing more spectra and dynamics data. Second, the NMR data deposited needs to capture the full information content required to be able to use and validate it adequately. The NMR Exchange Format (NEF) was designed several years ago to do this. The widespread adoption of NEF combined with a new proposal for dynamics data specifications come at the right time for the community to expand its deposition of data. Third, we highlight the importance of expanding and safeguarding our experimental data repository, the Biological Magnetic Resonance Data Bank (BMRB), not only in the interests of NMR spectroscopists, but biological scientists more widely. With this article we invite others in the biological NMR community to champion increased (possibly mandatory) data deposition, to get involved in designing new NEF specifications, and to advocate on behalf of the BMRB within the wider scientific community.</p></div>","PeriodicalId":613,"journal":{"name":"Journal of Biomolecular NMR","volume":"78 4","pages":"193 - 197"},"PeriodicalIF":1.3000,"publicationDate":"2024-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10858-024-00451-w.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Biomolecular NMR","FirstCategoryId":"99","ListUrlMain":"https://link.springer.com/article/10.1007/s10858-024-00451-w","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Artificial intelligence (AI) models are revolutionising scientific data analysis but are reliant on large training data sets. While artificial training data can be used in the context of NMR processing and data analysis methods, relating NMR parameters back to protein sequence and structure requires experimental data. In this perspective we examine what the biological NMR community needs to do, in order to store and share its data better so that we can make effective use of AI methods to further our understanding of biological molecules. We argue, first, that the community should be depositing much more of its experimental data. In particular, we should be depositing more spectra and dynamics data. Second, the NMR data deposited needs to capture the full information content required to be able to use and validate it adequately. The NMR Exchange Format (NEF) was designed several years ago to do this. The widespread adoption of NEF combined with a new proposal for dynamics data specifications come at the right time for the community to expand its deposition of data. Third, we highlight the importance of expanding and safeguarding our experimental data repository, the Biological Magnetic Resonance Data Bank (BMRB), not only in the interests of NMR spectroscopists, but biological scientists more widely. With this article we invite others in the biological NMR community to champion increased (possibly mandatory) data deposition, to get involved in designing new NEF specifications, and to advocate on behalf of the BMRB within the wider scientific community.
期刊介绍:
The Journal of Biomolecular NMR provides a forum for publishing research on technical developments and innovative applications of nuclear magnetic resonance spectroscopy for the study of structure and dynamic properties of biopolymers in solution, liquid crystals, solids and mixed environments, e.g., attached to membranes. This may include:
Three-dimensional structure determination of biological macromolecules (polypeptides/proteins, DNA, RNA, oligosaccharides) by NMR.
New NMR techniques for studies of biological macromolecules.
Novel approaches to computer-aided automated analysis of multidimensional NMR spectra.
Computational methods for the structural interpretation of NMR data, including structure refinement.
Comparisons of structures determined by NMR with those obtained by other methods, e.g. by diffraction techniques with protein single crystals.
New techniques of sample preparation for NMR experiments (biosynthetic and chemical methods for isotope labeling, preparation of nutrients for biosynthetic isotope labeling, etc.). An NMR characterization of the products must be included.