{"title":"淀粉样蛋白研究的发展前景。","authors":"Bernardo Bonilauri","doi":"10.1002/prot.70048","DOIUrl":null,"url":null,"abstract":"<p><p>The exponential growth of biomedical and life sciences literature, including research on amyloid biology, has made it increasingly challenging to track new discoveries and gain a comprehensive understanding of the evolution of specific research fields. Advances in natural language models (NLM) and artificial intelligence (AI) approaches now enable large-scale analysis of scientific publications, uncovering hidden patterns and facilitating data-driven insights. Here, a two-dimensional mapping of the global amyloid research landscape is presented, using the transformer-based large language model PubMedBERT, in combination with t-SNE and Latent Dirichlet Allocation (LDA), to analyze more than 140 000 abstracts from the PubMed database. This analysis provides a comprehensive visualization of the amyloid field, capturing key trends such as the historical progression of amyloid research, the emergence of dominant subfields, the distribution of contributing authors and their respective countries, and the identification of latent research topics over time, including chemicals and small molecules. By integrating AI-driven text analysis with large-scale bibliometric data, this study offers a novel perspective on the evolution of amyloid research, facilitating a deeper interdisciplinary understanding. This work serves as a valuable interactive resource for researchers while highlighting the potential of machine learning-driven literature mapping in identifying knowledge gaps and guiding future investigations.</p>","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2025-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"The Evolving Landscape of Amyloid Research.\",\"authors\":\"Bernardo Bonilauri\",\"doi\":\"10.1002/prot.70048\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>The exponential growth of biomedical and life sciences literature, including research on amyloid biology, has made it increasingly challenging to track new discoveries and gain a comprehensive understanding of the evolution of specific research fields. Advances in natural language models (NLM) and artificial intelligence (AI) approaches now enable large-scale analysis of scientific publications, uncovering hidden patterns and facilitating data-driven insights. Here, a two-dimensional mapping of the global amyloid research landscape is presented, using the transformer-based large language model PubMedBERT, in combination with t-SNE and Latent Dirichlet Allocation (LDA), to analyze more than 140 000 abstracts from the PubMed database. This analysis provides a comprehensive visualization of the amyloid field, capturing key trends such as the historical progression of amyloid research, the emergence of dominant subfields, the distribution of contributing authors and their respective countries, and the identification of latent research topics over time, including chemicals and small molecules. By integrating AI-driven text analysis with large-scale bibliometric data, this study offers a novel perspective on the evolution of amyloid research, facilitating a deeper interdisciplinary understanding. This work serves as a valuable interactive resource for researchers while highlighting the potential of machine learning-driven literature mapping in identifying knowledge gaps and guiding future investigations.</p>\",\"PeriodicalId\":56271,\"journal\":{\"name\":\"Proteins-Structure Function and Bioinformatics\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-08-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proteins-Structure Function and Bioinformatics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1002/prot.70048\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proteins-Structure Function and Bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1002/prot.70048","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
The exponential growth of biomedical and life sciences literature, including research on amyloid biology, has made it increasingly challenging to track new discoveries and gain a comprehensive understanding of the evolution of specific research fields. Advances in natural language models (NLM) and artificial intelligence (AI) approaches now enable large-scale analysis of scientific publications, uncovering hidden patterns and facilitating data-driven insights. Here, a two-dimensional mapping of the global amyloid research landscape is presented, using the transformer-based large language model PubMedBERT, in combination with t-SNE and Latent Dirichlet Allocation (LDA), to analyze more than 140 000 abstracts from the PubMed database. This analysis provides a comprehensive visualization of the amyloid field, capturing key trends such as the historical progression of amyloid research, the emergence of dominant subfields, the distribution of contributing authors and their respective countries, and the identification of latent research topics over time, including chemicals and small molecules. By integrating AI-driven text analysis with large-scale bibliometric data, this study offers a novel perspective on the evolution of amyloid research, facilitating a deeper interdisciplinary understanding. This work serves as a valuable interactive resource for researchers while highlighting the potential of machine learning-driven literature mapping in identifying knowledge gaps and guiding future investigations.
期刊介绍:
PROTEINS : Structure, Function, and Bioinformatics publishes original reports of significant experimental and analytic research in all areas of protein research: structure, function, computation, genetics, and design. The journal encourages reports that present new experimental or computational approaches for interpreting and understanding data from biophysical chemistry, structural studies of proteins and macromolecular assemblies, alterations of protein structure and function engineered through techniques of molecular biology and genetics, functional analyses under physiologic conditions, as well as the interactions of proteins with receptors, nucleic acids, or other specific ligands or substrates. Research in protein and peptide biochemistry directed toward synthesizing or characterizing molecules that simulate aspects of the activity of proteins, or that act as inhibitors of protein function, is also within the scope of PROTEINS. In addition to full-length reports, short communications (usually not more than 4 printed pages) and prediction reports are welcome. Reviews are typically by invitation; authors are encouraged to submit proposed topics for consideration.