NeuroBridge: a prototype platform for discovery of the long-tail neuroimaging data.

IF 2.5 4区 医学 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY
Frontiers in Neuroinformatics Pub Date : 2023-08-31 eCollection Date: 2023-01-01 DOI:10.3389/fninf.2023.1215261
Lei Wang, José Luis Ambite, Abhishek Appaji, Janine Bijsterbosch, Jerome Dockes, Rick Herrick, Alex Kogan, Howard Lander, Daniel Marcus, Stephen M Moore, Jean-Baptiste Poline, Arcot Rajasekar, Satya S Sahoo, Matthew D Turner, Xiaochen Wang, Yue Wang, Jessica A Turner
{"title":"NeuroBridge: a prototype platform for discovery of the long-tail neuroimaging data.","authors":"Lei Wang,&nbsp;José Luis Ambite,&nbsp;Abhishek Appaji,&nbsp;Janine Bijsterbosch,&nbsp;Jerome Dockes,&nbsp;Rick Herrick,&nbsp;Alex Kogan,&nbsp;Howard Lander,&nbsp;Daniel Marcus,&nbsp;Stephen M Moore,&nbsp;Jean-Baptiste Poline,&nbsp;Arcot Rajasekar,&nbsp;Satya S Sahoo,&nbsp;Matthew D Turner,&nbsp;Xiaochen Wang,&nbsp;Yue Wang,&nbsp;Jessica A Turner","doi":"10.3389/fninf.2023.1215261","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Open science initiatives have enabled sharing of large amounts of already collected data. However, significant gaps remain regarding how to find appropriate data, including underutilized data that exist in the long tail of science. We demonstrate the NeuroBridge prototype and its ability to search PubMed Central full-text papers for information relevant to neuroimaging data collected from schizophrenia and addiction studies.</p><p><strong>Methods: </strong>The NeuroBridge architecture contained the following components: (1) Extensible ontology for modeling study metadata: subject population, imaging techniques, and relevant behavioral, cognitive, or clinical data. Details are described in the companion paper in this special issue; (2) A natural-language based document processor that leveraged pre-trained deep-learning models on a small-sample document corpus to establish efficient representations for each article as a collection of machine-recognized ontological terms; (3) Integrated search using ontology-driven similarity to query PubMed Central and NeuroQuery, which provides fMRI activation maps along with PubMed source articles.</p><p><strong>Results: </strong>The NeuroBridge prototype contains a corpus of 356 papers from 2018 to 2021 describing schizophrenia and addiction neuroimaging studies, of which 186 were annotated with the NeuroBridge ontology. The search portal on the NeuroBridge website https://neurobridges.org/ provides an interactive Query Builder, where the user builds queries by selecting NeuroBridge ontology terms to preserve the ontology tree structure. For each return entry, links to the PubMed abstract as well as to the PMC full-text article, if available, are presented. For each of the returned articles, we provide a list of clinical assessments described in the Section \"Methods\" of the article. Articles returned from NeuroQuery based on the same search are also presented.</p><p><strong>Conclusion: </strong>The NeuroBridge prototype combines ontology-based search with natural-language text-mining approaches to demonstrate that papers relevant to a user's research question can be identified. The NeuroBridge prototype takes a first step toward identifying potential neuroimaging data described in full-text papers. Toward the overall goal of discovering \"enough data of the right kind,\" ongoing work includes validating the document processor with a larger corpus, extending the ontology to include detailed imaging data, and extracting information regarding data availability from the returned publications and incorporating XNAT-based neuroimaging databases to enhance data accessibility.</p>","PeriodicalId":12462,"journal":{"name":"Frontiers in Neuroinformatics","volume":"17 ","pages":"1215261"},"PeriodicalIF":2.5000,"publicationDate":"2023-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10500076/pdf/","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Neuroinformatics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3389/fninf.2023.1215261","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 1

Abstract

Introduction: Open science initiatives have enabled sharing of large amounts of already collected data. However, significant gaps remain regarding how to find appropriate data, including underutilized data that exist in the long tail of science. We demonstrate the NeuroBridge prototype and its ability to search PubMed Central full-text papers for information relevant to neuroimaging data collected from schizophrenia and addiction studies.

Methods: The NeuroBridge architecture contained the following components: (1) Extensible ontology for modeling study metadata: subject population, imaging techniques, and relevant behavioral, cognitive, or clinical data. Details are described in the companion paper in this special issue; (2) A natural-language based document processor that leveraged pre-trained deep-learning models on a small-sample document corpus to establish efficient representations for each article as a collection of machine-recognized ontological terms; (3) Integrated search using ontology-driven similarity to query PubMed Central and NeuroQuery, which provides fMRI activation maps along with PubMed source articles.

Results: The NeuroBridge prototype contains a corpus of 356 papers from 2018 to 2021 describing schizophrenia and addiction neuroimaging studies, of which 186 were annotated with the NeuroBridge ontology. The search portal on the NeuroBridge website https://neurobridges.org/ provides an interactive Query Builder, where the user builds queries by selecting NeuroBridge ontology terms to preserve the ontology tree structure. For each return entry, links to the PubMed abstract as well as to the PMC full-text article, if available, are presented. For each of the returned articles, we provide a list of clinical assessments described in the Section "Methods" of the article. Articles returned from NeuroQuery based on the same search are also presented.

Conclusion: The NeuroBridge prototype combines ontology-based search with natural-language text-mining approaches to demonstrate that papers relevant to a user's research question can be identified. The NeuroBridge prototype takes a first step toward identifying potential neuroimaging data described in full-text papers. Toward the overall goal of discovering "enough data of the right kind," ongoing work includes validating the document processor with a larger corpus, extending the ontology to include detailed imaging data, and extracting information regarding data availability from the returned publications and incorporating XNAT-based neuroimaging databases to enhance data accessibility.

Abstract Image

Abstract Image

Abstract Image

NeuroBridge:发现长尾神经影像数据的原型平台。
引言:开放科学举措使大量已经收集的数据得以共享。然而,在如何找到适当的数据方面仍然存在重大差距,包括科学长尾中存在的未充分利用的数据。我们展示了NeuroBridge原型及其在PubMed Central全文论文中搜索从精神分裂症和成瘾研究中收集的神经成像数据相关信息的能力。方法:NeuroBridge体系结构包含以下组成部分:(1)用于建模研究元数据的可扩展本体:受试者群体、成像技术和相关的行为、认知或临床数据。本期特刊的配套文件对细节进行了说明;(2) 一种基于自然语言的文档处理器,利用小样本文档语料库上预先训练的深度学习模型,为每一篇文章建立有效的表示,作为机器识别的本体论术语的集合;(3) 集成搜索,使用本体驱动的相似性来查询PubMed Central和NeuroQuery,后者提供功能磁共振成像激活图以及PubMed源文章。结果:NeuroBridge原型包含2018年至2021年356篇描述精神分裂症和成瘾神经影像学研究的论文,其中186篇用NeuroBridge本体论进行了注释。NeuroBridge网站上的搜索门户https://neurobridges.org/提供了一个交互式查询生成器,其中用户通过选择NeuroBridge本体术语来构建查询,以保留本体树结构。对于每个返回条目,都会显示PubMed摘要以及PMC全文文章的链接(如果可用)。对于每一篇返回的文章,我们提供了文章“方法”一节中描述的临床评估列表。还介绍了基于相同搜索从NeuroQuery返回的文章。结论:NeuroBridge原型将基于本体的搜索与自然语言文本挖掘方法相结合,证明可以识别与用户研究问题相关的论文。NeuroBridge原型迈出了识别全文论文中描述的潜在神经成像数据的第一步。为了实现发现“足够的正确类型的数据”的总体目标,正在进行的工作包括用更大的语料库验证文档处理器,扩展本体以包括详细的成像数据,从返回的出版物中提取有关数据可用性的信息,并合并基于XNAT的神经成像数据库以增强数据的可访问性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Frontiers in Neuroinformatics
Frontiers in Neuroinformatics MATHEMATICAL & COMPUTATIONAL BIOLOGY-NEUROSCIENCES
CiteScore
4.80
自引率
5.70%
发文量
132
审稿时长
14 weeks
期刊介绍: Frontiers in Neuroinformatics publishes rigorously peer-reviewed research on the development and implementation of numerical/computational models and analytical tools used to share, integrate and analyze experimental data and advance theories of the nervous system functions. Specialty Chief Editors Jan G. Bjaalie at the University of Oslo and Sean L. Hill at the École Polytechnique Fédérale de Lausanne are supported by an outstanding Editorial Board of international experts. This multidisciplinary open-access journal is at the forefront of disseminating and communicating scientific knowledge and impactful discoveries to researchers, academics and the public worldwide. Neuroscience is being propelled into the information age as the volume of information explodes, demanding organization and synthesis. Novel synthesis approaches are opening up a new dimension for the exploration of the components of brain elements and systems and the vast number of variables that underlie their functions. Neural data is highly heterogeneous with complex inter-relations across multiple levels, driving the need for innovative organizing and synthesizing approaches from genes to cognition, and covering a range of species and disease states. Frontiers in Neuroinformatics therefore welcomes submissions on existing neuroscience databases, development of data and knowledge bases for all levels of neuroscience, applications and technologies that can facilitate data sharing (interoperability, formats, terminologies, and ontologies), and novel tools for data acquisition, analyses, visualization, and dissemination of nervous system data. Our journal welcomes submissions on new tools (software and hardware) that support brain modeling, and the merging of neuroscience databases with brain models used for simulation and visualization.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信