{"title":"MONTRA2:卫生领域分布式数据库剖析网络平台","authors":"João Rafael Almeida , José Luís Oliveira","doi":"10.1016/j.imu.2024.101447","DOIUrl":null,"url":null,"abstract":"<div><h3>Background:</h3><p>Data catalogues are used in multiple domains to provide an overview of databases’ characteristics without releasing the actual data. Despite the existence of several web-based catalogues, they do not always meet the needs of certain domains. In the healthcare field, they need to give multiple and iterative views to the data, from high-level metadata up to low-level samples or patient data. This approach is critical to help researchers find relevant datasets for their studies.</p></div><div><h3>Methods:</h3><p>In this paper, we present MONTRA2, a web platform for profiling distributed databases. The users’ requirements were designed in the context of the EHDEN European project, in close collaboration with medical researchers, data owners, and pharmaceutical companies, leading to a rich set of functionalities to support databases and cohorts discovery. The platform was developed with a modular architecture which simplifies the integration of internal and external services.</p></div><div><h3>Results:</h3><p>MONTRA2 is successfully being used in several European projects and research initiatives, focused on the dissemination and sharing of biomedical databases. In this paper, we present three health data catalogues that were built upon the core of this framework. MONTRA2 is publicly available under the MIT license at <span>https://github.com/bioinformatics-ua/montra2</span><svg><path></path></svg>.</p></div><div><h3>Conclusions:</h3><p>The execution of federated studies on a large scale and involving multiple centres is only possible if adequate tools for data management and discovery are available. By providing tools for study management, database characterisation and publishing, among others, MONTRA2 simplifies the process of setting up a workspace for a community to expose the characteristics of datasets and provide multiple strategies for data analysis.</p></div>","PeriodicalId":13953,"journal":{"name":"Informatics in Medicine Unlocked","volume":"45 ","pages":"Article 101447"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2352914824000030/pdfft?md5=321e094f8f4fd42cb0d7c13a2baecca8&pid=1-s2.0-S2352914824000030-main.pdf","citationCount":"0","resultStr":"{\"title\":\"MONTRA2: A web platform for profiling distributed databases in the health domain\",\"authors\":\"João Rafael Almeida , José Luís Oliveira\",\"doi\":\"10.1016/j.imu.2024.101447\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background:</h3><p>Data catalogues are used in multiple domains to provide an overview of databases’ characteristics without releasing the actual data. Despite the existence of several web-based catalogues, they do not always meet the needs of certain domains. In the healthcare field, they need to give multiple and iterative views to the data, from high-level metadata up to low-level samples or patient data. This approach is critical to help researchers find relevant datasets for their studies.</p></div><div><h3>Methods:</h3><p>In this paper, we present MONTRA2, a web platform for profiling distributed databases. The users’ requirements were designed in the context of the EHDEN European project, in close collaboration with medical researchers, data owners, and pharmaceutical companies, leading to a rich set of functionalities to support databases and cohorts discovery. The platform was developed with a modular architecture which simplifies the integration of internal and external services.</p></div><div><h3>Results:</h3><p>MONTRA2 is successfully being used in several European projects and research initiatives, focused on the dissemination and sharing of biomedical databases. In this paper, we present three health data catalogues that were built upon the core of this framework. MONTRA2 is publicly available under the MIT license at <span>https://github.com/bioinformatics-ua/montra2</span><svg><path></path></svg>.</p></div><div><h3>Conclusions:</h3><p>The execution of federated studies on a large scale and involving multiple centres is only possible if adequate tools for data management and discovery are available. By providing tools for study management, database characterisation and publishing, among others, MONTRA2 simplifies the process of setting up a workspace for a community to expose the characteristics of datasets and provide multiple strategies for data analysis.</p></div>\",\"PeriodicalId\":13953,\"journal\":{\"name\":\"Informatics in Medicine Unlocked\",\"volume\":\"45 \",\"pages\":\"Article 101447\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2352914824000030/pdfft?md5=321e094f8f4fd42cb0d7c13a2baecca8&pid=1-s2.0-S2352914824000030-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Informatics in Medicine Unlocked\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2352914824000030\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Medicine\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Informatics in Medicine Unlocked","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352914824000030","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0
摘要
背景:数据目录用于多个领域,在不公开实际数据的情况下提供数据库特征概览。尽管存在一些基于网络的目录,但它们并不总能满足某些领域的需求。在医疗保健领域,它们需要提供从高级元数据到低级样本或患者数据的多重迭代数据视图。方法:在本文中,我们介绍了用于剖析分布式数据库的网络平台 MONTRA2。用户的需求是在欧洲 EHDEN 项目的背景下,与医学研究人员、数据所有者和制药公司密切合作设计的,从而产生了支持数据库和队列发现的丰富功能。成果:MONTRA2已成功应用于多个欧洲项目和研究计划,重点关注生物医学数据库的传播和共享。本文介绍了以该框架为核心构建的三个健康数据目录。MONTRA2 在 MIT 许可下公开发布,网址为 https://github.com/bioinformatics-ua/montra2.Conclusions:The 只有提供适当的数据管理和发现工具,才有可能开展大规模、涉及多个中心的联合研究。通过提供研究管理、数据库特征描述和发布等工具,MONTRA2简化了为社区建立工作空间的过程,从而揭示了数据集的特征,并为数据分析提供了多种策略。
MONTRA2: A web platform for profiling distributed databases in the health domain
Background:
Data catalogues are used in multiple domains to provide an overview of databases’ characteristics without releasing the actual data. Despite the existence of several web-based catalogues, they do not always meet the needs of certain domains. In the healthcare field, they need to give multiple and iterative views to the data, from high-level metadata up to low-level samples or patient data. This approach is critical to help researchers find relevant datasets for their studies.
Methods:
In this paper, we present MONTRA2, a web platform for profiling distributed databases. The users’ requirements were designed in the context of the EHDEN European project, in close collaboration with medical researchers, data owners, and pharmaceutical companies, leading to a rich set of functionalities to support databases and cohorts discovery. The platform was developed with a modular architecture which simplifies the integration of internal and external services.
Results:
MONTRA2 is successfully being used in several European projects and research initiatives, focused on the dissemination and sharing of biomedical databases. In this paper, we present three health data catalogues that were built upon the core of this framework. MONTRA2 is publicly available under the MIT license at https://github.com/bioinformatics-ua/montra2.
Conclusions:
The execution of federated studies on a large scale and involving multiple centres is only possible if adequate tools for data management and discovery are available. By providing tools for study management, database characterisation and publishing, among others, MONTRA2 simplifies the process of setting up a workspace for a community to expose the characteristics of datasets and provide multiple strategies for data analysis.
期刊介绍:
Informatics in Medicine Unlocked (IMU) is an international gold open access journal covering a broad spectrum of topics within medical informatics, including (but not limited to) papers focusing on imaging, pathology, teledermatology, public health, ophthalmological, nursing and translational medicine informatics. The full papers that are published in the journal are accessible to all who visit the website.