火焰喷雾热解过程的语义数据表示集成数据管道。

Q2 Pharmacology, Toxicology and Pharmaceutics

F1000Research Pub Date : 2025-05-09 eCollection Date: 2025-01-01 DOI:10.12688/f1000research.161252.2

Manuel Vollbrecht, Keno Krieger, Jannis Grundmann, Henk Birkholz, Norbert Riefler, Lutz Mädler

{"title":"火焰喷雾热解过程的语义数据表示集成数据管道。","authors":"Manuel Vollbrecht, Keno Krieger, Jannis Grundmann, Henk Birkholz, Norbert Riefler, Lutz Mädler","doi":"10.12688/f1000research.161252.2","DOIUrl":null,"url":null,"abstract":"Ongoing digitalization and data-driven developments in materials science and engineering (MSE) emphasize the growing importance of reusing research data and enabling machine accessibility, which requires robust data management and consistent semantic data representation. Ontologies have emerged as powerful tools for establishing interoperable and reusable data structures from inconsistent data structures. Despite advancements in semantic data representation for specific applications, integrating application ontologies with primary data repositories, such as electronic lab notebooks (ELNs), to feed world data remains an open task. As a use case in the MSE domain, this work presents a system based on semantic technologies from the point of view of engineers, developed with the help of information scientists, and unraveled on a small scale. The development of an application ontology (AO) was elaborated for flame spray pyrolysis (FSP) processes with the implementation of a data pipeline. The proposed FSP application ontology emerges from experimental in-house best-practice procedures and is adapted to the mid-level Project Material Digital core ontology (PMDco) to allow interoperability within the MSE domain. The pipeline retrieves manually acquired experimental data from an ELN, translates it into a machine-actionable format, and converts it into a Resource Description Framework (RDF) format to support semantic interoperability. The latter was stored in a triple store with a SPARQL interface, enabling findable and accessible datasets that are searchable and traceable. By creating semantically linked data structures in line with FAIR principles, this approach allows traceable and findable experimental results between stakeholders through both human-readable and machine-actionable formats. Seamless integration of the modular microservices of the data pipeline within established lab practices minimizes disruption while maintaining the software framework. The present work demonstrates the practical implementation of a FAIR data pipeline within a laboratory setting, paving the way for future data-centric science.","PeriodicalId":12260,"journal":{"name":"F1000Research","volume":"14 ","pages":"173"},"PeriodicalIF":0.0000,"publicationDate":"2025-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12120422/pdf/","citationCount":"0","resultStr":"{\"title\":\"An integrated data pipeline for semantic data representation of the flame spray pyrolysis process.\",\"authors\":\"Manuel Vollbrecht, Keno Krieger, Jannis Grundmann, Henk Birkholz, Norbert Riefler, Lutz Mädler\",\"doi\":\"10.12688/f1000research.161252.2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Ongoing digitalization and data-driven developments in materials science and engineering (MSE) emphasize the growing importance of reusing research data and enabling machine accessibility, which requires robust data management and consistent semantic data representation. Ontologies have emerged as powerful tools for establishing interoperable and reusable data structures from inconsistent data structures. Despite advancements in semantic data representation for specific applications, integrating application ontologies with primary data repositories, such as electronic lab notebooks (ELNs), to feed world data remains an open task. As a use case in the MSE domain, this work presents a system based on semantic technologies from the point of view of engineers, developed with the help of information scientists, and unraveled on a small scale. The development of an application ontology (AO) was elaborated for flame spray pyrolysis (FSP) processes with the implementation of a data pipeline. The proposed FSP application ontology emerges from experimental in-house best-practice procedures and is adapted to the mid-level Project Material Digital core ontology (PMDco) to allow interoperability within the MSE domain. The pipeline retrieves manually acquired experimental data from an ELN, translates it into a machine-actionable format, and converts it into a Resource Description Framework (RDF) format to support semantic interoperability. The latter was stored in a triple store with a SPARQL interface, enabling findable and accessible datasets that are searchable and traceable. By creating semantically linked data structures in line with FAIR principles, this approach allows traceable and findable experimental results between stakeholders through both human-readable and machine-actionable formats. Seamless integration of the modular microservices of the data pipeline within established lab practices minimizes disruption while maintaining the software framework. The present work demonstrates the practical implementation of a FAIR data pipeline within a laboratory setting, paving the way for future data-centric science.\",\"PeriodicalId\":12260,\"journal\":{\"name\":\"F1000Research\",\"volume\":\"14 \",\"pages\":\"173\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-05-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12120422/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"F1000Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.12688/f1000research.161252.2\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"Pharmacology, Toxicology and Pharmaceutics\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"F1000Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12688/f1000research.161252.2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"Pharmacology, Toxicology and Pharmaceutics","Score":null,"Total":0}

引用次数: 0

摘要

材料科学与工程（MSE）中持续的数字化和数据驱动的发展强调了重复使用研究数据和实现机器可访问性的重要性，这需要强大的数据管理和一致的语义数据表示。本体已经成为从不一致的数据结构中建立可互操作和可重用数据结构的强大工具。尽管在特定应用的语义数据表示方面取得了进展，但将应用本体与主要数据存储库（如电子实验室笔记本（eln））集成以提供世界数据仍然是一项开放的任务。作为MSE领域的一个用例，这项工作从工程师的角度提出了一个基于语义技术的系统，在信息科学家的帮助下开发，并在小范围内展开。阐述了火焰喷雾热解（FSP）过程应用本体（AO）的开发，实现了一个数据管道。拟议的FSP应用本体来自实验性的内部最佳实践程序，并适应中级项目材料数字核心本体（PMDco），以实现MSE领域内的互操作性。该管道从ELN中检索人工获取的实验数据，将其转换为机器可操作的格式，并将其转换为资源描述框架（RDF）格式，以支持语义互操作性。后者存储在具有SPARQL接口的三重存储中，支持可搜索和可跟踪的可查找和可访问数据集。通过创建符合FAIR原则的语义链接数据结构，这种方法允许在利益相关者之间通过人类可读和机器可操作的格式进行可跟踪和可查找的实验结果。在已建立的实验室实践中无缝集成数据管道的模块化微服务，在维护软件框架的同时最大限度地减少中断。目前的工作展示了FAIR数据管道在实验室环境中的实际实施，为未来以数据为中心的科学铺平了道路。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

An integrated data pipeline for semantic data representation of the flame spray pyrolysis process.

Ongoing digitalization and data-driven developments in materials science and engineering (MSE) emphasize the growing importance of reusing research data and enabling machine accessibility, which requires robust data management and consistent semantic data representation. Ontologies have emerged as powerful tools for establishing interoperable and reusable data structures from inconsistent data structures. Despite advancements in semantic data representation for specific applications, integrating application ontologies with primary data repositories, such as electronic lab notebooks (ELNs), to feed world data remains an open task. As a use case in the MSE domain, this work presents a system based on semantic technologies from the point of view of engineers, developed with the help of information scientists, and unraveled on a small scale. The development of an application ontology (AO) was elaborated for flame spray pyrolysis (FSP) processes with the implementation of a data pipeline. The proposed FSP application ontology emerges from experimental in-house best-practice procedures and is adapted to the mid-level Project Material Digital core ontology (PMDco) to allow interoperability within the MSE domain. The pipeline retrieves manually acquired experimental data from an ELN, translates it into a machine-actionable format, and converts it into a Resource Description Framework (RDF) format to support semantic interoperability. The latter was stored in a triple store with a SPARQL interface, enabling findable and accessible datasets that are searchable and traceable. By creating semantically linked data structures in line with FAIR principles, this approach allows traceable and findable experimental results between stakeholders through both human-readable and machine-actionable formats. Seamless integration of the modular microservices of the data pipeline within established lab practices minimizes disruption while maintaining the software framework. The present work demonstrates the practical implementation of a FAIR data pipeline within a laboratory setting, paving the way for future data-centric science.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

F1000Research Pharmacology, Toxicology and Pharmaceutics-Pharmacology, Toxicology and Pharmaceutics (all)

CiteScore

5.00

自引率

0.00%

发文量

1646

审稿时长

1 weeks

期刊介绍： F1000Research publishes articles and other research outputs reporting basic scientific, scholarly, translational and clinical research across the physical and life sciences, engineering, medicine, social sciences and humanities. F1000Research is a scholarly publication platform set up for the scientific, scholarly and medical research community; each article has at least one author who is a qualified researcher, scholar or clinician actively working in their speciality and who has made a key contribution to the article. Articles must be original (not duplications). All research is suitable irrespective of the perceived level of interest or novelty; we welcome confirmatory and negative results, as well as null studies. F1000Research publishes different type of research, including clinical trials, systematic reviews, software tools, method articles, and many others. Reviews and Opinion articles providing a balanced and comprehensive overview of the latest discoveries in a particular field, or presenting a personal perspective on recent developments, are also welcome. See the full list of article types we accept for more information.