{"title":"火焰喷雾热解过程的语义数据表示集成数据管道。","authors":"Manuel Vollbrecht, Keno Krieger, Jannis Grundmann, Henk Birkholz, Norbert Riefler, Lutz Mädler","doi":"10.12688/f1000research.161252.2","DOIUrl":null,"url":null,"abstract":"<p><p>Ongoing digitalization and data-driven developments in materials science and engineering (MSE) emphasize the growing importance of reusing research data and enabling machine accessibility, which requires robust data management and consistent semantic data representation. Ontologies have emerged as powerful tools for establishing interoperable and reusable data structures from inconsistent data structures. Despite advancements in semantic data representation for specific applications, integrating application ontologies with primary data repositories, such as electronic lab notebooks (ELNs), to feed world data remains an open task. As a use case in the MSE domain, this work presents a system based on semantic technologies from the point of view of engineers, developed with the help of information scientists, and unraveled on a small scale. The development of an application ontology (AO) was elaborated for flame spray pyrolysis (FSP) processes with the implementation of a data pipeline. The proposed FSP application ontology emerges from experimental in-house best-practice procedures and is adapted to the mid-level Project Material Digital core ontology (PMDco) to allow interoperability within the MSE domain. The pipeline retrieves manually acquired experimental data from an ELN, translates it into a machine-actionable format, and converts it into a Resource Description Framework (RDF) format to support semantic interoperability. The latter was stored in a triple store with a SPARQL interface, enabling findable and accessible datasets that are searchable and traceable. By creating semantically linked data structures in line with FAIR principles, this approach allows traceable and findable experimental results between stakeholders through both human-readable and machine-actionable formats. Seamless integration of the modular microservices of the data pipeline within established lab practices minimizes disruption while maintaining the software framework. The present work demonstrates the practical implementation of a FAIR data pipeline within a laboratory setting, paving the way for future data-centric science.</p>","PeriodicalId":12260,"journal":{"name":"F1000Research","volume":"14 ","pages":"173"},"PeriodicalIF":0.0000,"publicationDate":"2025-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12120422/pdf/","citationCount":"0","resultStr":"{\"title\":\"An integrated data pipeline for semantic data representation of the flame spray pyrolysis process.\",\"authors\":\"Manuel Vollbrecht, Keno Krieger, Jannis Grundmann, Henk Birkholz, Norbert Riefler, Lutz Mädler\",\"doi\":\"10.12688/f1000research.161252.2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Ongoing digitalization and data-driven developments in materials science and engineering (MSE) emphasize the growing importance of reusing research data and enabling machine accessibility, which requires robust data management and consistent semantic data representation. Ontologies have emerged as powerful tools for establishing interoperable and reusable data structures from inconsistent data structures. Despite advancements in semantic data representation for specific applications, integrating application ontologies with primary data repositories, such as electronic lab notebooks (ELNs), to feed world data remains an open task. As a use case in the MSE domain, this work presents a system based on semantic technologies from the point of view of engineers, developed with the help of information scientists, and unraveled on a small scale. The development of an application ontology (AO) was elaborated for flame spray pyrolysis (FSP) processes with the implementation of a data pipeline. The proposed FSP application ontology emerges from experimental in-house best-practice procedures and is adapted to the mid-level Project Material Digital core ontology (PMDco) to allow interoperability within the MSE domain. The pipeline retrieves manually acquired experimental data from an ELN, translates it into a machine-actionable format, and converts it into a Resource Description Framework (RDF) format to support semantic interoperability. The latter was stored in a triple store with a SPARQL interface, enabling findable and accessible datasets that are searchable and traceable. By creating semantically linked data structures in line with FAIR principles, this approach allows traceable and findable experimental results between stakeholders through both human-readable and machine-actionable formats. Seamless integration of the modular microservices of the data pipeline within established lab practices minimizes disruption while maintaining the software framework. The present work demonstrates the practical implementation of a FAIR data pipeline within a laboratory setting, paving the way for future data-centric science.</p>\",\"PeriodicalId\":12260,\"journal\":{\"name\":\"F1000Research\",\"volume\":\"14 \",\"pages\":\"173\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-05-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12120422/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"F1000Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.12688/f1000research.161252.2\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"Pharmacology, Toxicology and Pharmaceutics\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"F1000Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12688/f1000research.161252.2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"Pharmacology, Toxicology and Pharmaceutics","Score":null,"Total":0}
An integrated data pipeline for semantic data representation of the flame spray pyrolysis process.
Ongoing digitalization and data-driven developments in materials science and engineering (MSE) emphasize the growing importance of reusing research data and enabling machine accessibility, which requires robust data management and consistent semantic data representation. Ontologies have emerged as powerful tools for establishing interoperable and reusable data structures from inconsistent data structures. Despite advancements in semantic data representation for specific applications, integrating application ontologies with primary data repositories, such as electronic lab notebooks (ELNs), to feed world data remains an open task. As a use case in the MSE domain, this work presents a system based on semantic technologies from the point of view of engineers, developed with the help of information scientists, and unraveled on a small scale. The development of an application ontology (AO) was elaborated for flame spray pyrolysis (FSP) processes with the implementation of a data pipeline. The proposed FSP application ontology emerges from experimental in-house best-practice procedures and is adapted to the mid-level Project Material Digital core ontology (PMDco) to allow interoperability within the MSE domain. The pipeline retrieves manually acquired experimental data from an ELN, translates it into a machine-actionable format, and converts it into a Resource Description Framework (RDF) format to support semantic interoperability. The latter was stored in a triple store with a SPARQL interface, enabling findable and accessible datasets that are searchable and traceable. By creating semantically linked data structures in line with FAIR principles, this approach allows traceable and findable experimental results between stakeholders through both human-readable and machine-actionable formats. Seamless integration of the modular microservices of the data pipeline within established lab practices minimizes disruption while maintaining the software framework. The present work demonstrates the practical implementation of a FAIR data pipeline within a laboratory setting, paving the way for future data-centric science.
F1000ResearchPharmacology, Toxicology and Pharmaceutics-Pharmacology, Toxicology and Pharmaceutics (all)
CiteScore
5.00
自引率
0.00%
发文量
1646
审稿时长
1 weeks
期刊介绍:
F1000Research publishes articles and other research outputs reporting basic scientific, scholarly, translational and clinical research across the physical and life sciences, engineering, medicine, social sciences and humanities. F1000Research is a scholarly publication platform set up for the scientific, scholarly and medical research community; each article has at least one author who is a qualified researcher, scholar or clinician actively working in their speciality and who has made a key contribution to the article. Articles must be original (not duplications). All research is suitable irrespective of the perceived level of interest or novelty; we welcome confirmatory and negative results, as well as null studies. F1000Research publishes different type of research, including clinical trials, systematic reviews, software tools, method articles, and many others. Reviews and Opinion articles providing a balanced and comprehensive overview of the latest discoveries in a particular field, or presenting a personal perspective on recent developments, are also welcome. See the full list of article types we accept for more information.