Sathvik Guru Rao, Pranitha Rokkam, Bide Zhang, Astghik Sargsyan, Abish Kaladharan, Priya Sethumadhavan, Marc Jacobs, Martin Hofmann-Apitius, Alpha Tom Kodamullil
{"title":"Three-layered semantic framework for public health intelligence.","authors":"Sathvik Guru Rao, Pranitha Rokkam, Bide Zhang, Astghik Sargsyan, Abish Kaladharan, Priya Sethumadhavan, Marc Jacobs, Martin Hofmann-Apitius, Alpha Tom Kodamullil","doi":"10.1186/s13326-025-00338-1","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Disease surveillance systems play a crucial role in monitoring and preventing infectious diseases. However, the current landscape, primarily focused on fragmented health data, poses challenges to contextual understanding and decision-making. This paper addresses this issue by proposing a semantic framework using ontologies to provide a unified data representation for seamless integration. The paper demonstrates the effectiveness of this approach using a case study of a COVID-19 incident at a football game in Italy.</p><p><strong>Method: </strong>In this study, we undertook a comprehensive approach to gather and analyze data for the development of ontologies within the realm of pandemic intelligence. Multiple ontologies were meticulously crafted to cater to different domains related to pandemic intelligence, such as healthcare systems, mass gatherings, travel, and diseases. The ontologies were classified into top-level, domain, and application layers. This classification facilitated the development of a three-layered architecture, promoting reusability, and consistency in knowledge representation, and serving as the backbone of our semantic framework.</p><p><strong>Result: </strong>Through the utilization of our semantic framework, we accomplished semantic enrichment of both structured and unstructured data. The integration of data from diverse sources involved mapping to ontology concepts, leading to the creation and storage of RDF triples in the triple store. This process resulted in the construction of linked data, ultimately enhancing the discoverability and accessibility of valuable insights. Furthermore, our anomaly detection algorithm effectively leveraged knowledge graphs extracted from the triple store, employing semantic relationships to discern patterns and anomalies within the data. Notably, this capability was exemplified by the identification of correlations between a football game and a COVID-19 event occurring at the same location and time.</p><p><strong>Conclusion: </strong>The framework showcased its capability to address intricate, multi-domain queries and support diverse levels of detail. Additionally, it demonstrated proficiency in data analysis and visualization, generating graphs that depict patterns and trends; however, challenges related to ontology maintenance, alignment, and mapping must be addressed for the approach's optimal utilization.</p>","PeriodicalId":15055,"journal":{"name":"Journal of Biomedical Semantics","volume":"16 1","pages":"17"},"PeriodicalIF":2.0000,"publicationDate":"2025-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12439389/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Biomedical Semantics","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1186/s13326-025-00338-1","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Disease surveillance systems play a crucial role in monitoring and preventing infectious diseases. However, the current landscape, primarily focused on fragmented health data, poses challenges to contextual understanding and decision-making. This paper addresses this issue by proposing a semantic framework using ontologies to provide a unified data representation for seamless integration. The paper demonstrates the effectiveness of this approach using a case study of a COVID-19 incident at a football game in Italy.
Method: In this study, we undertook a comprehensive approach to gather and analyze data for the development of ontologies within the realm of pandemic intelligence. Multiple ontologies were meticulously crafted to cater to different domains related to pandemic intelligence, such as healthcare systems, mass gatherings, travel, and diseases. The ontologies were classified into top-level, domain, and application layers. This classification facilitated the development of a three-layered architecture, promoting reusability, and consistency in knowledge representation, and serving as the backbone of our semantic framework.
Result: Through the utilization of our semantic framework, we accomplished semantic enrichment of both structured and unstructured data. The integration of data from diverse sources involved mapping to ontology concepts, leading to the creation and storage of RDF triples in the triple store. This process resulted in the construction of linked data, ultimately enhancing the discoverability and accessibility of valuable insights. Furthermore, our anomaly detection algorithm effectively leveraged knowledge graphs extracted from the triple store, employing semantic relationships to discern patterns and anomalies within the data. Notably, this capability was exemplified by the identification of correlations between a football game and a COVID-19 event occurring at the same location and time.
Conclusion: The framework showcased its capability to address intricate, multi-domain queries and support diverse levels of detail. Additionally, it demonstrated proficiency in data analysis and visualization, generating graphs that depict patterns and trends; however, challenges related to ontology maintenance, alignment, and mapping must be addressed for the approach's optimal utilization.
期刊介绍:
Journal of Biomedical Semantics addresses issues of semantic enrichment and semantic processing in the biomedical domain. The scope of the journal covers two main areas:
Infrastructure for biomedical semantics: focusing on semantic resources and repositories, meta-data management and resource description, knowledge representation and semantic frameworks, the Biomedical Semantic Web, and semantic interoperability.
Semantic mining, annotation, and analysis: focusing on approaches and applications of semantic resources; and tools for investigation, reasoning, prediction, and discoveries in biomedicine.