{"title":"语义驱动的电子健康记录数据质量改进:系统回顾。","authors":"Yirong Wu, Mudan Ren, Na Chen, Liu Yang","doi":"10.1186/s12911-025-03146-w","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Data quality (DQ) of electronic health record (EHR) is crucial for the advancement of health informatization, yet it remains a significant challenge. Scholars are showing a growing interest in leveraging semantic technologies to enhance EHR data quality. However, previous studies have focused predominantly on specific semantic technologies, scenarios, or objectives-such as interoperability-often overlooking the potential of a various semantic technologies across different scenarios.</p><p><strong>Objective: </strong>This systematic review aimed to explore the potential of employing a range of semantic technologies to improve EHR data quality in a broader spectrum of application scenarios.</p><p><strong>Methods: </strong>Our systematic review followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. Three databases were searched, including PubMed, IEEE Xplore, and Web of Science Core Collection. The search terms used included \"Semantic*\", \"Quality\", \"Electronic Health Record*\", \"EHR*\", \"Electronic Medical Record*\", and \"EMR*\". These terms were combined via various Boolean operators to formulate multiple search queries.</p><p><strong>Results: </strong>Thirty-seven papers that met the inclusion criteria between 2008 and 2024 were analyzed. Six semantic techniques were identified as instrumental in improving EHR DQ: EHR standardization, controlled vocabulary, ontology, semantic web, knowledge graph, and natural language processing (NLP). These technologies were further mapped to 16 core data quality indicators and the FAIR principles (Findable, Accessible, Interoperable, and Reusable), highlighting their contributions across both technical and governance dimensions.</p><p><strong>Conclusions: </strong>The six identified semantic technologies can be categorized into three levels: foundational, general, and advanced. These technologies show significant potential in enhancing EHR DQ, particularly in the areas of conformance, portability, usability, and applicability, and they are suitable for a variety of contexs beyond interoperability, aligning with FAIR-aligned best practices in data management and reuse.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"298"},"PeriodicalIF":3.8000,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12337493/pdf/","citationCount":"0","resultStr":"{\"title\":\"Semantics-driven improvements in electronic health records data quality: a systematic review.\",\"authors\":\"Yirong Wu, Mudan Ren, Na Chen, Liu Yang\",\"doi\":\"10.1186/s12911-025-03146-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Data quality (DQ) of electronic health record (EHR) is crucial for the advancement of health informatization, yet it remains a significant challenge. Scholars are showing a growing interest in leveraging semantic technologies to enhance EHR data quality. However, previous studies have focused predominantly on specific semantic technologies, scenarios, or objectives-such as interoperability-often overlooking the potential of a various semantic technologies across different scenarios.</p><p><strong>Objective: </strong>This systematic review aimed to explore the potential of employing a range of semantic technologies to improve EHR data quality in a broader spectrum of application scenarios.</p><p><strong>Methods: </strong>Our systematic review followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. Three databases were searched, including PubMed, IEEE Xplore, and Web of Science Core Collection. The search terms used included \\\"Semantic*\\\", \\\"Quality\\\", \\\"Electronic Health Record*\\\", \\\"EHR*\\\", \\\"Electronic Medical Record*\\\", and \\\"EMR*\\\". These terms were combined via various Boolean operators to formulate multiple search queries.</p><p><strong>Results: </strong>Thirty-seven papers that met the inclusion criteria between 2008 and 2024 were analyzed. Six semantic techniques were identified as instrumental in improving EHR DQ: EHR standardization, controlled vocabulary, ontology, semantic web, knowledge graph, and natural language processing (NLP). These technologies were further mapped to 16 core data quality indicators and the FAIR principles (Findable, Accessible, Interoperable, and Reusable), highlighting their contributions across both technical and governance dimensions.</p><p><strong>Conclusions: </strong>The six identified semantic technologies can be categorized into three levels: foundational, general, and advanced. These technologies show significant potential in enhancing EHR DQ, particularly in the areas of conformance, portability, usability, and applicability, and they are suitable for a variety of contexs beyond interoperability, aligning with FAIR-aligned best practices in data management and reuse.</p>\",\"PeriodicalId\":9340,\"journal\":{\"name\":\"BMC Medical Informatics and Decision Making\",\"volume\":\"25 1\",\"pages\":\"298\"},\"PeriodicalIF\":3.8000,\"publicationDate\":\"2025-08-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12337493/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Medical Informatics and Decision Making\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12911-025-03146-w\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MEDICAL INFORMATICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-025-03146-w","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0
摘要
背景:电子病历(EHR)的数据质量是推进健康信息化的关键,但仍是一个重大挑战。学者们对利用语义技术来提高电子病历数据质量越来越感兴趣。然而,以前的研究主要集中在特定的语义技术、场景或目标(如互操作性)上,往往忽略了各种语义技术在不同场景中的潜力。目的:本系统综述旨在探索在更广泛的应用场景中采用一系列语义技术来提高电子病历数据质量的潜力。方法:我们的系统评价遵循系统评价和荟萃分析的首选报告项目(PRISMA)指南。检索了三个数据库,包括PubMed, IEEE Xplore和Web of Science Core Collection。搜索词包括“语义*”、“质量”、“电子病历*”、“电子病历*”、“电子病历*”和“电子病历*”。这些术语通过各种布尔运算符组合在一起,形成多个搜索查询。结果:共纳入2008 ~ 2024年符合纳入标准的论文37篇。六种语义技术在提高电子病历DQ方面发挥着重要作用:电子病历标准化、受控词汇、本体、语义网、知识图和自然语言处理。这些技术进一步映射到16个核心数据质量指标和FAIR原则(可查找、可访问、可互操作和可重用),突出了它们在技术和治理维度上的贡献。结论:六种识别的语义技术可分为基础、通用和高级三个层次。这些技术在增强EHR DQ方面显示出巨大的潜力,特别是在一致性、可移植性、可用性和适用性方面,它们适用于除互操作性之外的各种环境,在数据管理和重用方面与fair一致的最佳实践保持一致。
Semantics-driven improvements in electronic health records data quality: a systematic review.
Background: Data quality (DQ) of electronic health record (EHR) is crucial for the advancement of health informatization, yet it remains a significant challenge. Scholars are showing a growing interest in leveraging semantic technologies to enhance EHR data quality. However, previous studies have focused predominantly on specific semantic technologies, scenarios, or objectives-such as interoperability-often overlooking the potential of a various semantic technologies across different scenarios.
Objective: This systematic review aimed to explore the potential of employing a range of semantic technologies to improve EHR data quality in a broader spectrum of application scenarios.
Methods: Our systematic review followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. Three databases were searched, including PubMed, IEEE Xplore, and Web of Science Core Collection. The search terms used included "Semantic*", "Quality", "Electronic Health Record*", "EHR*", "Electronic Medical Record*", and "EMR*". These terms were combined via various Boolean operators to formulate multiple search queries.
Results: Thirty-seven papers that met the inclusion criteria between 2008 and 2024 were analyzed. Six semantic techniques were identified as instrumental in improving EHR DQ: EHR standardization, controlled vocabulary, ontology, semantic web, knowledge graph, and natural language processing (NLP). These technologies were further mapped to 16 core data quality indicators and the FAIR principles (Findable, Accessible, Interoperable, and Reusable), highlighting their contributions across both technical and governance dimensions.
Conclusions: The six identified semantic technologies can be categorized into three levels: foundational, general, and advanced. These technologies show significant potential in enhancing EHR DQ, particularly in the areas of conformance, portability, usability, and applicability, and they are suitable for a variety of contexs beyond interoperability, aligning with FAIR-aligned best practices in data management and reuse.
期刊介绍:
BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.