Hao Fan, Sarah C Rossetti, Jennifer Thate, Rosemary Mugoya, Albert M Lai, Po-Yin Yen
{"title":"Semi-automated pipeline to accelerate multi-site flowsheet alignment and concept mapping in electronic health records.","authors":"Hao Fan, Sarah C Rossetti, Jennifer Thate, Rosemary Mugoya, Albert M Lai, Po-Yin Yen","doi":"10.1093/jamia/ocaf076","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>Health-care institutions customize electronic health record (EHR) configurations to reflect their unique workflows and patient care priorities. Ensuring EHR alignment across sites facilitates seamless information exchange. We developed a pipeline for EHR flowsheet alignment between health-care organizations. The pipeline is augmented by mapping flowsheet data fields to concepts in the Clinical Care Classification (CCC) nursing terminology.</p><p><strong>Materials and methods: </strong>Flowsheet templates and measures from 2 study sites were transformed into template-measure (T-M) pairs. They were aligned through exact, lexical, or semantic matching. Lexical matches were assessed using Jaccard similarity and fuzzy matching methods. Semantic alignment was determined using cosine similarity between large language model-generated embeddings of T-M pairs and CCC concepts to rank and recommend the top n concepts in CCC. Concept mappings were evaluated based on whether concepts were mapped consistently within the CCC hierarchy.</p><p><strong>Results: </strong>We totally aligned 31 255 unique T-M pairs in acute care units and 27 012 T-M pairs in intensive care units from 2 study sites. When restricted to the top-ranked CCC concept (n = 1), we achieved a 63% flowsheet alignment rate with a 53% concept mapping rate. Expanding to the top 3 concepts (n = 3) improved alignment to 96.5% and concept mapping to 96%.</p><p><strong>Discussion and conclusion: </strong>Electronic health record data field alignment with concept mapping offers opportunities to standardize data elements presented in flowsheets across health-care sites. We demonstrated the feasibility of leveraging a semi-automated pipeline to streamline the EHR flowsheet alignment and accelerate the manual concept mapping process.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":"1140-1148"},"PeriodicalIF":4.6000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12202140/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the American Medical Informatics Association","FirstCategoryId":"91","ListUrlMain":"https://doi.org/10.1093/jamia/ocaf076","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Objectives: Health-care institutions customize electronic health record (EHR) configurations to reflect their unique workflows and patient care priorities. Ensuring EHR alignment across sites facilitates seamless information exchange. We developed a pipeline for EHR flowsheet alignment between health-care organizations. The pipeline is augmented by mapping flowsheet data fields to concepts in the Clinical Care Classification (CCC) nursing terminology.
Materials and methods: Flowsheet templates and measures from 2 study sites were transformed into template-measure (T-M) pairs. They were aligned through exact, lexical, or semantic matching. Lexical matches were assessed using Jaccard similarity and fuzzy matching methods. Semantic alignment was determined using cosine similarity between large language model-generated embeddings of T-M pairs and CCC concepts to rank and recommend the top n concepts in CCC. Concept mappings were evaluated based on whether concepts were mapped consistently within the CCC hierarchy.
Results: We totally aligned 31 255 unique T-M pairs in acute care units and 27 012 T-M pairs in intensive care units from 2 study sites. When restricted to the top-ranked CCC concept (n = 1), we achieved a 63% flowsheet alignment rate with a 53% concept mapping rate. Expanding to the top 3 concepts (n = 3) improved alignment to 96.5% and concept mapping to 96%.
Discussion and conclusion: Electronic health record data field alignment with concept mapping offers opportunities to standardize data elements presented in flowsheets across health-care sites. We demonstrated the feasibility of leveraging a semi-automated pipeline to streamline the EHR flowsheet alignment and accelerate the manual concept mapping process.
期刊介绍:
JAMIA is AMIA''s premier peer-reviewed journal for biomedical and health informatics. Covering the full spectrum of activities in the field, JAMIA includes informatics articles in the areas of clinical care, clinical research, translational science, implementation science, imaging, education, consumer health, public health, and policy. JAMIA''s articles describe innovative informatics research and systems that help to advance biomedical science and to promote health. Case reports, perspectives and reviews also help readers stay connected with the most important informatics developments in implementation, policy and education.