{"title":"Enhancing Patient Identification Accuracy in Shared Child Health Records: a Hybrid Approach for the Lao Language Context.","authors":"Thepphouthone Sorsavanh, Chang Liu, Goshiro Yamamoto, Yukiko Mori, Shinji Kobayashi, Tomohiro Kuroda","doi":"10.1007/s10916-025-02260-6","DOIUrl":null,"url":null,"abstract":"<p><p>The Shared Child Health Record (SCHR) project in Lao People's Democratic Republic (PDR) aims to enhance pediatric health care services and health outcomes by enabling data exchange between health care systems. However, persistent challenges of duplication due to patient identification are hindered by non-Latin script complexities, including phonetic variations, a tonal alphabet, and temporary naming practices (e.g., placeholder names such as \"Eanoi\"). Existing patient-matching algorithms designed for Latin scripts underperform in this context. We assessed deterministic, probabilistic, and hybrid matching approaches using a Lao SCHR dataset of 20,433 records. A manual gold standard review (3,191 matches) validated their performance. Probabilistic matching employed the Fellegi-Sunter model with Jaro‒Winkler similarity, whereas the hybrid method combined deterministic rules (exact name/DOB matches) and probabilistic adjustments for unresolved cases. The hybrid and probabilistic methods consistently outperformed deterministic matching, achieving a 90% recall rate on the SCHR dataset. Despite its lower performance in Lao health records, the hybrid method resolved approximately 2,872 duplicates in SCHR. Challenges included twin records (shared identifiers) and temporary-to-permanent name transitions. This study is the first to adapt patient-matching methodologies for Lao's linguistic and infrastructural context. While hybrid methods show promise, performance gaps persist compared with those of Latin-based systems. These findings have significant implications with respect to improving the accuracy and efficiency of HIE systems in Lao PDR and other resource-limited settings.Clinical trial number: Not applicable.</p>","PeriodicalId":16338,"journal":{"name":"Journal of Medical Systems","volume":"49 1","pages":"119"},"PeriodicalIF":5.7000,"publicationDate":"2025-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12474658/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Medical Systems","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s10916-025-02260-6","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0
Abstract
The Shared Child Health Record (SCHR) project in Lao People's Democratic Republic (PDR) aims to enhance pediatric health care services and health outcomes by enabling data exchange between health care systems. However, persistent challenges of duplication due to patient identification are hindered by non-Latin script complexities, including phonetic variations, a tonal alphabet, and temporary naming practices (e.g., placeholder names such as "Eanoi"). Existing patient-matching algorithms designed for Latin scripts underperform in this context. We assessed deterministic, probabilistic, and hybrid matching approaches using a Lao SCHR dataset of 20,433 records. A manual gold standard review (3,191 matches) validated their performance. Probabilistic matching employed the Fellegi-Sunter model with Jaro‒Winkler similarity, whereas the hybrid method combined deterministic rules (exact name/DOB matches) and probabilistic adjustments for unresolved cases. The hybrid and probabilistic methods consistently outperformed deterministic matching, achieving a 90% recall rate on the SCHR dataset. Despite its lower performance in Lao health records, the hybrid method resolved approximately 2,872 duplicates in SCHR. Challenges included twin records (shared identifiers) and temporary-to-permanent name transitions. This study is the first to adapt patient-matching methodologies for Lao's linguistic and infrastructural context. While hybrid methods show promise, performance gaps persist compared with those of Latin-based systems. These findings have significant implications with respect to improving the accuracy and efficiency of HIE systems in Lao PDR and other resource-limited settings.Clinical trial number: Not applicable.
期刊介绍:
Journal of Medical Systems provides a forum for the presentation and discussion of the increasingly extensive applications of new systems techniques and methods in hospital clinic and physician''s office administration; pathology radiology and pharmaceutical delivery systems; medical records storage and retrieval; and ancillary patient-support systems. The journal publishes informative articles essays and studies across the entire scale of medical systems from large hospital programs to novel small-scale medical services. Education is an integral part of this amalgamation of sciences and selected articles are published in this area. Since existing medical systems are constantly being modified to fit particular circumstances and to solve specific problems the journal includes a special section devoted to status reports on current installations.