{"title":"评估牙科教育中电子健康记录的完整性:一项大数据研究。","authors":"Tamanna Tiwari, Maxim Kondratenko, Nihmath Nasiha, Toan Ong, Sangeetha Chandrasekaran, Gary Kostbade, Zachary Giano","doi":"10.3389/froh.2025.1535164","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>The BigMouth Dental Data Repository is an oral health database developed from de-identified electronic health record (EHR) data from eleven dental schools within the United States. To better understand how this database can be used for further research, the repository must be analyzed for data quality, such as accuracy, consistency, and completeness. This study determined the completeness of all patient health records between 2017 and 2019, including demographic, dental, behavioral, and health history variables at the students, faculty, and resident level.</p><p><strong>Methods: </strong>This study analyzed demographic (age, gender, race/ethnicity, zip code, insurance), dental (pain ratings), behavioral (tobacco, alcohol, and drug use), and health history variables for completeness. ANOVA was conducted to detect differences in providers collecting data by year (using Tukey <i>post hoc</i> differences at <i>p</i> < .05). Effect sizes are presented by comparing students to all other provider types.</p><p><strong>Results: </strong>Overall, the data showed high completeness in demographic variables (97.6%-99.9% for age, gender, and zip code) among the total sample of 543,363 patient visits. However, lower completeness rates were found in dental and behavioral variables (ranging from 1.5% to 66.1%), suggesting potential limitations for certain research applications. The study found significant differences in the completeness of records between students, faculty, and residents. In demographic variables, students demonstrated significantly higher completeness rates than faculty across the years 2017-2019, with 79.8%, 79%, and 78.8% completeness for race/ethnicity records, respectively. Furthermore, residents and faculty exhibited significantly higher completeness rates (76.8% and 86.7%, respectively) in insurance information compared to students (56.7%). Notably, students showcased greater completeness percentages in variables related to tobacco use, alcohol use, drug use, and health history compared to both faculty and residents.</p><p><strong>Conclusion: </strong>This study underscores significant variations in the completeness of EHR data among students, faculty, and residents across different schools. Despite these variances, the overall findings suggest a robust level of completeness in the demographic and health variables within the dataset.</p>","PeriodicalId":94016,"journal":{"name":"Frontiers in oral health","volume":"6 ","pages":"1535164"},"PeriodicalIF":3.0000,"publicationDate":"2025-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12170648/pdf/","citationCount":"0","resultStr":"{\"title\":\"Evaluating the completeness of electronic health records in dental education: a big data study.\",\"authors\":\"Tamanna Tiwari, Maxim Kondratenko, Nihmath Nasiha, Toan Ong, Sangeetha Chandrasekaran, Gary Kostbade, Zachary Giano\",\"doi\":\"10.3389/froh.2025.1535164\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objectives: </strong>The BigMouth Dental Data Repository is an oral health database developed from de-identified electronic health record (EHR) data from eleven dental schools within the United States. To better understand how this database can be used for further research, the repository must be analyzed for data quality, such as accuracy, consistency, and completeness. This study determined the completeness of all patient health records between 2017 and 2019, including demographic, dental, behavioral, and health history variables at the students, faculty, and resident level.</p><p><strong>Methods: </strong>This study analyzed demographic (age, gender, race/ethnicity, zip code, insurance), dental (pain ratings), behavioral (tobacco, alcohol, and drug use), and health history variables for completeness. ANOVA was conducted to detect differences in providers collecting data by year (using Tukey <i>post hoc</i> differences at <i>p</i> < .05). Effect sizes are presented by comparing students to all other provider types.</p><p><strong>Results: </strong>Overall, the data showed high completeness in demographic variables (97.6%-99.9% for age, gender, and zip code) among the total sample of 543,363 patient visits. However, lower completeness rates were found in dental and behavioral variables (ranging from 1.5% to 66.1%), suggesting potential limitations for certain research applications. The study found significant differences in the completeness of records between students, faculty, and residents. In demographic variables, students demonstrated significantly higher completeness rates than faculty across the years 2017-2019, with 79.8%, 79%, and 78.8% completeness for race/ethnicity records, respectively. Furthermore, residents and faculty exhibited significantly higher completeness rates (76.8% and 86.7%, respectively) in insurance information compared to students (56.7%). Notably, students showcased greater completeness percentages in variables related to tobacco use, alcohol use, drug use, and health history compared to both faculty and residents.</p><p><strong>Conclusion: </strong>This study underscores significant variations in the completeness of EHR data among students, faculty, and residents across different schools. Despite these variances, the overall findings suggest a robust level of completeness in the demographic and health variables within the dataset.</p>\",\"PeriodicalId\":94016,\"journal\":{\"name\":\"Frontiers in oral health\",\"volume\":\"6 \",\"pages\":\"1535164\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2025-06-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12170648/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in oral health\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3389/froh.2025.1535164\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q1\",\"JCRName\":\"DENTISTRY, ORAL SURGERY & MEDICINE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in oral health","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/froh.2025.1535164","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"DENTISTRY, ORAL SURGERY & MEDICINE","Score":null,"Total":0}
Evaluating the completeness of electronic health records in dental education: a big data study.
Objectives: The BigMouth Dental Data Repository is an oral health database developed from de-identified electronic health record (EHR) data from eleven dental schools within the United States. To better understand how this database can be used for further research, the repository must be analyzed for data quality, such as accuracy, consistency, and completeness. This study determined the completeness of all patient health records between 2017 and 2019, including demographic, dental, behavioral, and health history variables at the students, faculty, and resident level.
Methods: This study analyzed demographic (age, gender, race/ethnicity, zip code, insurance), dental (pain ratings), behavioral (tobacco, alcohol, and drug use), and health history variables for completeness. ANOVA was conducted to detect differences in providers collecting data by year (using Tukey post hoc differences at p < .05). Effect sizes are presented by comparing students to all other provider types.
Results: Overall, the data showed high completeness in demographic variables (97.6%-99.9% for age, gender, and zip code) among the total sample of 543,363 patient visits. However, lower completeness rates were found in dental and behavioral variables (ranging from 1.5% to 66.1%), suggesting potential limitations for certain research applications. The study found significant differences in the completeness of records between students, faculty, and residents. In demographic variables, students demonstrated significantly higher completeness rates than faculty across the years 2017-2019, with 79.8%, 79%, and 78.8% completeness for race/ethnicity records, respectively. Furthermore, residents and faculty exhibited significantly higher completeness rates (76.8% and 86.7%, respectively) in insurance information compared to students (56.7%). Notably, students showcased greater completeness percentages in variables related to tobacco use, alcohol use, drug use, and health history compared to both faculty and residents.
Conclusion: This study underscores significant variations in the completeness of EHR data among students, faculty, and residents across different schools. Despite these variances, the overall findings suggest a robust level of completeness in the demographic and health variables within the dataset.