Yang Dai, Huei Hsun Wen, Joanna Yang, Neepa Gupta, Connie Rhee, Carol R Horowitz, Dinushika Mohottige, Girish N Nadkarni, Steven Coca, Lili Chan
{"title":"自然语言处理识别血液透析患者症状记录不足","authors":"Yang Dai, Huei Hsun Wen, Joanna Yang, Neepa Gupta, Connie Rhee, Carol R Horowitz, Dinushika Mohottige, Girish N Nadkarni, Steven Coca, Lili Chan","doi":"10.34067/KID.0000000694","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Patients on hemodialysis (HD) have a high burden of emotional and physical symptoms. These symptoms are often under-recognized. NLP can be used to identify patient symptoms from the EHR. However, whether symptom documentation matches patient reported burden is unclear.</p><p><strong>Methods: </strong>We conducted a prospective study of patients seen at an ambulatory nephrology practice from September 2020 to April 2021. We collected symptom surveys from patients, nurses, and physicians. We then developed a natural language processing (NLP) algorithm to identify symptoms from the patients' electronic health records (EHR) and validated the performance of this algorithm using manual chart review and patient surveys as a reference standard. Using patient surveys as the reference standard, we compared symptom identification by 1) physicians, 2) nurses, 3) physicians or nurses, and 4) NLP.</p><p><strong>Results: </strong>We enrolled 97 patients into our study, 63% were female, 49% were Non-Hispanic Black, and 41% were Hispanic. The most common symptoms reported by patients were fatigue (61%), cramping (59%), dry skin (53%), muscle soreness (43%), and itching (41%). Physicians and nurses significantly under-recognized patients' symptoms (sensitivity 0.51 (95% CI 0.40-0.61) and 0.63 (95% CI 0.52-0.72) respectively). Nurses were better at identifying symptoms when patients reported more severe symptoms. There was no difference in results by patients' sex or ethnicity. NLP had a sensitivity of 0.92, specificity of 0.95, PPV of 0.75, and NPV of 0.99 with manual EHR review as the reference standard, and a sensitivity of 0.58 (95% CI 0.47-0.68), specificity of 0.73 (95% CI 0.48-0.89), PPV of 0.92 (95% CI 0.82-0.97), and NPV of 0.24 (95% CI 0.14-0.38) compared with patient surveys.</p><p><strong>Conclusions: </strong>While patients on HD report high prevalence of symptoms, symptoms are under-recognized and under-documented. NLP was accurate at identifying symptoms when they were documented. Larger studies in representative populations are needed to assess the generalizability of the results of the study.</p>","PeriodicalId":17882,"journal":{"name":"Kidney360","volume":" ","pages":""},"PeriodicalIF":3.2000,"publicationDate":"2025-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Natural Language Processing Identifies Under-Documentation of Symptoms in Patients on Hemodialysis.\",\"authors\":\"Yang Dai, Huei Hsun Wen, Joanna Yang, Neepa Gupta, Connie Rhee, Carol R Horowitz, Dinushika Mohottige, Girish N Nadkarni, Steven Coca, Lili Chan\",\"doi\":\"10.34067/KID.0000000694\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Patients on hemodialysis (HD) have a high burden of emotional and physical symptoms. These symptoms are often under-recognized. NLP can be used to identify patient symptoms from the EHR. However, whether symptom documentation matches patient reported burden is unclear.</p><p><strong>Methods: </strong>We conducted a prospective study of patients seen at an ambulatory nephrology practice from September 2020 to April 2021. We collected symptom surveys from patients, nurses, and physicians. We then developed a natural language processing (NLP) algorithm to identify symptoms from the patients' electronic health records (EHR) and validated the performance of this algorithm using manual chart review and patient surveys as a reference standard. Using patient surveys as the reference standard, we compared symptom identification by 1) physicians, 2) nurses, 3) physicians or nurses, and 4) NLP.</p><p><strong>Results: </strong>We enrolled 97 patients into our study, 63% were female, 49% were Non-Hispanic Black, and 41% were Hispanic. The most common symptoms reported by patients were fatigue (61%), cramping (59%), dry skin (53%), muscle soreness (43%), and itching (41%). Physicians and nurses significantly under-recognized patients' symptoms (sensitivity 0.51 (95% CI 0.40-0.61) and 0.63 (95% CI 0.52-0.72) respectively). Nurses were better at identifying symptoms when patients reported more severe symptoms. There was no difference in results by patients' sex or ethnicity. NLP had a sensitivity of 0.92, specificity of 0.95, PPV of 0.75, and NPV of 0.99 with manual EHR review as the reference standard, and a sensitivity of 0.58 (95% CI 0.47-0.68), specificity of 0.73 (95% CI 0.48-0.89), PPV of 0.92 (95% CI 0.82-0.97), and NPV of 0.24 (95% CI 0.14-0.38) compared with patient surveys.</p><p><strong>Conclusions: </strong>While patients on HD report high prevalence of symptoms, symptoms are under-recognized and under-documented. NLP was accurate at identifying symptoms when they were documented. Larger studies in representative populations are needed to assess the generalizability of the results of the study.</p>\",\"PeriodicalId\":17882,\"journal\":{\"name\":\"Kidney360\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2025-01-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Kidney360\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.34067/KID.0000000694\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"UROLOGY & NEPHROLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Kidney360","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.34067/KID.0000000694","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"UROLOGY & NEPHROLOGY","Score":null,"Total":0}
引用次数: 0
摘要
背景:血液透析(HD)患者有很高的情绪和身体症状负担。这些症状往往没有得到充分认识。NLP可用于从电子病历中识别患者症状。然而,症状记录是否与患者报告的负担相符尚不清楚。方法:我们对2020年9月至2021年4月在门诊肾脏病科就诊的患者进行了一项前瞻性研究。我们收集了患者、护士和医生的症状调查。然后,我们开发了一种自然语言处理(NLP)算法来从患者的电子健康记录(EHR)中识别症状,并使用手动图表审查和患者调查作为参考标准验证了该算法的性能。我们以患者调查作为参考标准,比较了1)医生、2)护士、3)医生或护士和4)NLP的症状识别。结果:我们纳入了97例患者,63%为女性,49%为非西班牙裔黑人,41%为西班牙裔。患者报告的最常见症状是疲劳(61%)、痉挛(59%)、皮肤干燥(53%)、肌肉酸痛(43%)和瘙痒(41%)。医生和护士明显低估了患者的症状(敏感性分别为0.51 (95% CI 0.40-0.61)和0.63 (95% CI 0.52-0.72))。当病人报告的症状更严重时,护士更善于识别症状。患者的性别和种族在结果上没有差异。以人工EHR为参考标准,NLP的敏感性为0.92,特异性为0.95,PPV为0.75,NPV为0.99;与患者调查相比,NLP的敏感性为0.58 (95% CI 0.47-0.68),特异性为0.73 (95% CI 0.48-0.89), PPV为0.92 (95% CI 0.82-0.97), NPV为0.24 (95% CI 0.14-0.38)。结论:虽然HD患者报告的症状患病率很高,但症状未被充分认识和记录。当记录症状时,NLP在识别症状方面是准确的。需要在代表性人群中进行更大规模的研究,以评估研究结果的普遍性。
Natural Language Processing Identifies Under-Documentation of Symptoms in Patients on Hemodialysis.
Background: Patients on hemodialysis (HD) have a high burden of emotional and physical symptoms. These symptoms are often under-recognized. NLP can be used to identify patient symptoms from the EHR. However, whether symptom documentation matches patient reported burden is unclear.
Methods: We conducted a prospective study of patients seen at an ambulatory nephrology practice from September 2020 to April 2021. We collected symptom surveys from patients, nurses, and physicians. We then developed a natural language processing (NLP) algorithm to identify symptoms from the patients' electronic health records (EHR) and validated the performance of this algorithm using manual chart review and patient surveys as a reference standard. Using patient surveys as the reference standard, we compared symptom identification by 1) physicians, 2) nurses, 3) physicians or nurses, and 4) NLP.
Results: We enrolled 97 patients into our study, 63% were female, 49% were Non-Hispanic Black, and 41% were Hispanic. The most common symptoms reported by patients were fatigue (61%), cramping (59%), dry skin (53%), muscle soreness (43%), and itching (41%). Physicians and nurses significantly under-recognized patients' symptoms (sensitivity 0.51 (95% CI 0.40-0.61) and 0.63 (95% CI 0.52-0.72) respectively). Nurses were better at identifying symptoms when patients reported more severe symptoms. There was no difference in results by patients' sex or ethnicity. NLP had a sensitivity of 0.92, specificity of 0.95, PPV of 0.75, and NPV of 0.99 with manual EHR review as the reference standard, and a sensitivity of 0.58 (95% CI 0.47-0.68), specificity of 0.73 (95% CI 0.48-0.89), PPV of 0.92 (95% CI 0.82-0.97), and NPV of 0.24 (95% CI 0.14-0.38) compared with patient surveys.
Conclusions: While patients on HD report high prevalence of symptoms, symptoms are under-recognized and under-documented. NLP was accurate at identifying symptoms when they were documented. Larger studies in representative populations are needed to assess the generalizability of the results of the study.