Evaluation Data and Benchmarks for Cascaded Speech Recognition and Entity Extraction

Proceedings of the Third Edition Workshop on Speech, Language & Audio in Multimedia Pub Date : 2015-10-30 DOI:10.1145/2802558.2814646

Liyuan Zhou, H. Suominen, L. Hanlen

{"title":"Evaluation Data and Benchmarks for Cascaded Speech Recognition and Entity Extraction","authors":"Liyuan Zhou, H. Suominen, L. Hanlen","doi":"10.1145/2802558.2814646","DOIUrl":null,"url":null,"abstract":"During clinical handover, clinicians exchange information about the patients and the state of clinical management. To improve care safety and quality, both handover and its documentation have been standardized. Speech recognition and entity extraction provide a way to help health service providers to follow these standards by implementing the handover process as a structured form, whose headings guide the handover narrative, and the documentation process as proofing and sign-off of the automatically filled-out form. In this paper, we evaluate such systems. The form considers the sections of Handover nurse, Patient introduction, My shift, Medication, Appointments, and Future care, divided in 49 mutually exclusive headings to fill out with speech recognized and extracted entities. Our system correctly recognizes 10,244 out of 14,095 spoken words and regardless of 6,692 erroneous words, its error percentage is significantly smaller than for systems submitted to the CLEF eHealth Evaluation Lab 2015. In the extraction of 35 entities with training data (i.e., 14 headings were not present in the 101 expert-annotated training documents with 8,487 words in total), the system correctly extracts 2,375 out of 3,793 words in 50 test documents after calibration on 3,937 words in 50 validation documents. This translates to over 90% F1 in extracting information for the patient's age, current bed, current room, and given name and over 70% F1 for patient's admission reason/diagnosis and last name. F1 for filtering out irrelevant information is 78%. We have made the data publicly available for 201 handover cases together with processing results and code and proposed the extraction task for CLEF eHealth 2016.","PeriodicalId":115369,"journal":{"name":"Proceedings of the Third Edition Workshop on Speech, Language & Audio in Multimedia","volume":"127 1-2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Third Edition Workshop on Speech, Language & Audio in Multimedia","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2802558.2814646","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 7

Abstract

During clinical handover, clinicians exchange information about the patients and the state of clinical management. To improve care safety and quality, both handover and its documentation have been standardized. Speech recognition and entity extraction provide a way to help health service providers to follow these standards by implementing the handover process as a structured form, whose headings guide the handover narrative, and the documentation process as proofing and sign-off of the automatically filled-out form. In this paper, we evaluate such systems. The form considers the sections of Handover nurse, Patient introduction, My shift, Medication, Appointments, and Future care, divided in 49 mutually exclusive headings to fill out with speech recognized and extracted entities. Our system correctly recognizes 10,244 out of 14,095 spoken words and regardless of 6,692 erroneous words, its error percentage is significantly smaller than for systems submitted to the CLEF eHealth Evaluation Lab 2015. In the extraction of 35 entities with training data (i.e., 14 headings were not present in the 101 expert-annotated training documents with 8,487 words in total), the system correctly extracts 2,375 out of 3,793 words in 50 test documents after calibration on 3,937 words in 50 validation documents. This translates to over 90% F1 in extracting information for the patient's age, current bed, current room, and given name and over 70% F1 for patient's admission reason/diagnosis and last name. F1 for filtering out irrelevant information is 78%. We have made the data publicly available for 201 handover cases together with processing results and code and proposed the extraction task for CLEF eHealth 2016.

查看原文本刊更多论文

级联语音识别和实体提取的评估数据和基准

在临床交接过程中，临床医生交换患者信息和临床管理情况。为了提高护理的安全性和质量，移交及其文件都已标准化。语音识别和实体提取提供了一种帮助卫生服务提供者遵循这些标准的方法，方法是将移交过程作为结构化表格实施，其标题指导移交叙述，并将文档过程作为自动填写表格的验证和签名。在本文中，我们对这样的系统进行了评价。该表单考虑交接护士、患者介绍、我的班次、药物、预约和未来护理等部分，这些部分分为49个相互排斥的标题，用语音识别和提取的实体填写。我们的系统正确识别了14,095个口语单词中的10,244个，并且不考虑6,692个错误单词，其错误率明显低于提交给CLEF eHealth Evaluation Lab 2015的系统。在提取35个具有训练数据的实体(即101个专家注释的训练文档共8487个单词中没有14个标题)中，系统对50个验证文档中的3937个单词进行校准后，正确提取了50个测试文档中的3793个单词中的2375个。这意味着在提取患者的年龄、当前床位、当前房间和姓名信息方面F1超过90%，提取患者的入院原因/诊断和姓氏信息F1超过70%。过滤掉不相关信息的F1是78%。我们已经公开了201个交接案例的数据以及处理结果和代码，并提出了CLEF eHealth 2016的提取任务。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the Third Edition Workshop on Speech, Language & Audio in Multimedia

自引率

0.00%

发文量