Information Extraction from Swedish Medical Prescriptions with Sig-Transformer Encoder

John N. Pougue Biyong, Bo Wang, Terry Lyons, A. Nevado-Holgado

Clinical Natural Language Processing Workshop, 2020-10-10. DOI: 10.18653/v1/2020.clinicalnlp-1.5
Relying on large pretrained language models such as Bidirectional Encoder Representations from Transformers (BERT) for encoding, and adding a simple prediction layer on top, has led to impressive performance in many clinical natural language processing (NLP) tasks. In this work, we present a novel extension to the Transformer architecture that incorporates the signature transform into the self-attention model. This architecture is inserted between the embedding and prediction layers. Experiments on a new Swedish prescription dataset show the proposed architecture to be superior to baseline models in two of the three information extraction tasks. Finally, we compare two embedding approaches: applying Multilingual BERT directly to the Swedish text, and translating the Swedish text to English and then encoding it with a BERT model pretrained on clinical notes.
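The abstract describes the general idea (a signature transform combined with self-attention, placed between the embedding and prediction layers) but not the exact implementation. The following is a minimal PyTorch sketch of that idea under stated assumptions: the names `SigTransformerBlock`, `depth2_signature`, and `sig_channels` are illustrative and not taken from the paper, the signature is truncated at depth 2 and computed manually via Chen's relation, and the sequence of attention outputs is treated as a piecewise-linear path whose signature summary is projected back and added to every token.

```python
import torch
import torch.nn as nn


def depth2_signature(path: torch.Tensor) -> torch.Tensor:
    """Depth-2 truncated path signature of a batch of sequences.

    path: (batch, length, channels). Each sequence of hidden states is
    treated as a piecewise-linear path; the level-1 and level-2 signature
    terms are returned flattened as (batch, d + d*d).
    """
    increments = path[:, 1:, :] - path[:, :-1, :]            # (B, L-1, d)
    b, _, d = increments.shape
    s1 = torch.zeros(b, d, device=path.device, dtype=path.dtype)
    s2 = torch.zeros(b, d, d, device=path.device, dtype=path.dtype)
    for k in range(increments.shape[1]):
        dx = increments[:, k, :]                             # (B, d)
        # Chen's relation for appending one linear segment:
        # level 2 gains S1 (x) dx plus the segment's own dx (x) dx / 2.
        s2 = s2 + s1.unsqueeze(2) * dx.unsqueeze(1) \
                + 0.5 * dx.unsqueeze(2) * dx.unsqueeze(1)
        s1 = s1 + dx
    return torch.cat([s1, s2.flatten(1)], dim=-1)            # (B, d + d*d)


class SigTransformerBlock(nn.Module):
    """Hypothetical encoder block: self-attention over BERT embeddings,
    followed by a signature summary of the attention outputs that is
    projected back to the hidden size and broadcast to every token."""

    def __init__(self, hidden: int, heads: int = 4, sig_channels: int = 16):
        super().__init__()
        self.attn = nn.MultiheadAttention(hidden, heads, batch_first=True)
        # Project down before the signature: the level-2 term is quadratic
        # in the number of channels, so the full hidden size is impractical.
        self.pre_sig = nn.Linear(hidden, sig_channels)
        self.sig_proj = nn.Linear(sig_channels + sig_channels ** 2, hidden)
        self.norm = nn.LayerNorm(hidden)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq_len, hidden), e.g. (M-)BERT embeddings of a prescription.
        attn_out, _ = self.attn(x, x, x)
        sig = depth2_signature(self.pre_sig(attn_out))       # (B, c + c^2)
        summary = self.sig_proj(sig).unsqueeze(1)            # (B, 1, hidden)
        return self.norm(attn_out + summary)                 # broadcast over tokens
```

A token-level prediction head (e.g. a linear layer over the block's output) would then complete the embedding-to-prediction pipeline the abstract describes. For higher signature depths, libraries such as `signatory` or `iisignature` compute signatures efficiently; the manual depth-2 loop above is only for illustration.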