Alejandro Ruiz-de-laCuadra, J. L. L. Cuadrado, I. González-Carrasco, B. Ruíz-Mezcua
{"title":"西班牙临床叙事中的序列时间表达识别","authors":"Alejandro Ruiz-de-laCuadra, J. L. L. Cuadrado, I. González-Carrasco, B. Ruíz-Mezcua","doi":"10.1109/CBMS.2019.00074","DOIUrl":null,"url":null,"abstract":"Time expression recognition is one of the open issues in Natural Language Processing. These expressions are relevant to determine temporal aspects of the text as well as to establish relationships among facts described in said text. In the clinical domain, the temporal aspects are relevant to determine, for example, a sequence of facts in a clinical history. This paper presents research on the recognition of time expressions in Spanish according to the TIMEX3 standard. First, we establish HeidelTime, a well-known state of the art rule-based system, as a reference. Next, a hybrid model (a combination of bidirectional LSTM, CNN and CRF) is introduced to try to improve the results for the Spanish language. Both architectures have been tested with a Timex3 annotated Spanish corpus (TIMEBANK 1.0) to compare them. First, the results obtained show that the neural architecture obtains better results in Spanish. Finally, the neural architecture has been tested on a corpus of Clinical Notes (English and Spanish) in order to determine the results on this domain.","PeriodicalId":311634,"journal":{"name":"2019 IEEE 32nd International Symposium on Computer-Based Medical Systems (CBMS)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Sequence Time Expression Recognition in the Spanish Clinical Narrative\",\"authors\":\"Alejandro Ruiz-de-laCuadra, J. L. L. Cuadrado, I. González-Carrasco, B. Ruíz-Mezcua\",\"doi\":\"10.1109/CBMS.2019.00074\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Time expression recognition is one of the open issues in Natural Language Processing. These expressions are relevant to determine temporal aspects of the text as well as to establish relationships among facts described in said text. In the clinical domain, the temporal aspects are relevant to determine, for example, a sequence of facts in a clinical history. This paper presents research on the recognition of time expressions in Spanish according to the TIMEX3 standard. First, we establish HeidelTime, a well-known state of the art rule-based system, as a reference. Next, a hybrid model (a combination of bidirectional LSTM, CNN and CRF) is introduced to try to improve the results for the Spanish language. Both architectures have been tested with a Timex3 annotated Spanish corpus (TIMEBANK 1.0) to compare them. First, the results obtained show that the neural architecture obtains better results in Spanish. Finally, the neural architecture has been tested on a corpus of Clinical Notes (English and Spanish) in order to determine the results on this domain.\",\"PeriodicalId\":311634,\"journal\":{\"name\":\"2019 IEEE 32nd International Symposium on Computer-Based Medical Systems (CBMS)\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE 32nd International Symposium on Computer-Based Medical Systems (CBMS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CBMS.2019.00074\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 32nd International Symposium on Computer-Based Medical Systems (CBMS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CBMS.2019.00074","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Sequence Time Expression Recognition in the Spanish Clinical Narrative
Time expression recognition is one of the open issues in Natural Language Processing. These expressions are relevant to determine temporal aspects of the text as well as to establish relationships among facts described in said text. In the clinical domain, the temporal aspects are relevant to determine, for example, a sequence of facts in a clinical history. This paper presents research on the recognition of time expressions in Spanish according to the TIMEX3 standard. First, we establish HeidelTime, a well-known state of the art rule-based system, as a reference. Next, a hybrid model (a combination of bidirectional LSTM, CNN and CRF) is introduced to try to improve the results for the Spanish language. Both architectures have been tested with a Timex3 annotated Spanish corpus (TIMEBANK 1.0) to compare them. First, the results obtained show that the neural architecture obtains better results in Spanish. Finally, the neural architecture has been tested on a corpus of Clinical Notes (English and Spanish) in order to determine the results on this domain.