Workshop on Biomedical Natural Language Processing: Latest Publications

Prediction of Protein Sub-cellular Localization using Information from Texts and Sequences
Workshop on Biomedical Natural Language Processing Pub Date : 2008-06-19 DOI: 10.3115/1572306.1572324
H. Chun, Chisato Yamasaki, Naomi Saichi, Masayuki Tanaka, T. Hishiki, T. Imanishi, T. Gojobori, Jin-Dong Kim, Junichi Tsujii, T. Takagi
Abstract: This paper presents a novel prediction approach for protein sub-cellular localization. We have incorporated text and sequence-based approaches.
Citations: 0
Knowledge Sources for Word Sense Disambiguation of Biomedical Text
Workshop on Biomedical Natural Language Processing Pub Date : 2008-06-19 DOI: 10.3115/1572306.1572321
Mark Stevenson, Yikun Guo, R. Gaizauskas, David Martínez
Abstract: Like text in other domains, biomedical documents contain a range of terms with more than one possible meaning. These ambiguities form a significant obstacle to the automatic processing of biomedical texts. Previous approaches to resolving this problem have made use of a variety of knowledge sources including linguistic information (from the context in which the ambiguous term is used) and domain-specific resources (such as UMLS). In this paper we compare a range of knowledge sources which have been previously used and introduce a novel one: MeSH terms. The best performance is obtained using linguistic features in combination with MeSH terms. Results from our system outperform published results for previously reported systems on a standard test set (the NLM-WSD corpus).
Citations: 20
A Pilot Annotation to Investigate Discourse Connectivity in Biomedical Text
Workshop on Biomedical Natural Language Processing Pub Date : 2008-06-19 DOI: 10.3115/1572306.1572325
Hong Yu, Nadya Frid, S. McRoy, R. Prasad, Alan Lee, A. Joshi
Abstract: The goal of the Penn Discourse Treebank (PDTB) project is to develop a large-scale corpus, annotated with coherence relations marked by discourse connectives. Currently, the primary application of the PDTB annotation has been to news articles. In this study, we tested whether the PDTB guidelines can be adapted to a different genre. We annotated discourse connectives and their arguments in one 4,937-token full-text biomedical article. Two linguist annotators showed an agreement of 85% after simple conventions were added. For the remaining 15% cases, we found that biomedical domain-specific knowledge is needed to capture the linguistic cues that can be used to resolve inter-annotator disagreement. We found that the two annotators were able to reach an agreement after discussion. Thus our experiments suggest that the PDTB annotation can be adapted to new domains by minimally adjusting the guidelines and by adding some further domain-specific linguistic cues.
Citations: 6
Conditional Random Fields and Support Vector Machines for Disorder Named Entity Recognition in Clinical Texts
Workshop on Biomedical Natural Language Processing Pub Date : 2008-06-19 DOI: 10.3115/1572306.1572326
Dingcheng Li, G. Savova, K. Schuler
Abstract: We present a comparative study between two machine learning methods, Conditional Random Fields and Support Vector Machines for clinical named entity recognition. We explore their applicability to clinical domain. Evaluation against a set of gold standard named entities shows that CRFs outperform SVMs. The best F-score with CRFs is 0.86 and for the SVMs is 0.64 as compared to a baseline of 0.60.
Citations: 106
CBR-Tagger: a case-based reasoning approach to the gene/protein mention problem
Workshop on Biomedical Natural Language Processing Pub Date : 2008-06-19 DOI: 10.3115/1572306.1572333
M. Neves, M. Chagoyen, J. Carazo, A. Pascual-Montano
Abstract: This work proposes a case-based classifier to tackle the gene/protein mention problem in biomedical literature. The so called gene mention problem consists of the recognition of gene and protein entities in scientific texts. A classification process aiming at deciding if a term is a gene mention or not is carried out for each word in the text. It is based on the selection of the best or most similar case in a base of known and unknown cases. The approach was evaluated on several datasets for different organisms and results show the suitability of this approach for the gene mention problem.
Citations: 10
Adaptive Information Extraction for Complex Biomedical Tasks
Workshop on Biomedical Natural Language Processing Pub Date : 2008-06-19 DOI: 10.3115/1572306.1572339
D. Feng, Gully A. Burns, E. Hovy
Abstract: Biomedical information extraction tasks are often more complex and contain uncertainty at each step during problem solving processes. We present an adaptive information extraction framework and demonstrate how to explore uncertainty using feedback integration.
Citations: 1
Extracting Clinical Relationships from Patient Narratives
Workshop on Biomedical Natural Language Processing Pub Date : 2008-06-19 DOI: 10.3115/1572306.1572309
A. Roberts, R. Gaizauskas, Mark Hepple
Abstract: The Clinical E-Science Framework (CLEF) project has built a system to extract clinically significant information from the textual component of medical records, for clinical research, evidence-based healthcare and genotype-meets-phenotype informatics. One part of this system is the identification of relationships between clinically important entities in the text. Typical approaches to relationship extraction in this domain have used full parses, domain-specific grammars, and large knowledge bases encoding domain knowledge. In other areas of biomedical NLP, statistical machine learning approaches are now routinely applied to relationship extraction. We report on the novel application of these statistical techniques to clinical relationships. We describe a supervised machine learning system, trained with a corpus of oncology narratives hand-annotated with clinically important relationships. Various shallow features are extracted from these texts, and used to train statistical classifiers. We compare the suitability of these features for clinical relationship extraction, how extraction varies between inter- and intra-sentential relationships, and examine the amount of training data needed to learn various relationships.
Citations: 66
Raising the Compatibility of Heterogeneous Annotations: A Case Study on
Workshop on Biomedical Natural Language Processing Pub Date : 2008-06-01 DOI: 10.3115/1572306.1572338
Yue Wang, Kazuhiro Yoshida, Jin-Dong Kim, Rune Saetre, Junichi Tsujii
Abstract: While there are several corpora which claim to have annotations for protein references, the heterogeneity between the annotations is recognized as an obstacle to develop expensive resources in a synergistic way. Here we present a series of experimental results which show the differences of protein mention annotations made to two corpora, GENIA and AImed.
Citations: 0
ADEQA: A Question Answer based approach for joint ADE-Suspect Extraction using Sequence-To-Sequence Transformers
Workshop on Biomedical Natural Language Processing Pub Date : 1900-01-01 DOI: 10.18653/v1/2023.bionlp-1.17
Vinayak Arannil, Tomal Deb, Atanu Roy
Abstract: Early identification of Adverse Drug Events (ADE) is critical for taking prompt actions while introducing new drugs into the market. These ADEs information are available through various unstructured data sources like clinical study reports, patient health records, social media posts, etc. Extracting ADEs and the related suspect drugs using machine learning is a challenging task due to the complex linguistic relations between drug ADE pairs in textual data and unavailability of large corpus of labelled datasets. This paper introduces ADEQA, a question-answer (QA) based approach using quasi supervised labelled data and sequence-to-sequence transformers to extract ADEs, drug suspects and the relationships between them. Unlike traditional QA models, natural language generation (NLG) based models don’t require extensive token level labelling and thereby reduces the adoption barrier significantly. On a public ADE corpus, we were able to achieve state-of-the-art results with an F1 score of 94% on establishing the relationships between ADEs and the respective suspects.
Citations: 0
Automated Preamble Detection in Dictated Medical Reports
Workshop on Biomedical Natural Language Processing Pub Date : 1900-01-01 DOI: 10.18653/v1/W17-2336
Wael Salloum, Greg P. Finley, Erik Edwards, Mark Miller, David Suendermann-Oeft
Abstract: Dictated medical reports very often feature a preamble containing metainformation about the report such as patient and physician names, location and name of the clinic, date of procedure, and so on. In the medical transcription process, the preamble is usually omitted from the final report, as it contains information already available in the electronic medical record. We present a method which is able to automatically identify preambles in medical dictations. The method makes use of state-of-the-art NLP techniques including word embeddings and Bi-LSTMs and achieves preamble detection performance superior to humans.
Citations: 6