在生物医学文献中寻找否定和不确定表达的混合方法

Kazuki Fujikawa, Kazuhiro Seki, K. Uehara
{"title":"在生物医学文献中寻找否定和不确定表达的混合方法","authors":"Kazuki Fujikawa, Kazuhiro Seki, K. Uehara","doi":"10.1145/2389672.2389685","DOIUrl":null,"url":null,"abstract":"More and more biomedical documents are digitally written and stored. To make the most of the rich resources, it is crucial to precisely locate the information pertinent to users' interests. One of the obstacles in finding information in natural language text is negations, which deny or reverse the meaning of a sentence or clause. This is especially problematic in the biomedical domain since scientific findings and clinical records often contain negated expressions to explicitly state negative effects or the absence of symptoms. Ignoring such negated expressions result in more irrelevant information and may even lead to false conclusions. Therefore, identifying negative words and their scopes are important sub-tasks in biomedical information processing. This paper reports on our ongoing work on a hybrid approach to negation identification combining statistical and heuristic approaches. Our approach is evaluated on three types of biomedical documents in comparison with an existing machine learning approach. In addition, the empirical results are manually analyzed to better understand the nature of the problems.","PeriodicalId":91363,"journal":{"name":"MIX-HS'12 : proceedings of the 2nd International Workshop on Managing Interoperability and Complexity in Health Systems October 29, 2012, Maui, Hawaii, USA. International Workshop on Managing Interoperability and Complexity in Health Sy...","volume":"58 1","pages":"67-74"},"PeriodicalIF":0.0000,"publicationDate":"2012-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"A hybrid approach to finding negated and uncertain expressions in biomedical documents\",\"authors\":\"Kazuki Fujikawa, Kazuhiro Seki, K. Uehara\",\"doi\":\"10.1145/2389672.2389685\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"More and more biomedical documents are digitally written and stored. To make the most of the rich resources, it is crucial to precisely locate the information pertinent to users' interests. One of the obstacles in finding information in natural language text is negations, which deny or reverse the meaning of a sentence or clause. This is especially problematic in the biomedical domain since scientific findings and clinical records often contain negated expressions to explicitly state negative effects or the absence of symptoms. Ignoring such negated expressions result in more irrelevant information and may even lead to false conclusions. Therefore, identifying negative words and their scopes are important sub-tasks in biomedical information processing. This paper reports on our ongoing work on a hybrid approach to negation identification combining statistical and heuristic approaches. Our approach is evaluated on three types of biomedical documents in comparison with an existing machine learning approach. In addition, the empirical results are manually analyzed to better understand the nature of the problems.\",\"PeriodicalId\":91363,\"journal\":{\"name\":\"MIX-HS'12 : proceedings of the 2nd International Workshop on Managing Interoperability and Complexity in Health Systems October 29, 2012, Maui, Hawaii, USA. International Workshop on Managing Interoperability and Complexity in Health Sy...\",\"volume\":\"58 1\",\"pages\":\"67-74\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-10-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"MIX-HS'12 : proceedings of the 2nd International Workshop on Managing Interoperability and Complexity in Health Systems October 29, 2012, Maui, Hawaii, USA. International Workshop on Managing Interoperability and Complexity in Health Sy...\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2389672.2389685\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"MIX-HS'12 : proceedings of the 2nd International Workshop on Managing Interoperability and Complexity in Health Systems October 29, 2012, Maui, Hawaii, USA. International Workshop on Managing Interoperability and Complexity in Health Sy...","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2389672.2389685","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

摘要

越来越多的生物医学文献以数字方式书写和存储。为了充分利用丰富的资源,准确定位与用户兴趣相关的信息是至关重要的。在自然语言文本中寻找信息的障碍之一是否定,即否定或颠倒句子或分句的意思。这在生物医学领域尤其成问题,因为科学发现和临床记录经常包含否定表达,以明确说明负面影响或没有症状。忽略这些否定的表达会导致更多不相关的信息,甚至可能导致错误的结论。因此,否定词及其范围识别是生物医学信息处理的重要子任务。本文报告了我们正在进行的一种结合统计和启发式方法的否定识别混合方法的工作。与现有的机器学习方法相比,我们的方法在三种类型的生物医学文档上进行了评估。此外,为了更好地理解问题的本质,还对实证结果进行了人工分析。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A hybrid approach to finding negated and uncertain expressions in biomedical documents
More and more biomedical documents are digitally written and stored. To make the most of the rich resources, it is crucial to precisely locate the information pertinent to users' interests. One of the obstacles in finding information in natural language text is negations, which deny or reverse the meaning of a sentence or clause. This is especially problematic in the biomedical domain since scientific findings and clinical records often contain negated expressions to explicitly state negative effects or the absence of symptoms. Ignoring such negated expressions result in more irrelevant information and may even lead to false conclusions. Therefore, identifying negative words and their scopes are important sub-tasks in biomedical information processing. This paper reports on our ongoing work on a hybrid approach to negation identification combining statistical and heuristic approaches. Our approach is evaluated on three types of biomedical documents in comparison with an existing machine learning approach. In addition, the empirical results are manually analyzed to better understand the nature of the problems.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信