生物医学文本交互式跨度推荐

Clinical Natural Language Processing Workshop Pub Date : 1900-01-01 DOI:10.18653/v1/2023.clinicalnlp-1.40

Louis Blankemeier, Theodore Zhao, Robert Tinn, Sid Kiblawi, Yu Gu, Akshay Chaudhari, Hoifung Poon, Sheng Zhang, Mu-Hsin Wei, J. Preston

{"title":"生物医学文本交互式跨度推荐","authors":"Louis Blankemeier, Theodore Zhao, Robert Tinn, Sid Kiblawi, Yu Gu, Akshay Chaudhari, Hoifung Poon, Sheng Zhang, Mu-Hsin Wei, J. Preston","doi":"10.18653/v1/2023.clinicalnlp-1.40","DOIUrl":null,"url":null,"abstract":"Motivated by the scarcity of high-quality labeled biomedical text, as well as the success of data programming, we introduce KRISS-Search. By leveraging the Unified Medical Language Systems (UMLS) ontology, KRISS-Search addresses an interactive few-shot span recommendation task that we propose. We first introduce unsupervised KRISS-Search and show that our method outperforms existing methods in identifying spans that are semantically similar to a given span of interest, with >50% AUPRC improvement relative to PubMedBERT. We then introduce supervised KRISS-Search, which leverages human interaction to improve the notion of similarity used by unsupervised KRISS-Search. Through simulated human feedback, we demonstrate an enhanced F1 score of 0.68 in classifying spans as semantically similar or different in the low-label setting, outperforming PubMedBERT by 2 F1 points. Finally, supervised KRISS-Search demonstrates competitive or superior performance compared to PubMedBERT in few-shot biomedical named entity recognition (NER) across five benchmark datasets, with an average improvement of 5.6 F1 points. We envision KRISS-Search increasing the efficiency of programmatic data labeling and also providing broader utility as an interactive biomedical search engine.","PeriodicalId":216954,"journal":{"name":"Clinical Natural Language Processing Workshop","volume":"72 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Interactive Span Recommendation for Biomedical Text\",\"authors\":\"Louis Blankemeier, Theodore Zhao, Robert Tinn, Sid Kiblawi, Yu Gu, Akshay Chaudhari, Hoifung Poon, Sheng Zhang, Mu-Hsin Wei, J. Preston\",\"doi\":\"10.18653/v1/2023.clinicalnlp-1.40\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Motivated by the scarcity of high-quality labeled biomedical text, as well as the success of data programming, we introduce KRISS-Search. By leveraging the Unified Medical Language Systems (UMLS) ontology, KRISS-Search addresses an interactive few-shot span recommendation task that we propose. We first introduce unsupervised KRISS-Search and show that our method outperforms existing methods in identifying spans that are semantically similar to a given span of interest, with >50% AUPRC improvement relative to PubMedBERT. We then introduce supervised KRISS-Search, which leverages human interaction to improve the notion of similarity used by unsupervised KRISS-Search. Through simulated human feedback, we demonstrate an enhanced F1 score of 0.68 in classifying spans as semantically similar or different in the low-label setting, outperforming PubMedBERT by 2 F1 points. Finally, supervised KRISS-Search demonstrates competitive or superior performance compared to PubMedBERT in few-shot biomedical named entity recognition (NER) across five benchmark datasets, with an average improvement of 5.6 F1 points. We envision KRISS-Search increasing the efficiency of programmatic data labeling and also providing broader utility as an interactive biomedical search engine.\",\"PeriodicalId\":216954,\"journal\":{\"name\":\"Clinical Natural Language Processing Workshop\",\"volume\":\"72 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Clinical Natural Language Processing Workshop\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.18653/v1/2023.clinicalnlp-1.40\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical Natural Language Processing Workshop","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18653/v1/2023.clinicalnlp-1.40","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

由于高质量标记生物医学文本的稀缺性，以及数据编程的成功，我们引入了KRISS-Search。通过利用统一医学语言系统(UMLS)本体，KRISS-Search解决了我们提出的交互式短时间推荐任务。我们首先引入了无监督的KRISS-Search，并表明我们的方法在识别与给定兴趣跨度语义相似的跨度方面优于现有方法，相对于PubMedBERT, AUPRC提高了>50%。然后，我们引入了监督式KRISS-Search，它利用人类交互来改进无监督式KRISS-Search所使用的相似性概念。通过模拟人类反馈，我们证明了在低标签设置下，在将跨分类为语义相似或不同方面的F1得分提高了0.68，比PubMedBERT高出2个F1分。最后，在5个基准数据集上，与PubMedBERT相比，有监督的KRISS-Search在少量生物医学命名实体识别(NER)方面表现出了竞争或优越的性能，平均提高了5.6 F1分。我们设想KRISS-Search可以提高程序化数据标记的效率，并作为交互式生物医学搜索引擎提供更广泛的效用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Interactive Span Recommendation for Biomedical Text

Motivated by the scarcity of high-quality labeled biomedical text, as well as the success of data programming, we introduce KRISS-Search. By leveraging the Unified Medical Language Systems (UMLS) ontology, KRISS-Search addresses an interactive few-shot span recommendation task that we propose. We first introduce unsupervised KRISS-Search and show that our method outperforms existing methods in identifying spans that are semantically similar to a given span of interest, with >50% AUPRC improvement relative to PubMedBERT. We then introduce supervised KRISS-Search, which leverages human interaction to improve the notion of similarity used by unsupervised KRISS-Search. Through simulated human feedback, we demonstrate an enhanced F1 score of 0.68 in classifying spans as semantically similar or different in the low-label setting, outperforming PubMedBERT by 2 F1 points. Finally, supervised KRISS-Search demonstrates competitive or superior performance compared to PubMedBERT in few-shot biomedical named entity recognition (NER) across five benchmark datasets, with an average improvement of 5.6 F1 points. We envision KRISS-Search increasing the efficiency of programmatic data labeling and also providing broader utility as an interactive biomedical search engine.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Clinical Natural Language Processing Workshop

自引率

0.00%

发文量