A Systematic Approach to Configuring MetaMap for Optimal Performance.

IF 1.3 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS
Xia Jing, Akash Indani, Nina Hubig, Hua Min, Yang Gong, James J Cimino, Dean F Sittig, Lior Rennert, David Robinson, Paul Biondich, Adam Wright, Christian Nøhr, Timothy Law, Arild Faxvaag, Ronald Gimbel
{"title":"A Systematic Approach to Configuring MetaMap for Optimal Performance.","authors":"Xia Jing,&nbsp;Akash Indani,&nbsp;Nina Hubig,&nbsp;Hua Min,&nbsp;Yang Gong,&nbsp;James J Cimino,&nbsp;Dean F Sittig,&nbsp;Lior Rennert,&nbsp;David Robinson,&nbsp;Paul Biondich,&nbsp;Adam Wright,&nbsp;Christian Nøhr,&nbsp;Timothy Law,&nbsp;Arild Faxvaag,&nbsp;Ronald Gimbel","doi":"10.1055/a-1862-0421","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>MetaMap is a valuable tool for processing biomedical texts to identify concepts. Although MetaMap is highly configurative, configuration decisions are not straightforward.</p><p><strong>Objective: </strong>To develop a systematic, data-driven methodology for configuring MetaMap for optimal performance.</p><p><strong>Methods: </strong>MetaMap, the word2vec model, and the phrase model were used to build a pipeline. For unsupervised training, the phrase and word2vec models used abstracts related to clinical decision support as input. During testing, MetaMap was configured with the default option, one behavior option, and two behavior options. For each configuration, cosine and soft cosine similarity scores between identified entities and gold-standard terms were computed for 40 annotated abstracts (422 sentences). The similarity scores were used to calculate and compare the overall percentages of exact matches, similar matches, and missing gold-standard terms among the abstracts for each configuration. The results were manually spot-checked. The precision, recall, and F-measure (<i>β</i> =1) were calculated.</p><p><strong>Results: </strong>The percentages of exact matches and missing gold-standard terms were 0.6-0.79 and 0.09-0.3 for one behavior option, and 0.56-0.8 and 0.09-0.3 for two behavior options, respectively. The percentages of exact matches and missing terms for soft cosine similarity scores exceeded those for cosine similarity scores. The average precision, recall, and F-measure were 0.59, 0.82, and 0.68 for exact matches, and 1.00, 0.53, and 0.69 for missing terms, respectively.</p><p><strong>Conclusion: </strong>We demonstrated a systematic approach that provides objective and accurate evidence guiding MetaMap configurations for optimizing performance. Combining objective evidence and the current practice of using principles, experience, and intuitions outperforms a single strategy in MetaMap configurations. Our methodology, reference codes, measurements, results, and workflow are valuable references for optimizing and configuring MetaMap.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"61 S 02","pages":"e51-e63"},"PeriodicalIF":1.3000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/95/91/10-1055-a-1862-0421.PMC9788913.pdf","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Methods of Information in Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1055/a-1862-0421","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 1

Abstract

Background: MetaMap is a valuable tool for processing biomedical texts to identify concepts. Although MetaMap is highly configurative, configuration decisions are not straightforward.

Objective: To develop a systematic, data-driven methodology for configuring MetaMap for optimal performance.

Methods: MetaMap, the word2vec model, and the phrase model were used to build a pipeline. For unsupervised training, the phrase and word2vec models used abstracts related to clinical decision support as input. During testing, MetaMap was configured with the default option, one behavior option, and two behavior options. For each configuration, cosine and soft cosine similarity scores between identified entities and gold-standard terms were computed for 40 annotated abstracts (422 sentences). The similarity scores were used to calculate and compare the overall percentages of exact matches, similar matches, and missing gold-standard terms among the abstracts for each configuration. The results were manually spot-checked. The precision, recall, and F-measure (β =1) were calculated.

Results: The percentages of exact matches and missing gold-standard terms were 0.6-0.79 and 0.09-0.3 for one behavior option, and 0.56-0.8 and 0.09-0.3 for two behavior options, respectively. The percentages of exact matches and missing terms for soft cosine similarity scores exceeded those for cosine similarity scores. The average precision, recall, and F-measure were 0.59, 0.82, and 0.68 for exact matches, and 1.00, 0.53, and 0.69 for missing terms, respectively.

Conclusion: We demonstrated a systematic approach that provides objective and accurate evidence guiding MetaMap configurations for optimizing performance. Combining objective evidence and the current practice of using principles, experience, and intuitions outperforms a single strategy in MetaMap configurations. Our methodology, reference codes, measurements, results, and workflow are valuable references for optimizing and configuring MetaMap.

Abstract Image

Abstract Image

Abstract Image

配置元地图以获得最佳性能的系统方法。
背景:MetaMap是处理生物医学文本以识别概念的宝贵工具。尽管MetaMap是高度可配置的,但配置决策并不简单。目的:开发一种系统的、数据驱动的方法来配置元地图以获得最佳性能。方法:采用MetaMap、word2vec模型和短语模型构建管道。对于无监督训练,短语和word2vec模型使用与临床决策支持相关的摘要作为输入。在测试期间,MetaMap配置了默认选项、一个行为选项和两个行为选项。对于每种配置,计算了40个注释摘要(422个句子)的识别实体与金标准术语之间的余弦和软余弦相似度得分。相似性分数用于计算和比较每个配置的摘要中精确匹配、相似匹配和缺失金标准术语的总体百分比。结果是手工抽查的。计算精密度、召回率和f测量值(β =1)。结果:一个行为选项的精确匹配和缺失金标准项的百分比分别为0.6-0.79和0.09-0.3,两个行为选项的精确匹配和缺失金标准项的百分比分别为0.56-0.8和0.09-0.3。软余弦相似度分数的精确匹配和缺失项的百分比超过了余弦相似度分数。准确匹配的平均精密度、召回率和F-measure分别为0.59、0.82和0.68,缺失项分别为1.00、0.53和0.69。结论:我们展示了一种系统的方法,提供了客观和准确的证据来指导MetaMap配置以优化性能。将客观证据与使用原则、经验和直觉的当前实践相结合,在MetaMap配置中优于单一策略。我们的方法、参考代码、测量、结果和工作流程是优化和配置MetaMap的宝贵参考。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Methods of Information in Medicine
Methods of Information in Medicine 医学-计算机:信息系统
CiteScore
3.70
自引率
11.80%
发文量
33
审稿时长
6-12 weeks
期刊介绍: Good medicine and good healthcare demand good information. Since the journal''s founding in 1962, Methods of Information in Medicine has stressed the methodology and scientific fundamentals of organizing, representing and analyzing data, information and knowledge in biomedicine and health care. Covering publications in the fields of biomedical and health informatics, medical biometry, and epidemiology, the journal publishes original papers, reviews, reports, opinion papers, editorials, and letters to the editor. From time to time, the journal publishes articles on particular focus themes as part of a journal''s issue.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信