Tagging Funding Agencies and Grants in Scientific Articles using Sequential Learning Models

Workshop on Biomedical Natural Language Processing Pub Date : 2017-08-01 DOI:10.18653/v1/W17-2327

S. Kayal, Z. Afzal, G. Tsatsaronis, S. Katrenko, Pascal Coupet, Marius A. Doornenbal, M. Gregory

引用次数: 6

Abstract

In this paper we present a solution for tagging funding bodies and grants in scientific articles using a combination of trained sequential learning models, namely conditional random fields (CRF), hidden markov models (HMM) and maximum entropy models (MaxEnt), on a benchmark set created in-house. We apply the trained models to address the BioASQ challenge 5c, which is a newly introduced task that aims to solve the problem of funding information extraction from scientific articles. Results in the dry-run data set of BioASQ task 5c show that the suggested approach can achieve a micro-recall of more than 85% in tagging both funding bodies and grants.

查看原文本刊更多论文

在科学文章中使用顺序学习模型标记资助机构和拨款

在本文中，我们提出了一个在科学文章中标记资助机构和拨款的解决方案，该解决方案使用经过训练的顺序学习模型的组合，即条件随机场(CRF)，隐马尔可夫模型(HMM)和最大熵模型(MaxEnt)，基于内部创建的基准集。我们将训练好的模型应用于解决BioASQ挑战5c，这是一个新引入的任务，旨在解决从科学文章中提取信息的资金问题。BioASQ任务5c的干运行数据集的结果表明，所建议的方法在标记资助机构和资助方面都可以实现85%以上的微召回。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Workshop on Biomedical Natural Language Processing

自引率

0.00%

发文量