Filtering for medical news items using a machine learning approach.

Proceedings. AMIA Symposium Pub Date : 2002-01-01

Wanhong Zheng, Evangelos Milios, Carolyn Watters

引用次数: 0

Abstract

We address the problem of filtering medical news articles for targeted audiences. The approach is based on terms and one of the difficulties is extracting a feature set appropriate for the domain. This paper addresses the medical news-filtering problem using a machine learning approach. We describe the application of two supervised machine learning techniques, Decision Trees and Naïve Bayes, to automatically construct classifiers on the basis of a training set, in which news articles have been pre-classified by a medical expert and four other human readers. The goal is to classify the news articles into three groups: non-medical, medical intended for experts, and medical intended for other readers. While the general accuracy of the machine learning approach is around 78%, the accuracy of distinguishing non-medical articles from medical ones is shown to be 92%.

本刊更多论文

使用机器学习方法过滤医学新闻项目。

我们解决了为目标受众过滤医学新闻文章的问题。该方法基于术语，难点之一是提取适合该领域的特征集。本文使用机器学习方法解决了医学新闻过滤问题。我们描述了两种监督机器学习技术的应用，决策树和Naïve贝叶斯，在训练集的基础上自动构建分类器，其中新闻文章已经由医学专家和其他四个人类读者预分类。目标是将新闻文章分为三组:非医疗类、针对专家的医疗类和针对其他读者的医疗类。虽然机器学习方法的一般准确率约为78%，但区分非医学文章和医学文章的准确率为92%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings. AMIA Symposium

自引率

0.00%

发文量