Using background knowledge to improve inductive learning of DNA sequences

Proceedings of the Tenth Conference on Artificial Intelligence for Applications Pub Date : 1994-03-01 DOI:10.1109/CAIA.1994.323654

H. Hirsh, M. Noordewier

Citations: 52

Abstract

Successful inductive learning requires that training data be expressed in a form where underlying regularities can be recognized by the learning system. Unfortunately, many applications of inductive learning, especially in the domain of molecular biology, have assumed that data are provided in a form already suitable for learning, whether or not such an assumption is actually justified. This paper describes the use of background knowledge of molecular biology to re-express data into a form more appropriate for learning. Our results show dramatic improvements in classification accuracy for two very different classes of DNA sequences using traditional "off-the-shelf" decision-tree and neural-network inductive-learning methods.
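As a rough illustration of what re-expressing sequence data can mean, the sketch below contrasts a raw per-position encoding with a handful of derived attributes. It is a minimal sketch under stated assumptions, not the authors' method: the function names, the GC-content attribute, and the motif list (`TATA`, `CAAT`) are hypothetical placeholders chosen for illustration, and the abstract does not say which features the paper actually uses.

```python
def raw_features(seq):
    """Raw encoding: one categorical attribute per sequence position."""
    return {f"pos{i}": base for i, base in enumerate(seq.upper())}


def reexpressed_features(seq, motifs=("TATA", "CAAT")):
    """Derived encoding built from (assumed) background knowledge.

    The attributes here are illustrative only; they are not taken
    from the paper.
    """
    seq = seq.upper()
    # Fraction of G and C bases in the sequence (0.0 for an empty sequence).
    gc = (seq.count("G") + seq.count("C")) / len(seq) if seq else 0.0
    feats = {"gc_content": gc}
    # One boolean attribute per motif: does it occur anywhere in the sequence?
    for motif in motifs:
        feats[f"has_{motif}"] = motif in seq
    return feats
```

Either dictionary could then be passed to an off-the-shelf learner. The design point is that the derived attributes do not depend on the exact position of a motif, a regularity that a per-position encoding may hide from the learning system.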


