Perspectives on benchmarking foundation models for network biology

Quantitative Biology Pub Date : 2024-07-11 DOI:10.1002/qub2.68

Christina V. Theodoris

引用次数: 1

Abstract

Transfer learning has revolutionized fields including natural language understanding and computer vision by leveraging large‐scale general datasets to pretrain models with foundational knowledge that can then be transferred to improve predictions in a vast range of downstream tasks. More recently, there has been a growth in the adoption of transfer learning approaches in biological fields, where models have been pretrained on massive amounts of biological data and employed to make predictions in a broad range of biological applications. However, unlike in natural language where humans are best suited to evaluate models given a clear understanding of the ground truth, biology presents the unique challenge of being in a setting where there are a plethora of unknowns while at the same time needing to abide by real‐world physical constraints. This perspective provides a discussion of some key points we should consider as a field in designing benchmarks for foundation models in network biology.

查看原文本刊更多论文

网络生物学基础模型基准的视角

迁移学习利用大规模通用数据集对具有基础知识的模型进行预训练，然后将这些知识迁移到大量下游任务中以提高预测能力，从而给自然语言理解和计算机视觉等领域带来了革命性的变化。最近，生物领域采用迁移学习方法的情况越来越多，在大量生物数据上对模型进行预训练，并在广泛的生物应用中进行预测。然而，与自然语言不同的是，在自然语言中，人类最适合在清楚了解基本事实的情况下对模型进行评估，而生物学则面临着独特的挑战，即在存在大量未知因素的环境中，同时还需要遵守现实世界的物理限制。本视角讨论了我们在为网络生物学基础模型设计基准时应考虑的一些要点。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Quantitative Biology

自引率

0.00%

发文量