Rohan Shawn Sunil, Shan Chun Lim, Manoj Itharajula, Marek Mutwil
{"title":"The gene function prediction challenge: large language models and knowledge graphs to the rescue","authors":"Rohan Shawn Sunil, Shan Chun Lim, Manoj Itharajula, Marek Mutwil","doi":"arxiv-2408.07222","DOIUrl":null,"url":null,"abstract":"Elucidating gene function is one of the ultimate goals of plant science.\nDespite this, only ~15% of all genes in the model plant Arabidopsis thaliana\nhave comprehensively experimentally verified functions. While bioinformatical\ngene function prediction approaches can guide biologists in their experimental\nefforts, neither the performance of the gene function prediction methods nor\nthe number of experimental characterisation of genes has increased dramatically\nin recent years. In this review, we will discuss the status quo and the\ntrajectory of gene function elucidation and outline the recent advances in gene\nfunction prediction approaches. We will then discuss how recent artificial\nintelligence advances in large language models and knowledge graphs can be\nleveraged to accelerate gene function predictions and keep us updated with\nscientific literature.","PeriodicalId":501325,"journal":{"name":"arXiv - QuanBio - Molecular Networks","volume":"21 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Molecular Networks","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.07222","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Elucidating gene function is one of the ultimate goals of plant science.
Despite this, only ~15% of all genes in the model plant Arabidopsis thaliana
have comprehensively experimentally verified functions. While bioinformatical
gene function prediction approaches can guide biologists in their experimental
efforts, neither the performance of the gene function prediction methods nor
the number of experimental characterisation of genes has increased dramatically
in recent years. In this review, we will discuss the status quo and the
trajectory of gene function elucidation and outline the recent advances in gene
function prediction approaches. We will then discuss how recent artificial
intelligence advances in large language models and knowledge graphs can be
leveraged to accelerate gene function predictions and keep us updated with
scientific literature.