CNN for human exons and introns classification

FE Nasr, A. Oueslati
{"title":"CNN for human exons and introns classification","authors":"FE Nasr, A. Oueslati","doi":"10.1109/SSD52085.2021.9429303","DOIUrl":null,"url":null,"abstract":"Modeling properties and functions associated with DNA sequences is a very complex task in genomics. This task is more and more difficult when it concerns the human genome. This genome is composed with coding and non-coding regions which are not yet fully identified. Difficulties seen in this context are particularly related to the fact that 98% of the human genome is made up of non-coding zones. Therefore it seems evident that a powerful predictive model can have a huge advantage in advancing the exploration of the human genome. In this paper, we started from text representation of the DNA sequence to the representation of exons and introns by images to classify them. Here we are introducing a convolutional neural network model to classify human exons and introns. Our model has shown very good results concerning classification learning and testing rate which exceed 90%.","PeriodicalId":6799,"journal":{"name":"2021 18th International Multi-Conference on Systems, Signals & Devices (SSD)","volume":"3527 1","pages":"249-254"},"PeriodicalIF":0.0000,"publicationDate":"2021-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 18th International Multi-Conference on Systems, Signals & Devices (SSD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SSD52085.2021.9429303","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Modeling properties and functions associated with DNA sequences is a very complex task in genomics. This task is more and more difficult when it concerns the human genome. This genome is composed with coding and non-coding regions which are not yet fully identified. Difficulties seen in this context are particularly related to the fact that 98% of the human genome is made up of non-coding zones. Therefore it seems evident that a powerful predictive model can have a huge advantage in advancing the exploration of the human genome. In this paper, we started from text representation of the DNA sequence to the representation of exons and introns by images to classify them. Here we are introducing a convolutional neural network model to classify human exons and introns. Our model has shown very good results concerning classification learning and testing rate which exceed 90%.
CNN用于人类外显子和内含子分类
在基因组学中,与DNA序列相关的特性和功能建模是一项非常复杂的任务。当涉及到人类基因组时,这项任务变得越来越困难。该基因组由尚未完全鉴定的编码区和非编码区组成。在这种情况下看到的困难与98%的人类基因组由非编码区组成的事实特别相关。因此,似乎很明显,一个强大的预测模型可以在推进人类基因组的探索中具有巨大的优势。本文从DNA序列的文本表示到外显子和内含子的图像表示进行分类。在这里,我们引入了一个卷积神经网络模型来分类人类的外显子和内含子。我们的模型在分类学习和测试率方面都取得了很好的效果,测试率超过90%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信