{"title":"HilbertEPIs: Enhancer-Promoter Interactions Prediction with Hilbert Curve and CNN Model","authors":"Yujia Hu, Ruichen Peng, Chunlin Long, Min Zhu","doi":"10.1109/ICBCB52223.2021.9459226","DOIUrl":null,"url":null,"abstract":"Enhancers are DNA cis-regulatory sequences that control the transcriptional activities of many gene regulation elements. Due to enhancers always get close to promoters by complex spatial structures, accurately identifying Enhancer-Promoter Interactions will help us understand mechanisms of gene regulations, recognize specific genes associated with diseases, as well as offer help with disease diagnosis and treatment. In this article, we develop a model named HilbertEPIs to predict the interactions between enhancers and promoters. We first transfer 1D sequence into 3D picture representations with Hilbert Curve to preserve the spatial structure of this sequence. Then extract features by CNN model. Finally, using two strategies to deal with unbalanced data. Experimental results have proved that HilbertEPIs has perfect performance compared to existed methods, as well as to show that Hilbert Curve is qualified to represent spatial relationships among different genetic regulatory elements. We train model in two ways and learn from six cell lines, finally achieve the data in 0.908~0.983 of AUROC, 0.926~0.988 of AUPR.","PeriodicalId":178168,"journal":{"name":"2021 IEEE 9th International Conference on Bioinformatics and Computational Biology (ICBCB)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 9th International Conference on Bioinformatics and Computational Biology (ICBCB)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICBCB52223.2021.9459226","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Enhancers are DNA cis-regulatory sequences that control the transcriptional activities of many gene regulation elements. Due to enhancers always get close to promoters by complex spatial structures, accurately identifying Enhancer-Promoter Interactions will help us understand mechanisms of gene regulations, recognize specific genes associated with diseases, as well as offer help with disease diagnosis and treatment. In this article, we develop a model named HilbertEPIs to predict the interactions between enhancers and promoters. We first transfer 1D sequence into 3D picture representations with Hilbert Curve to preserve the spatial structure of this sequence. Then extract features by CNN model. Finally, using two strategies to deal with unbalanced data. Experimental results have proved that HilbertEPIs has perfect performance compared to existed methods, as well as to show that Hilbert Curve is qualified to represent spatial relationships among different genetic regulatory elements. We train model in two ways and learn from six cell lines, finally achieve the data in 0.908~0.983 of AUROC, 0.926~0.988 of AUPR.