{"title":"An Approach for Recognition of Enhancer-promoter Associations based on Random Forest","authors":"Tianjiao Zhang, Yadong Wang","doi":"10.1145/3354031.3354039","DOIUrl":null,"url":null,"abstract":"Enhancers are sequences in the genome that regulate gene expression and are usually located far from transcription start sites. Enhancers regulate gene expression by interacting with promoters. Therefore, the recognition of the association between enhancers and promoters is an important issue in the study of enhancer regulation. At present, computational methods to recognize the association between enhancers and promoters are mainly realized by designing machine learning methods based on the biological signals on the genome sequence. These recognition methods ignore evaluating the classification power of features, resulting in limited recognition performance. In this paper, the classification power of the feature signals near enhancers and promoters in the genome sequence was evaluated, and the features with strong classification power were picked up. This was conducive to improving the recognition accuracy. The correlation between enhancers and promoters was recognized by the random forest method. Compared with the five main recognition methods, the accuracy of the recognition method in this paper is higher.","PeriodicalId":286321,"journal":{"name":"Proceedings of the 4th International Conference on Biomedical Signal and Image Processing","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th International Conference on Biomedical Signal and Image Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3354031.3354039","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Enhancers are sequences in the genome that regulate gene expression and are usually located far from transcription start sites. Enhancers regulate gene expression by interacting with promoters. Therefore, the recognition of the association between enhancers and promoters is an important issue in the study of enhancer regulation. At present, computational methods to recognize the association between enhancers and promoters are mainly realized by designing machine learning methods based on the biological signals on the genome sequence. These recognition methods ignore evaluating the classification power of features, resulting in limited recognition performance. In this paper, the classification power of the feature signals near enhancers and promoters in the genome sequence was evaluated, and the features with strong classification power were picked up. This was conducive to improving the recognition accuracy. The correlation between enhancers and promoters was recognized by the random forest method. Compared with the five main recognition methods, the accuracy of the recognition method in this paper is higher.