Depthwise Convolutions using Physicochemical Features of DNA for Transcription Factor Binding Site Classification: Physicochemical Features for DNA-Protein Classification with Depthwise Convolutions
Gergely Pap, Krisztian Adam, Zoltan Gyorgypal, Laszlo Toth, Z. Hegedus
{"title":"Depthwise Convolutions using Physicochemical Features of DNA for Transcription Factor Binding Site Classification: Physicochemical Features for DNA-Protein Classification with Depthwise Convolutions","authors":"Gergely Pap, Krisztian Adam, Zoltan Gyorgypal, Laszlo Toth, Z. Hegedus","doi":"10.1145/3571560.3571563","DOIUrl":null,"url":null,"abstract":"Classifying DNA sequences based on a nucleotide representation has enjoyed considerable success with the advancement of Deep Learning, as the proper usage and combination of different layers and architecture choices led to an increase in performance. The most common approaches rely on convolutional, recurrent and attention layer types. Moreover, the inclusion of further information in addition to the nucleotide sequence provides increases in performance, even though the methods of combining the input feature representations with distinct model structures could pose a challenge. To examine this topic, we applied depthwise separable convolutional layers to a physicochemical DNA sequence representation and train models to detect the binding sites of DNA binding proteins. While convolutional kernels learn the local feature patterns of motifs, the behaviour of the depthwise separable convolution better exploits the feature, shape and physicochemical information that could be stored in the input representation. Our models with depthwise separable convolution achieve increases in accuracy compared to the convolutional and nucleotide-based approaches on several datasets.","PeriodicalId":143909,"journal":{"name":"Proceedings of the 6th International Conference on Advances in Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 6th International Conference on Advances in Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3571560.3571563","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Classifying DNA sequences based on a nucleotide representation has enjoyed considerable success with the advancement of Deep Learning, as the proper usage and combination of different layers and architecture choices led to an increase in performance. The most common approaches rely on convolutional, recurrent and attention layer types. Moreover, the inclusion of further information in addition to the nucleotide sequence provides increases in performance, even though the methods of combining the input feature representations with distinct model structures could pose a challenge. To examine this topic, we applied depthwise separable convolutional layers to a physicochemical DNA sequence representation and train models to detect the binding sites of DNA binding proteins. While convolutional kernels learn the local feature patterns of motifs, the behaviour of the depthwise separable convolution better exploits the feature, shape and physicochemical information that could be stored in the input representation. Our models with depthwise separable convolution achieve increases in accuracy compared to the convolutional and nucleotide-based approaches on several datasets.