Gergely Pap, Krisztian Adam, Zoltan Gyorgypal, Laszlo Toth, Z. Hegedus
{"title":"利用DNA的物理化学特征进行转录因子结合位点分类的深度卷积:利用深度卷积进行DNA-蛋白质分类的物理化学特征","authors":"Gergely Pap, Krisztian Adam, Zoltan Gyorgypal, Laszlo Toth, Z. Hegedus","doi":"10.1145/3571560.3571563","DOIUrl":null,"url":null,"abstract":"Classifying DNA sequences based on a nucleotide representation has enjoyed considerable success with the advancement of Deep Learning, as the proper usage and combination of different layers and architecture choices led to an increase in performance. The most common approaches rely on convolutional, recurrent and attention layer types. Moreover, the inclusion of further information in addition to the nucleotide sequence provides increases in performance, even though the methods of combining the input feature representations with distinct model structures could pose a challenge. To examine this topic, we applied depthwise separable convolutional layers to a physicochemical DNA sequence representation and train models to detect the binding sites of DNA binding proteins. While convolutional kernels learn the local feature patterns of motifs, the behaviour of the depthwise separable convolution better exploits the feature, shape and physicochemical information that could be stored in the input representation. Our models with depthwise separable convolution achieve increases in accuracy compared to the convolutional and nucleotide-based approaches on several datasets.","PeriodicalId":143909,"journal":{"name":"Proceedings of the 6th International Conference on Advances in Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Depthwise Convolutions using Physicochemical Features of DNA for Transcription Factor Binding Site Classification: Physicochemical Features for DNA-Protein Classification with Depthwise Convolutions\",\"authors\":\"Gergely Pap, Krisztian Adam, Zoltan Gyorgypal, Laszlo Toth, Z. Hegedus\",\"doi\":\"10.1145/3571560.3571563\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Classifying DNA sequences based on a nucleotide representation has enjoyed considerable success with the advancement of Deep Learning, as the proper usage and combination of different layers and architecture choices led to an increase in performance. The most common approaches rely on convolutional, recurrent and attention layer types. Moreover, the inclusion of further information in addition to the nucleotide sequence provides increases in performance, even though the methods of combining the input feature representations with distinct model structures could pose a challenge. To examine this topic, we applied depthwise separable convolutional layers to a physicochemical DNA sequence representation and train models to detect the binding sites of DNA binding proteins. While convolutional kernels learn the local feature patterns of motifs, the behaviour of the depthwise separable convolution better exploits the feature, shape and physicochemical information that could be stored in the input representation. Our models with depthwise separable convolution achieve increases in accuracy compared to the convolutional and nucleotide-based approaches on several datasets.\",\"PeriodicalId\":143909,\"journal\":{\"name\":\"Proceedings of the 6th International Conference on Advances in Artificial Intelligence\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 6th International Conference on Advances in Artificial Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3571560.3571563\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 6th International Conference on Advances in Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3571560.3571563","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Depthwise Convolutions using Physicochemical Features of DNA for Transcription Factor Binding Site Classification: Physicochemical Features for DNA-Protein Classification with Depthwise Convolutions
Classifying DNA sequences based on a nucleotide representation has enjoyed considerable success with the advancement of Deep Learning, as the proper usage and combination of different layers and architecture choices led to an increase in performance. The most common approaches rely on convolutional, recurrent and attention layer types. Moreover, the inclusion of further information in addition to the nucleotide sequence provides increases in performance, even though the methods of combining the input feature representations with distinct model structures could pose a challenge. To examine this topic, we applied depthwise separable convolutional layers to a physicochemical DNA sequence representation and train models to detect the binding sites of DNA binding proteins. While convolutional kernels learn the local feature patterns of motifs, the behaviour of the depthwise separable convolution better exploits the feature, shape and physicochemical information that could be stored in the input representation. Our models with depthwise separable convolution achieve increases in accuracy compared to the convolutional and nucleotide-based approaches on several datasets.