{"title":"Principal Component Analysis Provides Insights on the Evolutionary Divergence of Conotoxins","authors":"Akira Kikuchi, L. Tayo","doi":"10.1109/icbcb55259.2022.9802130","DOIUrl":null,"url":null,"abstract":"Conotoxins are fast-evolving and highly diverse small, structured peptides found in venoms of predatory marine gastropods of the genus Conus. They are diverse yet exhibit superior stability, potency, and selectivity for various target receptors and subtypes; hence, they were proven valuable probes for drug discovery studies and models for protein evolution. The data structure of conotoxin peptide sequences in each gene superfamily was analyzed. Feature extraction showed diet- and geographical region-specific gene superfamilies. Using fast-Fourier transform (FFT) and principal component analysis (PCA), data structures indicated patterns of sequence similarity in conopeptides isolated in the same diet type. Clusters based on diet are prominent in the PCA plots and dendrograms. PCA clustering, node grouping, and feature extraction suggest the increasing role of dietary preference, providing insights on the driving force that pushed conotoxins to diverge and diversify.","PeriodicalId":429633,"journal":{"name":"2022 10th International Conference on Bioinformatics and Computational Biology (ICBCB)","volume":"722 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 10th International Conference on Bioinformatics and Computational Biology (ICBCB)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icbcb55259.2022.9802130","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Conotoxins are fast-evolving and highly diverse small, structured peptides found in venoms of predatory marine gastropods of the genus Conus. They are diverse yet exhibit superior stability, potency, and selectivity for various target receptors and subtypes; hence, they were proven valuable probes for drug discovery studies and models for protein evolution. The data structure of conotoxin peptide sequences in each gene superfamily was analyzed. Feature extraction showed diet- and geographical region-specific gene superfamilies. Using fast-Fourier transform (FFT) and principal component analysis (PCA), data structures indicated patterns of sequence similarity in conopeptides isolated in the same diet type. Clusters based on diet are prominent in the PCA plots and dendrograms. PCA clustering, node grouping, and feature extraction suggest the increasing role of dietary preference, providing insights on the driving force that pushed conotoxins to diverge and diversify.