{"title":"Geometric Feature of DNA Sequences","authors":"Hongjie Xu","doi":"10.2174/0118722121271190230928072933","DOIUrl":null,"url":null,"abstract":"Background:: The primary goal of molecular phylogenetics is to characterize the similarity/ dissimilarity of DNA sequences. Existing sequence comparison methods with some patented are mostly alignment-based and remain computationally arduous. background: The primary goal of molecular phylogenetics is to characterize similarity/dissimilarity of DNA sequences. Existing sequence comparison methods are mostly alignment-based and remain computationally arduous. Objective:: In this study, we propose a novel alignment-free approach based on a previous DNA curve representation without degeneracy. Method:: The method combines two important geometric elements that describe the global and local features of the curve, respectively. It allows us to use a 24-dimensional vector called a characterization vector to numerically characterize a DNA sequence. We then measure the dissimilarity/ similarity of various DNA sequences by the Euclidean distances between their characterization vectors. Results:: we compare our approach with other existing algorithms on 4 data sets including COVID-19, and find that our apporach can produce consistent results and is faster than the alignment-based methods. Conclusion:: The method stated in this study, can assist in analyzing biological molecular sequences efficiently and will be helpful to molecular biologists. other: none","PeriodicalId":40022,"journal":{"name":"Recent Patents on Engineering","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Recent Patents on Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2174/0118722121271190230928072933","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Engineering","Score":null,"Total":0}
引用次数: 0
Abstract
Background:: The primary goal of molecular phylogenetics is to characterize the similarity/ dissimilarity of DNA sequences. Existing sequence comparison methods with some patented are mostly alignment-based and remain computationally arduous. background: The primary goal of molecular phylogenetics is to characterize similarity/dissimilarity of DNA sequences. Existing sequence comparison methods are mostly alignment-based and remain computationally arduous. Objective:: In this study, we propose a novel alignment-free approach based on a previous DNA curve representation without degeneracy. Method:: The method combines two important geometric elements that describe the global and local features of the curve, respectively. It allows us to use a 24-dimensional vector called a characterization vector to numerically characterize a DNA sequence. We then measure the dissimilarity/ similarity of various DNA sequences by the Euclidean distances between their characterization vectors. Results:: we compare our approach with other existing algorithms on 4 data sets including COVID-19, and find that our apporach can produce consistent results and is faster than the alignment-based methods. Conclusion:: The method stated in this study, can assist in analyzing biological molecular sequences efficiently and will be helpful to molecular biologists. other: none
期刊介绍:
Recent Patents on Engineering publishes review articles by experts on recent patents in the major fields of engineering. A selection of important and recent patents on engineering is also included in the journal. The journal is essential reading for all researchers involved in engineering sciences.