Nazife Çevik, Taner Çevik, Jawad Rasheed, Sachi Nandan Mohanty, Halil Ibrahim Cakar, Shtwai Alsubai
{"title":"Sequence-similarity-based approach to SARS-CoV-2 genome sequence and lung cancer-related genes via multivariate feature extraction method.","authors":"Nazife Çevik, Taner Çevik, Jawad Rasheed, Sachi Nandan Mohanty, Halil Ibrahim Cakar, Shtwai Alsubai","doi":"10.1080/10255842.2025.2530645","DOIUrl":null,"url":null,"abstract":"<p><p>The COVID-19 pandemic has prompted genomic studies linking SARS-CoV-2 and lung cancer-related genes. This study explores sequence similarity and motif patterns to assess disease susceptibility. We applied a data mining approach to compare human and SARS-CoV-2 genomes, revealing high sequence identity (0.74-0.99%) with lung cancer-related genes. Low-entropy motifs were associated with higher genetic risk. We identified shared patterns of lengths 4, 5, and 10, selecting the most significant motifs. These findings support the hypothesis that sequence similarity and conserved motifs provide insights into gene function, evolutionary processes, and the genetic links between cancer and viral infections.</p>","PeriodicalId":50640,"journal":{"name":"Computer Methods in Biomechanics and Biomedical Engineering","volume":" ","pages":"1-20"},"PeriodicalIF":1.7000,"publicationDate":"2025-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Methods in Biomechanics and Biomedical Engineering","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1080/10255842.2025.2530645","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
The COVID-19 pandemic has prompted genomic studies linking SARS-CoV-2 and lung cancer-related genes. This study explores sequence similarity and motif patterns to assess disease susceptibility. We applied a data mining approach to compare human and SARS-CoV-2 genomes, revealing high sequence identity (0.74-0.99%) with lung cancer-related genes. Low-entropy motifs were associated with higher genetic risk. We identified shared patterns of lengths 4, 5, and 10, selecting the most significant motifs. These findings support the hypothesis that sequence similarity and conserved motifs provide insights into gene function, evolutionary processes, and the genetic links between cancer and viral infections.
期刊介绍:
The primary aims of Computer Methods in Biomechanics and Biomedical Engineering are to provide a means of communicating the advances being made in the areas of biomechanics and biomedical engineering and to stimulate interest in the continually emerging computer based technologies which are being applied in these multidisciplinary subjects. Computer Methods in Biomechanics and Biomedical Engineering will also provide a focus for the importance of integrating the disciplines of engineering with medical technology and clinical expertise. Such integration will have a major impact on health care in the future.