{"title":"利用以最佳基因水平和突变水平特征训练的改进型循环神经网络识别和分离基因。","authors":"Irfan Rashid Pukhta, Ranjeet Kumar Rout","doi":"10.1080/10255842.2024.2311322","DOIUrl":null,"url":null,"abstract":"<p><p>Even though many different approaches have been employed to address the complex mutational heterogeneity of cancer, finding driver genes is still problematic since other genomic factors cannot be fully integrated for combined analyses. This research paper presents a novel gene identification and segregation model with five key processes (a) pre-processing, (b) treatment of class imbalances, (c) feature extraction, (d) feature selection, and (e) gene classification. To increase the data quality, the gathered initial information is first pre-processed utilizing data cleaning and data normalization. This turns the raw data into something that is both useful and effective. In actuality, the sample is skewed against drivers because passenger mutation markers appear in proportionally less instances than drivers do. To address the Class Imbalance Problem, improved K-Means + SMOTE are applied to the preprocessed data. The most crucial characteristics, including those at the gene and mutation levels, are then extracted from the balanced dataset. To lessen the computational load in terms of time, the best features from the retrieved features are selected using Forensic interpretation tailored hunger food search optimization (FIHFSO). The ideal features are used to train the deep learning classifier that conducts the separation procedure. In this research, an Improved Recurrent Neural Network (I-RNN) is used to make a final decision about genes. At 90% of learning percentage, the accuracy of the proposed method achieves 0.98% of 0.83, 0.81, 0.65, 0.80, 0.92 and 0.63% which is compared to the other methods like HGS, FBIO, AOA, AO, GOA and PRO respectively.</p>","PeriodicalId":50640,"journal":{"name":"Computer Methods in Biomechanics and Biomedical Engineering","volume":" ","pages":"1111-1126"},"PeriodicalIF":1.7000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Identification and segregation of genes with improved recurrent neural network trained with optimal gene level and mutation level features.\",\"authors\":\"Irfan Rashid Pukhta, Ranjeet Kumar Rout\",\"doi\":\"10.1080/10255842.2024.2311322\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Even though many different approaches have been employed to address the complex mutational heterogeneity of cancer, finding driver genes is still problematic since other genomic factors cannot be fully integrated for combined analyses. This research paper presents a novel gene identification and segregation model with five key processes (a) pre-processing, (b) treatment of class imbalances, (c) feature extraction, (d) feature selection, and (e) gene classification. To increase the data quality, the gathered initial information is first pre-processed utilizing data cleaning and data normalization. This turns the raw data into something that is both useful and effective. In actuality, the sample is skewed against drivers because passenger mutation markers appear in proportionally less instances than drivers do. To address the Class Imbalance Problem, improved K-Means + SMOTE are applied to the preprocessed data. The most crucial characteristics, including those at the gene and mutation levels, are then extracted from the balanced dataset. To lessen the computational load in terms of time, the best features from the retrieved features are selected using Forensic interpretation tailored hunger food search optimization (FIHFSO). The ideal features are used to train the deep learning classifier that conducts the separation procedure. In this research, an Improved Recurrent Neural Network (I-RNN) is used to make a final decision about genes. At 90% of learning percentage, the accuracy of the proposed method achieves 0.98% of 0.83, 0.81, 0.65, 0.80, 0.92 and 0.63% which is compared to the other methods like HGS, FBIO, AOA, AO, GOA and PRO respectively.</p>\",\"PeriodicalId\":50640,\"journal\":{\"name\":\"Computer Methods in Biomechanics and Biomedical Engineering\",\"volume\":\" \",\"pages\":\"1111-1126\"},\"PeriodicalIF\":1.7000,\"publicationDate\":\"2025-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer Methods in Biomechanics and Biomedical Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1080/10255842.2024.2311322\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/2/29 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Methods in Biomechanics and Biomedical Engineering","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1080/10255842.2024.2311322","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/2/29 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
Identification and segregation of genes with improved recurrent neural network trained with optimal gene level and mutation level features.
Even though many different approaches have been employed to address the complex mutational heterogeneity of cancer, finding driver genes is still problematic since other genomic factors cannot be fully integrated for combined analyses. This research paper presents a novel gene identification and segregation model with five key processes (a) pre-processing, (b) treatment of class imbalances, (c) feature extraction, (d) feature selection, and (e) gene classification. To increase the data quality, the gathered initial information is first pre-processed utilizing data cleaning and data normalization. This turns the raw data into something that is both useful and effective. In actuality, the sample is skewed against drivers because passenger mutation markers appear in proportionally less instances than drivers do. To address the Class Imbalance Problem, improved K-Means + SMOTE are applied to the preprocessed data. The most crucial characteristics, including those at the gene and mutation levels, are then extracted from the balanced dataset. To lessen the computational load in terms of time, the best features from the retrieved features are selected using Forensic interpretation tailored hunger food search optimization (FIHFSO). The ideal features are used to train the deep learning classifier that conducts the separation procedure. In this research, an Improved Recurrent Neural Network (I-RNN) is used to make a final decision about genes. At 90% of learning percentage, the accuracy of the proposed method achieves 0.98% of 0.83, 0.81, 0.65, 0.80, 0.92 and 0.63% which is compared to the other methods like HGS, FBIO, AOA, AO, GOA and PRO respectively.
期刊介绍:
The primary aims of Computer Methods in Biomechanics and Biomedical Engineering are to provide a means of communicating the advances being made in the areas of biomechanics and biomedical engineering and to stimulate interest in the continually emerging computer based technologies which are being applied in these multidisciplinary subjects. Computer Methods in Biomechanics and Biomedical Engineering will also provide a focus for the importance of integrating the disciplines of engineering with medical technology and clinical expertise. Such integration will have a major impact on health care in the future.