{"title":"Deep recurrent neural network with fractional addax optimization algorithm for influenza virus host prediction","authors":"Shweta Ashish Koparde , Sonali Kothari , Sharad Adsure , Kapil Netaji Vhatkar , Vinod V. Kimbahune","doi":"10.1016/j.mex.2025.103319","DOIUrl":null,"url":null,"abstract":"<div><div>The accurate prediction of the host of influenza viruses is a significant challenge in bioinformatics, as it is crucial for understanding viral transmission dynamics and host-virus interactions. This research<ul><li><span>•</span><span><div>Introduces a novel approach for predicting the host of influenza viruses by leveraging protein sequences.</div></span></li><li><span>•</span><span><div>Extraction of features, including sequence length, Amino Acid Composition (AAC), Dipeptide Composition (DPC), Tripeptide Composition (TPC), aromaticity, secondary structure fraction, and entropy from protein sequence.</div></span></li><li><span>•</span><span><div>Addresses the data imbalance and improves model generalization, the oversampling technique is applied for data augmentation.</div></span></li></ul></div><div>The prediction model employs a Deep Recurrent Neural Network (DRNN) optimized by Fractional Addax Optimization 34 Algorithm (FAOA), a hybrid of Addax Optimization Algorithm (AOA) and Fractional Concept (FC), designed to perform 35 influenza virus host prediction. The model's performance is evaluated using metrics, such as Matthews's Correlation 36 Coefficient (MCC), F1-Score, and Mean Squared Error (MSE). Experimental results demonstrate that the DRNN_FAOA 37 model significantly outperforms existing methods, achieving the highest MCC of 0.937, F1-Score of 0.917, and the 38 lowest MSE of 0.038. The proposed DRNN_FAOA model's ability to accurately predict influenza virus hosts suggests its 39 potential as a robust model in virus-host prediction.</div></div>","PeriodicalId":18446,"journal":{"name":"MethodsX","volume":"14 ","pages":"Article 103319"},"PeriodicalIF":1.6000,"publicationDate":"2025-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"MethodsX","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2215016125001657","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
The accurate prediction of the host of influenza viruses is a significant challenge in bioinformatics, as it is crucial for understanding viral transmission dynamics and host-virus interactions. This research
•
Introduces a novel approach for predicting the host of influenza viruses by leveraging protein sequences.
•
Extraction of features, including sequence length, Amino Acid Composition (AAC), Dipeptide Composition (DPC), Tripeptide Composition (TPC), aromaticity, secondary structure fraction, and entropy from protein sequence.
•
Addresses the data imbalance and improves model generalization, the oversampling technique is applied for data augmentation.
The prediction model employs a Deep Recurrent Neural Network (DRNN) optimized by Fractional Addax Optimization 34 Algorithm (FAOA), a hybrid of Addax Optimization Algorithm (AOA) and Fractional Concept (FC), designed to perform 35 influenza virus host prediction. The model's performance is evaluated using metrics, such as Matthews's Correlation 36 Coefficient (MCC), F1-Score, and Mean Squared Error (MSE). Experimental results demonstrate that the DRNN_FAOA 37 model significantly outperforms existing methods, achieving the highest MCC of 0.937, F1-Score of 0.917, and the 38 lowest MSE of 0.038. The proposed DRNN_FAOA model's ability to accurately predict influenza virus hosts suggests its 39 potential as a robust model in virus-host prediction.