{"title":"Computational modeling and prediction of the human immunodeficiency virus (HIV) strains","authors":"G.B. Singh","doi":"10.1109/IJSIS.1998.685423","DOIUrl":null,"url":null,"abstract":"This paper describes a stochastic approach for modeling the changes observed in the DNA sequence of a highly mutating virus, such as the human immunodeficiency virus (HIV). This modeling process is begun by clustering the known DNA sequences from the virus population into groups such that the individual clusters represent biological strains of the modeled virus. Next, a hidden Markov model (HMM) is associated with each cluster, and its parameters computed using Baum-Welch's expectation maximization procedure. In this manner, the sequences within a cluster represent a maximally likely random sample drawn from the learned HMM. After the HMM for each strain cluster has thus been learned, it can further be used to generate additional samples of viral DNA sequences that are expected from the same underlying HMM. These newly predicted sequences would represent a maximally likely set of sequences belonging to a given viral strain modeled by the underlying HMM.","PeriodicalId":289764,"journal":{"name":"Proceedings. IEEE International Joint Symposia on Intelligence and Systems (Cat. No.98EX174)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1998-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. IEEE International Joint Symposia on Intelligence and Systems (Cat. No.98EX174)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IJSIS.1998.685423","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
This paper describes a stochastic approach for modeling the changes observed in the DNA sequence of a highly mutating virus, such as the human immunodeficiency virus (HIV). This modeling process is begun by clustering the known DNA sequences from the virus population into groups such that the individual clusters represent biological strains of the modeled virus. Next, a hidden Markov model (HMM) is associated with each cluster, and its parameters computed using Baum-Welch's expectation maximization procedure. In this manner, the sequences within a cluster represent a maximally likely random sample drawn from the learned HMM. After the HMM for each strain cluster has thus been learned, it can further be used to generate additional samples of viral DNA sequences that are expected from the same underlying HMM. These newly predicted sequences would represent a maximally likely set of sequences belonging to a given viral strain modeled by the underlying HMM.