Huihai Wu, Prudence W. H. Wong, M. Caddick, Christopher Sibthorp
{"title":"利用位置依赖模型寻找DNA调控基序","authors":"Huihai Wu, Prudence W. H. Wong, M. Caddick, Christopher Sibthorp","doi":"10.12720/JOMB.2.2.103-109","DOIUrl":null,"url":null,"abstract":"We consider the problem of de novo DNA motif discovery. The position weight matrix (PWM) model has been extensively used, yet this model makes the assumption that nucleotides at different positions are independent of each other. Recent results have shown that nucleotides bound by transcription factors often exhibit adjacent or nonadjacent dependencies. We address this problem by devising positional dependency models capable of capturing adjacent dependencies and non-adjacent dependencies (SPWDM). Our algorithms are based on Gibbs sampling to update the model parameter and dependencies structure. We compare two scoring functions: -score and a conditional probability based score. We also improve several Gibbs sampling stages. Experiments are carried out on simulated and real data, showing that the SPWDM model makes improvement over pure PWM. The modifications to the Gibbs sampling algorithm are also shown to be effective. ","PeriodicalId":437476,"journal":{"name":"Journal of medical and bioengineering","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Finding DNA Regulatory Motifs with Position-dependent Models\",\"authors\":\"Huihai Wu, Prudence W. H. Wong, M. Caddick, Christopher Sibthorp\",\"doi\":\"10.12720/JOMB.2.2.103-109\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We consider the problem of de novo DNA motif discovery. The position weight matrix (PWM) model has been extensively used, yet this model makes the assumption that nucleotides at different positions are independent of each other. Recent results have shown that nucleotides bound by transcription factors often exhibit adjacent or nonadjacent dependencies. We address this problem by devising positional dependency models capable of capturing adjacent dependencies and non-adjacent dependencies (SPWDM). Our algorithms are based on Gibbs sampling to update the model parameter and dependencies structure. We compare two scoring functions: -score and a conditional probability based score. We also improve several Gibbs sampling stages. Experiments are carried out on simulated and real data, showing that the SPWDM model makes improvement over pure PWM. The modifications to the Gibbs sampling algorithm are also shown to be effective. \",\"PeriodicalId\":437476,\"journal\":{\"name\":\"Journal of medical and bioengineering\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of medical and bioengineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.12720/JOMB.2.2.103-109\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of medical and bioengineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12720/JOMB.2.2.103-109","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Finding DNA Regulatory Motifs with Position-dependent Models
We consider the problem of de novo DNA motif discovery. The position weight matrix (PWM) model has been extensively used, yet this model makes the assumption that nucleotides at different positions are independent of each other. Recent results have shown that nucleotides bound by transcription factors often exhibit adjacent or nonadjacent dependencies. We address this problem by devising positional dependency models capable of capturing adjacent dependencies and non-adjacent dependencies (SPWDM). Our algorithms are based on Gibbs sampling to update the model parameter and dependencies structure. We compare two scoring functions: -score and a conditional probability based score. We also improve several Gibbs sampling stages. Experiments are carried out on simulated and real data, showing that the SPWDM model makes improvement over pure PWM. The modifications to the Gibbs sampling algorithm are also shown to be effective.