基于贝叶斯原型的神经网络学习

Conference Record Southcon Pub Date : 1994-03-29 DOI:10.1109/SOUTHC.1994.498076

P. Myllymaki, H. Tirri

{"title":"基于贝叶斯原型的神经网络学习","authors":"P. Myllymaki, H. Tirri","doi":"10.1109/SOUTHC.1994.498076","DOIUrl":null,"url":null,"abstract":"Given a set of samples of a probability distribution on a set of discrete random variables, we study the problem of constructing a good approximate neural network model of the underlying probability distribution. Our approach is based on an unsupervised learning scheme where the samples are first divided into separate clusters, and each cluster is then coded as a single vector. These Bayesian prototype vectors consist of conditional probabilities representing the attribute-value distribution inside the corresponding cluster. Using these prototype vectors, it is possible to model the underlying joint probability distribution as a simple Bayesian network (a tree), which can be realized as a feedforward neural network capable of probabilistic reasoning. We describe how the prototypes can be determined, given a partition of the samples, and present a method for evaluating the likelihood of the corresponding Bayesian tree. We also present a greedy heuristic for searching through the space of different partition schemes with different numbers of clusters, aiming at an optimal approximation of the probability distribution.","PeriodicalId":164672,"journal":{"name":"Conference Record Southcon","volume":"75 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1994-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Learning in neural networks with Bayesian prototypes\",\"authors\":\"P. Myllymaki, H. Tirri\",\"doi\":\"10.1109/SOUTHC.1994.498076\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Given a set of samples of a probability distribution on a set of discrete random variables, we study the problem of constructing a good approximate neural network model of the underlying probability distribution. Our approach is based on an unsupervised learning scheme where the samples are first divided into separate clusters, and each cluster is then coded as a single vector. These Bayesian prototype vectors consist of conditional probabilities representing the attribute-value distribution inside the corresponding cluster. Using these prototype vectors, it is possible to model the underlying joint probability distribution as a simple Bayesian network (a tree), which can be realized as a feedforward neural network capable of probabilistic reasoning. We describe how the prototypes can be determined, given a partition of the samples, and present a method for evaluating the likelihood of the corresponding Bayesian tree. We also present a greedy heuristic for searching through the space of different partition schemes with different numbers of clusters, aiming at an optimal approximation of the probability distribution.\",\"PeriodicalId\":164672,\"journal\":{\"name\":\"Conference Record Southcon\",\"volume\":\"75 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1994-03-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Conference Record Southcon\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SOUTHC.1994.498076\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference Record Southcon","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SOUTHC.1994.498076","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

摘要

给定一组样本的概率分布在一组离散随机变量上，我们研究了构造一个良好的近似神经网络的概率分布模型的问题。我们的方法基于无监督学习方案，首先将样本分成单独的簇，然后将每个簇编码为单个向量。这些贝叶斯原型向量由表示相应簇内属性值分布的条件概率组成。使用这些原型向量，可以将底层的联合概率分布建模为一个简单的贝叶斯网络(树)，这可以实现为具有概率推理能力的前馈神经网络。我们描述了如何确定原型，给出了样本的分区，并提出了一种评估相应贝叶斯树的可能性的方法。我们还提出了一种贪心启发式算法，用于搜索具有不同簇数的不同分区方案的空间，以获得概率分布的最优逼近。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Learning in neural networks with Bayesian prototypes

Given a set of samples of a probability distribution on a set of discrete random variables, we study the problem of constructing a good approximate neural network model of the underlying probability distribution. Our approach is based on an unsupervised learning scheme where the samples are first divided into separate clusters, and each cluster is then coded as a single vector. These Bayesian prototype vectors consist of conditional probabilities representing the attribute-value distribution inside the corresponding cluster. Using these prototype vectors, it is possible to model the underlying joint probability distribution as a simple Bayesian network (a tree), which can be realized as a feedforward neural network capable of probabilistic reasoning. We describe how the prototypes can be determined, given a partition of the samples, and present a method for evaluating the likelihood of the corresponding Bayesian tree. We also present a greedy heuristic for searching through the space of different partition schemes with different numbers of clusters, aiming at an optimal approximation of the probability distribution.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Conference Record Southcon

自引率

0.00%

发文量