Weight-Agnostic Hierarchical Stick-Breaking Process

M. Das, C. Bhattacharyya
{"title":"Weight-Agnostic Hierarchical Stick-Breaking Process","authors":"M. Das, C. Bhattacharyya","doi":"10.1109/ICBK.2018.00053","DOIUrl":null,"url":null,"abstract":"Learning from multiple groups of observations are often useful due to the advantage of sharing of statistical information. Hierarchical Bayesian models provide a natural mechanism to achieve this, and hierarchical Dirichlet processes (HDPs) have shown significant impact in this field. HDPs define a collection of probability measures one for each group. All the measures provide support on a common countably infinite set of atoms to share information. The fundamental mechanism in all the variants of HDP make the weights on these atoms positively correlated across groups. This structural limitation is impossible to resolve without changing the sharing principle. But this property hinders the applicability of HDP priors to many problems, when an atom may be highly probable in some groups despite being rare in all other groups. This becomes evident in clustering through association of atoms and observations. Some clusters may be weakly present in most of the groups in spite of being prominent in some groups and vice-versa. In this paper, we pose the problem of weight agnosticism, that of constructing a collection of probability measures on a common countably infinite set of atoms with mutually independent weights across groups. This implies that, a cluster can contain observations from all groups, but popularities of a cluster across groups are mutually independent. So the size of a cluster in a group does not interfere in the participation of observations in other groups to that cluster. Our contribution is also to construct a novel hierarchical Bayesian nonparametric prior, Weight-Agnostic hierarchical Stick-breaking process (was), which models weight agnosticism. was extends the framework of stick-breaking process (SBP) in a novel direction. However, was becomes non-exchangeable and that makes inference process non-standard. But, We derive tractable predictive probability functions for was, which is useful in deriving efficient truncation-free MCMC inference competitive with those in HDP settings. We discuss few real life applications of was in topic moeling and information retrieval. Furthermore, experimenting with five real life datasets we show that, was significantly outperforms HDP in various settings.","PeriodicalId":144958,"journal":{"name":"2018 IEEE International Conference on Big Knowledge (ICBK)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Conference on Big Knowledge (ICBK)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICBK.2018.00053","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Learning from multiple groups of observations are often useful due to the advantage of sharing of statistical information. Hierarchical Bayesian models provide a natural mechanism to achieve this, and hierarchical Dirichlet processes (HDPs) have shown significant impact in this field. HDPs define a collection of probability measures one for each group. All the measures provide support on a common countably infinite set of atoms to share information. The fundamental mechanism in all the variants of HDP make the weights on these atoms positively correlated across groups. This structural limitation is impossible to resolve without changing the sharing principle. But this property hinders the applicability of HDP priors to many problems, when an atom may be highly probable in some groups despite being rare in all other groups. This becomes evident in clustering through association of atoms and observations. Some clusters may be weakly present in most of the groups in spite of being prominent in some groups and vice-versa. In this paper, we pose the problem of weight agnosticism, that of constructing a collection of probability measures on a common countably infinite set of atoms with mutually independent weights across groups. This implies that, a cluster can contain observations from all groups, but popularities of a cluster across groups are mutually independent. So the size of a cluster in a group does not interfere in the participation of observations in other groups to that cluster. Our contribution is also to construct a novel hierarchical Bayesian nonparametric prior, Weight-Agnostic hierarchical Stick-breaking process (was), which models weight agnosticism. was extends the framework of stick-breaking process (SBP) in a novel direction. However, was becomes non-exchangeable and that makes inference process non-standard. But, We derive tractable predictive probability functions for was, which is useful in deriving efficient truncation-free MCMC inference competitive with those in HDP settings. We discuss few real life applications of was in topic moeling and information retrieval. Furthermore, experimenting with five real life datasets we show that, was significantly outperforms HDP in various settings.
权重不可知的分层断棒过程
由于共享统计信息的优势,从多组观察中学习通常是有用的。层次贝叶斯模型为实现这一目标提供了一种自然的机制,层次狄利克雷过程(hdp)在这一领域显示出显著的影响。hdp定义了一组概率度量,每组一个。所有的措施都在一个公共的可数无限原子集合上提供支持,以共享信息。所有HDP变体的基本机制使得这些原子的权重在基团之间呈正相关。如果不改变共享原则,这种结构性限制是不可能解决的。但是这种性质阻碍了HDP在许多问题之前的适用性,当一个原子在某些基团中可能是非常可能的,尽管在所有其他基团中都是罕见的。这一点在原子和观测相结合的聚类中变得很明显。有些星团可能在大多数星团中很弱,尽管在某些星团中很突出,反之亦然。在本文中,我们提出了权不可知论问题,即在群上具有相互独立权的可数无限原子集合上构造概率测度集合的问题。这意味着,集群可以包含来自所有组的观察结果,但集群在组之间的受欢迎程度是相互独立的。因此,一组中集群的大小不会影响该集群中其他组的观察参与。我们的贡献还在于构建了一种新的分层贝叶斯非参数先验,即权重不可知论的分层断棒过程(was),该过程对权重不可知论进行了建模。向一个新的方向扩展了断棒过程的框架。然而,was变得不可交换,这使得推理过程变得不标准。但是,我们为was导出了易于处理的预测概率函数,这有助于推导出与HDP设置相竞争的有效的无截断MCMC推理。我们讨论了在主题建模和信息检索方面的一些实际应用。此外,通过对五个真实生活数据集的实验,我们发现,在各种设置下,它的性能明显优于HDP。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信