{"title":"Entropy Estimation: Simulation, Theory and a Case Study","authors":"Ioannis Kontoyiannis","doi":"10.1109/ITW.2006.1633823","DOIUrl":null,"url":null,"abstract":"We consider the statistical problem of estimating the entropy of finite-alphabet data generated from an unknown stationary process. We examine a series of estimators, including: (1) The standard maximum-likelihood or \"plug-in\" estimator; (2) Four different estimators based on the family of Lempel-Ziv compression algorithms; (3) A different plug-in estimator especially tailored to renewal processes; and (4) The natural estimator derived from the Context-Tree Weighting method (CTW). Some of these estimators are well-known, and some are new. We first summarize numerous theoretical properties of these estimators: Conditions for consistency, estimates of their bias and variance, methods for approximating the estimation error and for obtaining confidence intervals. Several new theoretical results are developed. We show how the theory offers preliminary indications results offer guidelines for tuning the parameters involved in the estimation process. Then we present an extensive simulation study on various types of synthetic data and under various conditions. We compare their performance and comment on the strengths and weaknesses of the various methods. For each estimator, we develop a precise method for calculating the estimation error based on any specific data set. Finally we report the performance of these entropy estimators on the (binary) spike trains of 28 neurons recorded simultaneously for a one-hour period from the primary motor and dorsal premotor cortices of a quietly seated monkey not engaged in a task behavior. Based on joint work with Yun Gao and Elie Bienenstock.","PeriodicalId":293144,"journal":{"name":"2006 IEEE Information Theory Workshop - ITW '06 Punta del Este","volume":"26 6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2006 IEEE Information Theory Workshop - ITW '06 Punta del Este","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITW.2006.1633823","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Cited by: 1
Abstract
We consider the statistical problem of estimating the entropy of finite-alphabet data generated from an unknown stationary process. We examine a series of estimators, including: (1) the standard maximum-likelihood or "plug-in" estimator; (2) four different estimators based on the family of Lempel-Ziv compression algorithms; (3) a different plug-in estimator tailored specifically to renewal processes; and (4) the natural estimator derived from the Context-Tree Weighting (CTW) method. Some of these estimators are well known, and some are new. We first summarize numerous theoretical properties of these estimators: conditions for consistency, estimates of their bias and variance, and methods for approximating the estimation error and for obtaining confidence intervals. Several new theoretical results are developed, and we show how they offer preliminary guidelines for tuning the parameters involved in the estimation process. We then present an extensive simulation study on various types of synthetic data and under various conditions, comparing the estimators' performance and commenting on the strengths and weaknesses of each method. For each estimator, we develop a precise method for calculating the estimation error on any specific data set. Finally, we report the performance of these entropy estimators on the (binary) spike trains of 28 neurons recorded simultaneously for a one-hour period from the primary motor and dorsal premotor cortices of a quietly seated monkey not engaged in a behavioral task. Based on joint work with Yun Gao and Elie Bienenstock.
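To make item (1) concrete, here is a minimal Python sketch of a plug-in (maximum-likelihood) entropy-rate estimate; the use of overlapping blocks and the choice of `block_len` are illustrative assumptions, not the specification used in the paper.

```python
from collections import Counter
from math import log2
import random

def plugin_entropy(data, block_len=1):
    """Plug-in (maximum-likelihood) entropy estimate, in bits per symbol.

    Computes the empirical entropy of overlapping blocks of length
    `block_len` and divides by `block_len`, a standard way of turning
    the plug-in estimator into an entropy-rate estimate.
    """
    blocks = [tuple(data[i:i + block_len])
              for i in range(len(data) - block_len + 1)]
    n = len(blocks)
    counts = Counter(blocks)
    h_block = -sum((c / n) * log2(c / n) for c in counts.values())
    return h_block / block_len

# Sanity check: i.i.d. fair-coin bits should give roughly 1 bit/symbol.
bits = [random.randint(0, 1) for _ in range(20000)]
print(plugin_entropy(bits, block_len=4))
```

Larger block lengths reduce the bias due to memory in the process but increase the variance for a fixed amount of data, which is exactly the kind of tuning trade-off the abstract alludes to.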
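The Lempel-Ziv-based estimators come in several variants. The sketch below is one generic match-length estimator in the Wyner-Ziv / Ornstein-Weiss spirit; the window size, the matching convention, and the function name are assumptions for illustration and need not coincide with the four variants studied in the paper.

```python
from math import log2

def lz_match_entropy(data, window=None):
    """Rough LZ-style entropy estimate (bits per symbol) from match lengths.

    For each position i past the initial window, L_i is one plus the length
    of the longest prefix of data[i:] that also occurs entirely inside the
    preceding `window` symbols.  Since L_i / log2(window) is asymptotically
    close to 1/H for stationary ergodic sources, the estimate is
    log2(window) * m / sum(L_i) over the m positions considered.
    The search below is naive (roughly O(n * window * match length)),
    so it is only meant for short illustrative sequences.
    """
    n = len(data)
    if window is None:
        window = n // 2  # use the first half of the data as the "database"
    lengths = []
    for i in range(window, n):
        best = 0
        for j in range(i - window, i):
            l = 0
            while j + l < i and i + l < n and data[j + l] == data[i + l]:
                l += 1
            if l > best:
                best = l
        lengths.append(best + 1)
    return log2(window) * len(lengths) / sum(lengths)

# Example: a highly repetitive sequence should give a very low estimate.
seq = [0, 1] * 500
print(lz_match_entropy(seq, window=200))
```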