A Critical Reassessment of the Saerens-Latinne-Decaestecker Algorithm for Posterior Probability Adjustment

Andrea Esuli, Alessio Molinari, F. Sebastiani
{"title":"A Critical Reassessment of the Saerens-Latinne-Decaestecker Algorithm for Posterior Probability Adjustment","authors":"Andrea Esuli, Alessio Molinari, F. Sebastiani","doi":"10.1145/3433164","DOIUrl":null,"url":null,"abstract":"We critically re-examine the Saerens-Latinne-Decaestecker (SLD) algorithm, a well-known method for estimating class prior probabilities (“priors”) and adjusting posterior probabilities (“posteriors”) in scenarios characterized by distribution shift, i.e., difference in the distribution of the priors between the training and the unlabelled documents. Given a machine learned classifier and a set of unlabelled documents for which the classifier has returned posterior probabilities and estimates of the prior probabilities, SLD updates them both in an iterative, mutually recursive way, with the goal of making both more accurate; this is of key importance in downstream tasks such as single-label multiclass classification and cost-sensitive text classification. Since its publication, SLD has become the standard algorithm for improving the quality of the posteriors in the presence of distribution shift, and SLD is still considered a top contender when we need to estimate the priors (a task that has become known as “quantification”). However, its real effectiveness in improving the quality of the posteriors has been questioned. We here present the results of systematic experiments conducted on a large, publicly available dataset, across multiple amounts of distribution shift and multiple learners. Our experiments show that SLD improves the quality of the posterior probabilities and of the estimates of the prior probabilities, but only when the number of classes in the classification scheme is very small and the classifier is calibrated. As the number of classes grows, or as we use non-calibrated classifiers, SLD converges more slowly (and often does not converge at all), performance degrades rapidly, and the impact of SLD on the quality of the prior estimates and of the posteriors becomes negative rather than positive.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"59 1","pages":"1 - 34"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Information Systems (TOIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3433164","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11

Abstract

We critically re-examine the Saerens-Latinne-Decaestecker (SLD) algorithm, a well-known method for estimating class prior probabilities (“priors”) and adjusting posterior probabilities (“posteriors”) in scenarios characterized by distribution shift, i.e., a difference in the distribution of the priors between the training documents and the unlabelled documents. Given a machine-learned classifier and a set of unlabelled documents for which the classifier has returned posterior probabilities and estimates of the prior probabilities, SLD updates both in an iterative, mutually recursive way, with the goal of making both more accurate; this is of key importance in downstream tasks such as single-label multiclass classification and cost-sensitive text classification. Since its publication, SLD has become the standard algorithm for improving the quality of the posteriors in the presence of distribution shift, and it is still considered a top contender when the priors need to be estimated (a task that has become known as “quantification”). However, its real effectiveness in improving the quality of the posteriors has been questioned. Here we present the results of systematic experiments conducted on a large, publicly available dataset, across multiple amounts of distribution shift and multiple learners. Our experiments show that SLD improves the quality of the posterior probabilities and of the estimates of the prior probabilities, but only when the number of classes in the classification scheme is very small and the classifier is calibrated. As the number of classes grows, or as we use non-calibrated classifiers, SLD converges more slowly (and often does not converge at all), performance degrades rapidly, and the impact of SLD on the quality of the prior estimates and of the posteriors becomes negative rather than positive.
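As a concrete illustration of the iterative, mutually recursive update the abstract describes, the following is a minimal NumPy sketch of the SLD (EM-style) procedure. The function name sld and all variable names are ours, not from the paper, and the sketch assumes calibrated posteriors and strictly positive training priors:

    import numpy as np

    def sld(posteriors, train_priors, epsilon=1e-6, max_iter=1000):
        # posteriors:   (N, C) matrix of classifier posteriors P(y|x) on the
        #               unlabelled documents (assumed calibrated).
        # train_priors: length-C vector of class priors in the training set
        #               (assumed strictly positive).
        # Returns the adjusted priors and the adjusted posteriors.
        priors = train_priors.copy()
        post = posteriors.copy()
        for _ in range(max_iter):
            # E-step: rescale the *original* posteriors by the ratio of the
            # current prior estimates to the training priors, then
            # renormalize each row so it sums to one.
            scaled = posteriors * (priors / train_priors)
            post = scaled / scaled.sum(axis=1, keepdims=True)
            # M-step: re-estimate the priors as the mean adjusted posterior.
            new_priors = post.mean(axis=0)
            converged = np.abs(new_priors - priors).sum() < epsilon
            priors = new_priors
            if converged:
                break
        return priors, post

Each E-step rescales the original posteriors by the ratio between the current prior estimates and the training priors; each M-step re-estimates the priors as the mean of the adjusted posteriors. The paper's findings concern how reliably this iteration converges, and whether its fixed point is actually more accurate, as the number of classes grows and as calibration degrades.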