Journal of Applied Probability最新文献

The dutch draw: constructing a universal baseline for binary classification problems 荷兰平局：构建二元分类问题的通用基线

IF 1 4区数学

Journal of Applied Probability Pub Date : 2024-09-19 DOI: 10.1017/jpr.2024.52

Etienne van de Bijl, Jan Klein, Joris Pries, Sandjai Bhulai, Mark Hoogendoorn, Rob van der Mei

{"title":"The dutch draw: constructing a universal baseline for binary classification problems","authors":"Etienne van de Bijl, Jan Klein, Joris Pries, Sandjai Bhulai, Mark Hoogendoorn, Rob van der Mei","doi":"10.1017/jpr.2024.52","DOIUrl":"https://doi.org/10.1017/jpr.2024.52","url":null,"abstract":"Novel prediction methods should always be compared to a baseline to determine their performance. Without this frame of reference, the performance score of a model is basically meaningless. What does it mean when a model achieves an <img data-mimesubtype=\"png\" data-type=\"\" src=\"https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20240918134025706-0265:S0021900224000524:S0021900224000524_inline1.png\">$F_1$</img> of 0.8 on a test set? A proper baseline is, therefore, required to evaluate the ‘goodness’ of a performance score. Comparing results with the latest state-of-the-art model is usually insightful. However, being state-of-the-art is dynamic, as newer models are continuously developed. Contrary to an advanced model, it is also possible to use a simple dummy classifier. However, the latter model could be beaten too easily, making the comparison less valuable. Furthermore, most existing baselines are stochastic and need to be computed repeatedly to get a reliable expected performance, which could be computationally expensive. We present a universal baseline method for all binary classification models, named the Dutch Draw (DD). This approach weighs simple classifiers and determines the best classifier to use as a baseline. Theoretically, we derive the DD baseline for many commonly used evaluation measures and show that in most situations it reduces to (almost) always predicting either zero or one. Summarizing, the DD baseline is general, as it is applicable to any binary classification problem; simple, as it can be quickly determined without training or parameter tuning; and informative, as insightful conclusions can be drawn from the results. The DD baseline serves two purposes. First, it is a robust and universal baseline that enables comparisons across research papers. Second, it provides a sanity check during the prediction model’s development process. When a model does not outperform the DD baseline, it is a major warning sign.","PeriodicalId":50256,"journal":{"name":"Journal of Applied Probability","volume":"194 1","pages":""},"PeriodicalIF":1.0,"publicationDate":"2024-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142248308","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Transience of continuous-time conservative random walks 连续时间保守随机游走的短暂性

IF 1 4区数学

Journal of Applied Probability Pub Date : 2024-09-18 DOI: 10.1017/jpr.2024.46

Satyaki Bhattacharya, Stanislav Volkov

{"title":"Transience of continuous-time conservative random walks","authors":"Satyaki Bhattacharya, Stanislav Volkov","doi":"10.1017/jpr.2024.46","DOIUrl":"https://doi.org/10.1017/jpr.2024.46","url":null,"abstract":"We consider two continuous-time generalizations of conservative random walks introduced in Englander and Volkov (2022), an orthogonal and a spherically symmetrical one; the latter model is also known as random flights. For both models, we show the transience of the walks when $dge 2$ and that the rate of direction changing follows a power law $t^{-alpha}$ , $0<alphale 1$ , or the law $(!ln t)^{-beta}$ where $beta>2$ .","PeriodicalId":50256,"journal":{"name":"Journal of Applied Probability","volume":"14 1","pages":""},"PeriodicalIF":1.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142247915","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Efficiency of reversible MCMC methods: elementary derivations and applications to composite methods 可逆 MCMC 方法的效率：基本推导及对复合方法的应用

IF 1 4区数学

Journal of Applied Probability Pub Date : 2024-09-18 DOI: 10.1017/jpr.2024.48

Radford M. Neal, Jeffrey S. Rosenthal

{"title":"Efficiency of reversible MCMC methods: elementary derivations and applications to composite methods","authors":"Radford M. Neal, Jeffrey S. Rosenthal","doi":"10.1017/jpr.2024.48","DOIUrl":"https://doi.org/10.1017/jpr.2024.48","url":null,"abstract":"We review criteria for comparing the efficiency of Markov chain Monte Carlo (MCMC) methods with respect to the asymptotic variance of estimates of expectations of functions of state, and show how such criteria can justify ways of combining improvements to MCMC methods. We say that a chain on a finite state space with transition matrix P efficiency-dominates one with transition matrix Q if for every function of state it has lower (or equal) asymptotic variance. We give elementary proofs of some previous results regarding efficiency dominance, leading to a self-contained demonstration that a reversible chain with transition matrix P efficiency-dominates a reversible chain with transition matrix Q if and only if none of the eigenvalues of $Q-P$ are negative. This allows us to conclude that modifying a reversible MCMC method to improve its efficiency will also improve the efficiency of a method that randomly chooses either this or some other reversible method, and to conclude that improving the efficiency of a reversible update for one component of state (as in Gibbs sampling) will improve the overall efficiency of a reversible method that combines this and other updates. It also explains how antithetic MCMC can be more efficient than independent and identically distributed sampling. We also establish conditions that can guarantee that a method is not efficiency-dominated by any other method.","PeriodicalId":50256,"journal":{"name":"Journal of Applied Probability","volume":"36 1","pages":""},"PeriodicalIF":1.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142247916","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A non-homogeneous alternating renewal process model for interval censoring 区间剔除的非均质交替更新过程模型

IF 1 4区数学

Journal of Applied Probability Pub Date : 2024-09-16 DOI: 10.1017/jpr.2024.54

M. N. M. van Lieshout, R. L. Markwitz

引用次数: 0

An algorithm to construct coherent systems using signatures 利用签名构建连贯系统的算法

IF 1 4区数学

Journal of Applied Probability Pub Date : 2024-09-16 DOI: 10.1017/jpr.2024.60

T. V. Rao, Sameen Naqvi

引用次数: 0

Quenched worst-case scenario for root deletion in targeted cutting of random recursive trees 随机递归树定向切割中根删除的淬火最坏情况假设

IF 1 4区数学

Journal of Applied Probability Pub Date : 2024-09-03 DOI: 10.1017/jpr.2024.40

Laura Eslava, Sergio I. López, Marco L. Ortiz

{"title":"Quenched worst-case scenario for root deletion in targeted cutting of random recursive trees","authors":"Laura Eslava, Sergio I. López, Marco L. Ortiz","doi":"10.1017/jpr.2024.40","DOIUrl":"https://doi.org/10.1017/jpr.2024.40","url":null,"abstract":"We propose a method for cutting down a random recursive tree that focuses on its higher-degree vertices. Enumerate the vertices of a random recursive tree of size <jats:italic>n</jats:italic> according to the decreasing order of their degrees; namely, let <jats:inline-formula> <jats:alternatives> <jats:inline-graphic xmlns:xlink=\"http://www.w3.org/1999/xlink\" mime-subtype=\"png\" xlink:href=\"S0021900224000408_inline1.png\"/> <jats:tex-math>$(v^{(i)})_{i=1}^{n}$</jats:tex-math> </jats:alternatives> </jats:inline-formula> be such that <jats:inline-formula> <jats:alternatives> <jats:inline-graphic xmlns:xlink=\"http://www.w3.org/1999/xlink\" mime-subtype=\"png\" xlink:href=\"S0021900224000408_inline2.png\"/> <jats:tex-math>$deg(v^{(1)}) geq cdots geq deg (v^{(n)})$</jats:tex-math> </jats:alternatives> </jats:inline-formula>. The targeted vertex-cutting process is performed by sequentially removing vertices <jats:inline-formula> <jats:alternatives> <jats:inline-graphic xmlns:xlink=\"http://www.w3.org/1999/xlink\" mime-subtype=\"png\" xlink:href=\"S0021900224000408_inline3.png\"/> <jats:tex-math>$v^{(1)}, v^{(2)}, ldots, v^{(n)}$</jats:tex-math> </jats:alternatives> </jats:inline-formula> and keeping only the subtree containing the root after each removal. The algorithm ends when the root is picked to be removed. The total number of steps for this procedure, <jats:inline-formula> <jats:alternatives> <jats:inline-graphic xmlns:xlink=\"http://www.w3.org/1999/xlink\" mime-subtype=\"png\" xlink:href=\"S0021900224000408_inline4.png\"/> <jats:tex-math>$K_n$</jats:tex-math> </jats:alternatives> </jats:inline-formula>, is upper bounded by <jats:inline-formula> <jats:alternatives> <jats:inline-graphic xmlns:xlink=\"http://www.w3.org/1999/xlink\" mime-subtype=\"png\" xlink:href=\"S0021900224000408_inline5.png\"/> <jats:tex-math>$Z_{geq D}$</jats:tex-math> </jats:alternatives> </jats:inline-formula>, which denotes the number of vertices that have degree at least as large as the degree of the root. We prove that <jats:inline-formula> <jats:alternatives> <jats:inline-graphic xmlns:xlink=\"http://www.w3.org/1999/xlink\" mime-subtype=\"png\" xlink:href=\"S0021900224000408_inline6.png\"/> <jats:tex-math>$ln Z_{geq D}$</jats:tex-math> </jats:alternatives> </jats:inline-formula> grows as <jats:inline-formula> <jats:alternatives> <jats:inline-graphic xmlns:xlink=\"http://www.w3.org/1999/xlink\" mime-subtype=\"png\" xlink:href=\"S0021900224000408_inline7.png\"/> <jats:tex-math>$ln n$</jats:tex-math> </jats:alternatives> </jats:inline-formula> asymptotically and obtain its limiting behavior in probability. Moreover, we obtain that the <jats:italic>k</jats:italic>th moment of <jats:inline-formula> <jats:alternatives> <jats:inline-graphic xmlns:xlink=\"http://www.w3.org/1999/xlink\" mime-subtype=\"png\" xlink:href=\"S0021900224000408_inline8.png\"/> <jats:tex-math>$ln Z_{geq D}$</jats:tex-math> </jats:alternatives> </jats:inline-formula> is proportional to <jats:inline-formula> <jats:alternatives> <jats:inline-graphic","PeriodicalId":50256,"journal":{"name":"Journal of Applied Probability","volume":"58 1","pages":""},"PeriodicalIF":1.0,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142194703","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Stochastic gradient descent for barycenters in Wasserstein space 瓦瑟尔斯坦空间双曲中心的随机梯度下降

IF 1 4区数学

Journal of Applied Probability Pub Date : 2024-09-03 DOI: 10.1017/jpr.2024.39

Julio Backhoff, Joaquin Fontbona, Gonzalo Rios, Felipe Tobar

{"title":"Stochastic gradient descent for barycenters in Wasserstein space","authors":"Julio Backhoff, Joaquin Fontbona, Gonzalo Rios, Felipe Tobar","doi":"10.1017/jpr.2024.39","DOIUrl":"https://doi.org/10.1017/jpr.2024.39","url":null,"abstract":"We present and study a novel algorithm for the computation of 2-Wasserstein population barycenters of absolutely continuous probability measures on Euclidean space. The proposed method can be seen as a stochastic gradient descent procedure in the 2-Wasserstein space, as well as a manifestation of a law of large numbers therein. The algorithm aims to find a Karcher mean or critical point in this setting, and can be implemented ‘online’, sequentially using independent and identically distributed random measures sampled from the population law. We provide natural sufficient conditions for this algorithm to almost surely converge in the Wasserstein space towards the population barycenter, and we introduce a novel, general condition which ensures uniqueness of Karcher means and, moreover, allows us to obtain explicit, parametric convergence rates for the expected optimality gap. We also study the mini-batch version of this algorithm, and discuss examples of families of population laws to which our method and results can be applied. This work expands and deepens ideas and results introduced in an early version of Backhoff-Veraguas et al. (2022), in which a statistical application (and numerical implementation) of this method is developed in the context of Bayesian learning.","PeriodicalId":50256,"journal":{"name":"Journal of Applied Probability","volume":"23 1","pages":""},"PeriodicalIF":1.0,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142194708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Uncertainty quantification and confidence intervals for naive rare-event estimators 天真罕见事件估计器的不确定性量化和置信区间

IF 1 4区数学

Journal of Applied Probability Pub Date : 2024-09-02 DOI: 10.1017/jpr.2024.43

Yuanlu Bai, Henry Lam

{"title":"Uncertainty quantification and confidence intervals for naive rare-event estimators","authors":"Yuanlu Bai, Henry Lam","doi":"10.1017/jpr.2024.43","DOIUrl":"https://doi.org/10.1017/jpr.2024.43","url":null,"abstract":"We consider the estimation of rare-event probabilities using sample proportions output by naive Monte Carlo or collected data. Unlike using variance reduction techniques, this naive estimator does not have an a priori relative efficiency guarantee. On the other hand, due to the recent surge of sophisticated rare-event problems arising in safety evaluations of intelligent systems, efficiency-guaranteed variance reduction may face implementation challenges which, coupled with the availability of computation or data collection power, motivate the use of such a naive estimator. In this paper we study the uncertainty quantification, namely the construction, coverage validity, and tightness of confidence intervals, for rare-event probabilities using only sample proportions. In addition to the known normality, Wilson, and exact intervals, we investigate and compare them with two new intervals derived from Chernoff’s inequality and the Berry–Esseen theorem. Moreover, we generalize our results to the natural situation where sampling stops by reaching a target number of rare-event hits. Our findings show that the normality and Wilson intervals are not always valid, but they are close to the newly developed valid intervals in terms of half-width. In contrast, the exact interval is conservative, but safely guarantees the attainment of the nominal confidence level. Our new intervals, while being more conservative than the exact interval, provide useful insights into understanding the tightness of the considered intervals.","PeriodicalId":50256,"journal":{"name":"Journal of Applied Probability","volume":"41 1","pages":""},"PeriodicalIF":1.0,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142194705","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Limit theorems of occupation times of normalized binary contact path processes on lattices 网格上归一化二元接触路径过程占用时间的极限定理

IF 1 4区数学

Journal of Applied Probability Pub Date : 2024-08-27 DOI: 10.1017/jpr.2024.41

Xiaofeng Xue

引用次数: 0

Maximizing the probability of visiting a set infinitely often for a Markov decision process with Borel state and action spaces 最大化具有 Borel 状态和行动空间的马尔可夫决策过程无限次访问集合的概率

IF 1 4区数学

Journal of Applied Probability Pub Date : 2024-08-22 DOI: 10.1017/jpr.2024.25

François Dufour, Tomás Prieto-Rumeau

引用次数: 0