{"title":"分布变化检测的多假设检验方法","authors":"G. Golubev, M. Safarian","doi":"10.3103/s1066530719020054","DOIUrl":null,"url":null,"abstract":"Let <i>X</i><sub>1</sub>, <i>X</i><sub>2</sub>,... be independent random variables observed sequentially and such that <i>X</i><sub>1</sub>,..., <i>X</i><sub><i>θ</i>−1</sub> have a common probability density <i>p</i><sub><i>0</i></sub>, while <i>X</i><sub><i>θ</i></sub>, <i>X</i><sub><i>θ</i>+1</sub>,... are all distributed according to <i>p</i><sub>1</sub> ≠ <i>p</i><sub>0</sub>. It is assumed that <i>p</i><sub>0</sub> and <i>p</i><sub>1</sub> are known, but the time change <i>θ</i> ∈ ℤ<sup>+</sup> is unknown and the goal is to construct a stopping time <i>τ</i> that detects the change-point <i>θ</i> as soon as possible. The standard approaches to this problem rely essentially on some prior information about <i>θ</i>. For instance, in the Bayes approach, it is assumed that <i>θ</i> is a random variable with a known probability distribution. In the methods related to hypothesis testing, this a priori information is hidden in the so-called average run length. The main goal in this paper is to construct stopping times that are free from a priori information about <i>θ.</i> More formally, we propose an approach to solving approximately the following minimization problem:<span>$$\\Delta(\\theta;{\\tau^\\alpha})\\rightarrow\\min_{\\tau^\\alpha}\\;\\;\\text{subject}\\;\\text{to}\\;\\;\\alpha(\\theta;{\\tau^\\alpha})\\leq\\alpha\\;\\text{for}\\;\\text{any}\\;\\theta\\geq1,$$</span>where <i>α</i>(<i>θ; τ</i>) = P<sub><i>θ</i></sub>{<i>τ < θ</i>} is <i>the false alarm probability</i> and <i>Δ</i>(<i>θ</i>; <i>τ</i>) = E<sub><i>θ</i></sub>(<i>τ − θ</i>)<sub>+</sub> is <i>the average detection delay</i> computed for a given stopping time <i>τ</i>. In contrast to the standard CUSUM algorithm based on the sequential maximum likelihood test, our approach is related to a multiple hypothesis testing methods and permits, in particular, to construct universal stopping times with nearly Bayes detection delays.","PeriodicalId":46039,"journal":{"name":"Mathematical Methods of Statistics","volume":null,"pages":null},"PeriodicalIF":0.8000,"publicationDate":"2019-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Multiple Hypothesis Testing Approach to Detection Changes in Distribution\",\"authors\":\"G. Golubev, M. Safarian\",\"doi\":\"10.3103/s1066530719020054\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Let <i>X</i><sub>1</sub>, <i>X</i><sub>2</sub>,... be independent random variables observed sequentially and such that <i>X</i><sub>1</sub>,..., <i>X</i><sub><i>θ</i>−1</sub> have a common probability density <i>p</i><sub><i>0</i></sub>, while <i>X</i><sub><i>θ</i></sub>, <i>X</i><sub><i>θ</i>+1</sub>,... are all distributed according to <i>p</i><sub>1</sub> ≠ <i>p</i><sub>0</sub>. It is assumed that <i>p</i><sub>0</sub> and <i>p</i><sub>1</sub> are known, but the time change <i>θ</i> ∈ ℤ<sup>+</sup> is unknown and the goal is to construct a stopping time <i>τ</i> that detects the change-point <i>θ</i> as soon as possible. The standard approaches to this problem rely essentially on some prior information about <i>θ</i>. For instance, in the Bayes approach, it is assumed that <i>θ</i> is a random variable with a known probability distribution. In the methods related to hypothesis testing, this a priori information is hidden in the so-called average run length. The main goal in this paper is to construct stopping times that are free from a priori information about <i>θ.</i> More formally, we propose an approach to solving approximately the following minimization problem:<span>$$\\\\Delta(\\\\theta;{\\\\tau^\\\\alpha})\\\\rightarrow\\\\min_{\\\\tau^\\\\alpha}\\\\;\\\\;\\\\text{subject}\\\\;\\\\text{to}\\\\;\\\\;\\\\alpha(\\\\theta;{\\\\tau^\\\\alpha})\\\\leq\\\\alpha\\\\;\\\\text{for}\\\\;\\\\text{any}\\\\;\\\\theta\\\\geq1,$$</span>where <i>α</i>(<i>θ; τ</i>) = P<sub><i>θ</i></sub>{<i>τ < θ</i>} is <i>the false alarm probability</i> and <i>Δ</i>(<i>θ</i>; <i>τ</i>) = E<sub><i>θ</i></sub>(<i>τ − θ</i>)<sub>+</sub> is <i>the average detection delay</i> computed for a given stopping time <i>τ</i>. In contrast to the standard CUSUM algorithm based on the sequential maximum likelihood test, our approach is related to a multiple hypothesis testing methods and permits, in particular, to construct universal stopping times with nearly Bayes detection delays.\",\"PeriodicalId\":46039,\"journal\":{\"name\":\"Mathematical Methods of Statistics\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.8000,\"publicationDate\":\"2019-08-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Mathematical Methods of Statistics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3103/s1066530719020054\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"STATISTICS & PROBABILITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mathematical Methods of Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3103/s1066530719020054","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
A Multiple Hypothesis Testing Approach to Detection Changes in Distribution
Let X1, X2,... be independent random variables observed sequentially and such that X1,..., Xθ−1 have a common probability density p0, while Xθ, Xθ+1,... are all distributed according to p1 ≠ p0. It is assumed that p0 and p1 are known, but the time change θ ∈ ℤ+ is unknown and the goal is to construct a stopping time τ that detects the change-point θ as soon as possible. The standard approaches to this problem rely essentially on some prior information about θ. For instance, in the Bayes approach, it is assumed that θ is a random variable with a known probability distribution. In the methods related to hypothesis testing, this a priori information is hidden in the so-called average run length. The main goal in this paper is to construct stopping times that are free from a priori information about θ. More formally, we propose an approach to solving approximately the following minimization problem:$$\Delta(\theta;{\tau^\alpha})\rightarrow\min_{\tau^\alpha}\;\;\text{subject}\;\text{to}\;\;\alpha(\theta;{\tau^\alpha})\leq\alpha\;\text{for}\;\text{any}\;\theta\geq1,$$where α(θ; τ) = Pθ{τ < θ} is the false alarm probability and Δ(θ; τ) = Eθ(τ − θ)+ is the average detection delay computed for a given stopping time τ. In contrast to the standard CUSUM algorithm based on the sequential maximum likelihood test, our approach is related to a multiple hypothesis testing methods and permits, in particular, to construct universal stopping times with nearly Bayes detection delays.
期刊介绍:
Mathematical Methods of Statistics is an is an international peer reviewed journal dedicated to the mathematical foundations of statistical theory. It primarily publishes research papers with complete proofs and, occasionally, review papers on particular problems of statistics. Papers dealing with applications of statistics are also published if they contain new theoretical developments to the underlying statistical methods. The journal provides an outlet for research in advanced statistical methodology and for studies where such methodology is effectively used or which stimulate its further development.