{"title":"高维计数的超常规局部尖锐拟合优度测试","authors":"Subhodh Kotekal, Julien Chhor, Chao Gao","doi":"arxiv-2409.08871","DOIUrl":null,"url":null,"abstract":"We consider testing the goodness-of-fit of a distribution against\nalternatives separated in sup norm. We study the twin settings of\nPoisson-generated count data with a large number of categories and\nhigh-dimensional multinomials. In previous studies of different separation\nmetrics, it has been found that the local minimax separation rate exhibits\nsubstantial heterogeneity and is a complicated function of the null\ndistribution; the rate-optimal test requires careful tailoring to the null. In\nthe setting of sup norm, this remains the case and we establish that the local\nminimax separation rate is determined by the finer decay behavior of the\ncategory rates. The upper bound is obtained by a test involving the sample\nmaximum, and the lower bound argument involves reducing the original\nheteroskedastic null to an auxiliary homoskedastic null determined by the decay\nof the rates. Further, in a particular asymptotic setup, the sharp constants\nare identified.","PeriodicalId":501379,"journal":{"name":"arXiv - STAT - Statistics Theory","volume":"209 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Locally sharp goodness-of-fit testing in sup norm for high-dimensional counts\",\"authors\":\"Subhodh Kotekal, Julien Chhor, Chao Gao\",\"doi\":\"arxiv-2409.08871\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We consider testing the goodness-of-fit of a distribution against\\nalternatives separated in sup norm. We study the twin settings of\\nPoisson-generated count data with a large number of categories and\\nhigh-dimensional multinomials. In previous studies of different separation\\nmetrics, it has been found that the local minimax separation rate exhibits\\nsubstantial heterogeneity and is a complicated function of the null\\ndistribution; the rate-optimal test requires careful tailoring to the null. In\\nthe setting of sup norm, this remains the case and we establish that the local\\nminimax separation rate is determined by the finer decay behavior of the\\ncategory rates. The upper bound is obtained by a test involving the sample\\nmaximum, and the lower bound argument involves reducing the original\\nheteroskedastic null to an auxiliary homoskedastic null determined by the decay\\nof the rates. Further, in a particular asymptotic setup, the sharp constants\\nare identified.\",\"PeriodicalId\":501379,\"journal\":{\"name\":\"arXiv - STAT - Statistics Theory\",\"volume\":\"209 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - STAT - Statistics Theory\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.08871\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Statistics Theory","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.08871","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Locally sharp goodness-of-fit testing in sup norm for high-dimensional counts
We consider testing the goodness-of-fit of a distribution against
alternatives separated in sup norm. We study the twin settings of
Poisson-generated count data with a large number of categories and
high-dimensional multinomials. In previous studies of different separation
metrics, it has been found that the local minimax separation rate exhibits
substantial heterogeneity and is a complicated function of the null
distribution; the rate-optimal test requires careful tailoring to the null. In
the setting of sup norm, this remains the case and we establish that the local
minimax separation rate is determined by the finer decay behavior of the
category rates. The upper bound is obtained by a test involving the sample
maximum, and the lower bound argument involves reducing the original
heteroskedastic null to an auxiliary homoskedastic null determined by the decay
of the rates. Further, in a particular asymptotic setup, the sharp constants
are identified.