{"title":"Asymptotics for conformal inference","authors":"Ulysse Gazin","doi":"arxiv-2409.12019","DOIUrl":null,"url":null,"abstract":"Conformal inference is a versatile tool for building prediction sets in\nregression or classification. In this paper, we consider the false coverage\nproportion (FCP) in a transductive setting with a calibration sample of n\npoints and a test sample of m points. We identify the exact, distribution-free,\nasymptotic distribution of the FCP when both n and m tend to infinity. This\nshows in particular that FCP control can be achieved by using the well-known\nKolmogorov distribution, and puts forward that the asymptotic variance is\ndecreasing in the ratio n/m. We then provide a number of extensions by\nconsidering the novelty detection problem, weighted conformal inference and\ndistribution shift between the calibration sample and the test sample. In\nparticular, our asymptotical results allow to accurately quantify the\nasymptotical behavior of the errors when weighted conformal inference is used.","PeriodicalId":501379,"journal":{"name":"arXiv - STAT - Statistics Theory","volume":"55 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Statistics Theory","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.12019","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Conformal inference is a versatile tool for building prediction sets in
regression or classification. In this paper, we consider the false coverage
proportion (FCP) in a transductive setting with a calibration sample of n
points and a test sample of m points. We identify the exact, distribution-free,
asymptotic distribution of the FCP when both n and m tend to infinity. This
shows in particular that FCP control can be achieved by using the well-known
Kolmogorov distribution, and puts forward that the asymptotic variance is
decreasing in the ratio n/m. We then provide a number of extensions by
considering the novelty detection problem, weighted conformal inference and
distribution shift between the calibration sample and the test sample. In
particular, our asymptotical results allow to accurately quantify the
asymptotical behavior of the errors when weighted conformal inference is used.
共形推理是建立回归或分类预测集的通用工具。在本文中,我们考虑了在具有 n 个点的校准样本和 m 个点的测试样本的反演环境中的虚假覆盖率(FCP)。我们确定了当 n 和 m 都趋于无穷大时,FCP 的精确、无分布、渐近分布。这特别表明,使用著名的科尔莫戈罗夫分布可以实现 FCP 控制,并提出渐近方差随 n/m 之比递减。然后,我们通过考虑新颖性检测问题、加权保形推理以及校准样本和测试样本之间的分布偏移,提出了一些扩展方法。特别是,当使用加权保形推理时,我们的渐近结果可以准确量化误差的渐近行为。