Sébastien Farkas, Antoine Heranval, Olivier Lopez, Maud Thomas
{"title":"用于极端事件分析的广义帕累托回归树","authors":"Sébastien Farkas, Antoine Heranval, Olivier Lopez, Maud Thomas","doi":"10.1007/s10687-024-00485-1","DOIUrl":null,"url":null,"abstract":"<p>This paper derives finite sample results to assess the consistency of Generalized Pareto regression trees introduced by Farkas et al. (Insur. Math. Econ. 98:92–105, 2021) as tools to perform extreme value regression for heavy-tailed distributions. This procedure allows the constitution of classes of observations with similar tail behaviors depending on the value of the covariates, based on a recursive partition of the sample and simple model selection rules. The results we provide are obtained from concentration inequalities, and are valid for a finite sample size. A misspecification bias that arises from the use of a “Peaks over Threshold” approach is also taken into account. Moreover, the derived properties legitimate the pruning strategies, that is the model selection rules, used to select a proper tree that achieves a compromise between simplicity and goodness-of-fit. The methodology is illustrated through a simulation study, and a real data application in insurance for natural disasters.</p>","PeriodicalId":49274,"journal":{"name":"Extremes","volume":"16 1","pages":""},"PeriodicalIF":1.1000,"publicationDate":"2024-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Generalized pareto regression trees for extreme event analysis\",\"authors\":\"Sébastien Farkas, Antoine Heranval, Olivier Lopez, Maud Thomas\",\"doi\":\"10.1007/s10687-024-00485-1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>This paper derives finite sample results to assess the consistency of Generalized Pareto regression trees introduced by Farkas et al. (Insur. Math. Econ. 98:92–105, 2021) as tools to perform extreme value regression for heavy-tailed distributions. This procedure allows the constitution of classes of observations with similar tail behaviors depending on the value of the covariates, based on a recursive partition of the sample and simple model selection rules. The results we provide are obtained from concentration inequalities, and are valid for a finite sample size. A misspecification bias that arises from the use of a “Peaks over Threshold” approach is also taken into account. Moreover, the derived properties legitimate the pruning strategies, that is the model selection rules, used to select a proper tree that achieves a compromise between simplicity and goodness-of-fit. The methodology is illustrated through a simulation study, and a real data application in insurance for natural disasters.</p>\",\"PeriodicalId\":49274,\"journal\":{\"name\":\"Extremes\",\"volume\":\"16 1\",\"pages\":\"\"},\"PeriodicalIF\":1.1000,\"publicationDate\":\"2024-03-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Extremes\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.1007/s10687-024-00485-1\",\"RegionNum\":3,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Extremes","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1007/s10687-024-00485-1","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
摘要
本文推导了有限样本结果,以评估 Farkas 等人(Insur. Math. Econ. 98:92-105, 2021 年)引入的广义帕累托回归树作为重尾分布极值回归工具的一致性。该程序允许根据协变量的值,基于样本的递归分区和简单的模型选择规则,构成具有相似尾部行为的观测类别。我们提供的结果是从集中不等式中得到的,对有限样本量有效。使用 "峰值超过阈值 "的方法所产生的规范偏差也被考虑在内。此外,推导出的属性使剪枝策略(即模型选择规则)合法化,用于选择适当的树,在简洁性和拟合度之间取得折中。该方法通过模拟研究和自然灾害保险中的实际数据应用进行了说明。
Generalized pareto regression trees for extreme event analysis
This paper derives finite sample results to assess the consistency of Generalized Pareto regression trees introduced by Farkas et al. (Insur. Math. Econ. 98:92–105, 2021) as tools to perform extreme value regression for heavy-tailed distributions. This procedure allows the constitution of classes of observations with similar tail behaviors depending on the value of the covariates, based on a recursive partition of the sample and simple model selection rules. The results we provide are obtained from concentration inequalities, and are valid for a finite sample size. A misspecification bias that arises from the use of a “Peaks over Threshold” approach is also taken into account. Moreover, the derived properties legitimate the pruning strategies, that is the model selection rules, used to select a proper tree that achieves a compromise between simplicity and goodness-of-fit. The methodology is illustrated through a simulation study, and a real data application in insurance for natural disasters.
ExtremesMATHEMATICS, INTERDISCIPLINARY APPLICATIONS-STATISTICS & PROBABILITY
CiteScore
2.20
自引率
7.70%
发文量
15
审稿时长
>12 weeks
期刊介绍:
Extremes publishes original research on all aspects of statistical extreme value theory and its applications in science, engineering, economics and other fields. Authoritative and timely reviews of theoretical advances and of extreme value methods and problems in important applied areas, including detailed case studies, are welcome and will be a regular feature. All papers are refereed. Publication will be swift: in particular electronic submission and correspondence is encouraged.
Statistical extreme value methods encompass a very wide range of problems: Extreme waves, rainfall, and floods are of basic importance in oceanography and hydrology, as are high windspeeds and extreme temperatures in meteorology and catastrophic claims in insurance. The waveforms and extremes of random loads determine lifelengths in structural safety, corrosion and metal fatigue.