为二进制数据构建灵活、可识别和可解释的统计模型

IF 1.7 3区数学 Q1 STATISTICS & PROBABILITY

International Statistical Review Pub Date : 2022-01-30 DOI:10.1111/insr.12485

Henry R. Scharf, Xinyi Lu, Perry J. Williams, Mevin B. Hooten

{"title":"为二进制数据构建灵活、可识别和可解释的统计模型","authors":"Henry R. Scharf, Xinyi Lu, Perry J. Williams, Mevin B. Hooten","doi":"10.1111/insr.12485","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Binary regression models are ubiquitous in virtually every scientific field. Frequently, traditional generalised linear models fail to capture the variability in the probability surface that gives rise to the binary observations, and remedial methods are required. This has generated a substantial literature composed of binary regression models motivated by various applications. We describe an organisation of generalisations to traditional binary regression methods based on the familiar three-part structure of generalised linear models (random component, systematic component and link function). This perspective facilitates both the comparison of existing approaches and the development of flexible models with interpretable parameters that capture application-specific data-generating mechanisms. We use our proposed organisational structure to discuss concerns with certain existing models for binary data based on quantile regression. We then use the framework to develop and compare several binary regression models tailored to occupancy data for European red squirrels (<i>Sciurus vulgaris</i>).</p>\n </div>","PeriodicalId":14479,"journal":{"name":"International Statistical Review","volume":"90 2","pages":"328-345"},"PeriodicalIF":1.7000,"publicationDate":"2022-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Constructing Flexible, Identifiable and Interpretable Statistical Models for Binary Data\",\"authors\":\"Henry R. Scharf, Xinyi Lu, Perry J. Williams, Mevin B. Hooten\",\"doi\":\"10.1111/insr.12485\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>Binary regression models are ubiquitous in virtually every scientific field. Frequently, traditional generalised linear models fail to capture the variability in the probability surface that gives rise to the binary observations, and remedial methods are required. This has generated a substantial literature composed of binary regression models motivated by various applications. We describe an organisation of generalisations to traditional binary regression methods based on the familiar three-part structure of generalised linear models (random component, systematic component and link function). This perspective facilitates both the comparison of existing approaches and the development of flexible models with interpretable parameters that capture application-specific data-generating mechanisms. We use our proposed organisational structure to discuss concerns with certain existing models for binary data based on quantile regression. We then use the framework to develop and compare several binary regression models tailored to occupancy data for European red squirrels (<i>Sciurus vulgaris</i>).</p>\\n </div>\",\"PeriodicalId\":14479,\"journal\":{\"name\":\"International Statistical Review\",\"volume\":\"90 2\",\"pages\":\"328-345\"},\"PeriodicalIF\":1.7000,\"publicationDate\":\"2022-01-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Statistical Review\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/insr.12485\",\"RegionNum\":3,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"STATISTICS & PROBABILITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Statistical Review","FirstCategoryId":"100","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/insr.12485","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}

引用次数: 3

摘要

二元回归模型几乎在每个科学领域都是普遍存在的。通常，传统的广义线性模型无法捕捉到导致二元观测结果的概率面变异性，因此需要采取补救方法。这产生了大量由各种应用驱动的二元回归模型组成的文献。我们描述了一种基于熟悉的广义线性模型的三部分结构(随机成分，系统成分和链接函数)的传统二元回归方法的一般化组织。这个透视图既便于对现有方法进行比较，也便于开发具有可解释参数的灵活模型，这些参数捕获特定于应用程序的数据生成机制。我们使用我们提出的组织结构来讨论基于分位数回归的二元数据的某些现有模型的关注点。然后，我们使用该框架开发并比较了针对欧洲红松鼠(Sciurus vulgaris)占用数据量身定制的几种二元回归模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Constructing Flexible, Identifiable and Interpretable Statistical Models for Binary Data

Binary regression models are ubiquitous in virtually every scientific field. Frequently, traditional generalised linear models fail to capture the variability in the probability surface that gives rise to the binary observations, and remedial methods are required. This has generated a substantial literature composed of binary regression models motivated by various applications. We describe an organisation of generalisations to traditional binary regression methods based on the familiar three-part structure of generalised linear models (random component, systematic component and link function). This perspective facilitates both the comparison of existing approaches and the development of flexible models with interpretable parameters that capture application-specific data-generating mechanisms. We use our proposed organisational structure to discuss concerns with certain existing models for binary data based on quantile regression. We then use the framework to develop and compare several binary regression models tailored to occupancy data for European red squirrels (Sciurus vulgaris).

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Statistical Review 数学-统计学与概率论

CiteScore

4.30

自引率

5.00%

发文量

审稿时长

>12 weeks

期刊介绍： International Statistical Review is the flagship journal of the International Statistical Institute (ISI) and of its family of Associations. It publishes papers of broad and general interest in statistics and probability. The term Review is to be interpreted broadly. The types of papers that are suitable for publication include (but are not limited to) the following: reviews/surveys of significant developments in theory, methodology, statistical computing and graphics, statistical education, and application areas; tutorials on important topics; expository papers on emerging areas of research or application; papers describing new developments and/or challenges in relevant areas; papers addressing foundational issues; papers on the history of statistics and probability; white papers on topics of importance to the profession or society; and historical assessment of seminal papers in the field and their impact.