Henry R. Scharf, Xinyi Lu, Perry J. Williams, Mevin B. Hooten
{"title":"为二进制数据构建灵活、可识别和可解释的统计模型","authors":"Henry R. Scharf, Xinyi Lu, Perry J. Williams, Mevin B. Hooten","doi":"10.1111/insr.12485","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Binary regression models are ubiquitous in virtually every scientific field. Frequently, traditional generalised linear models fail to capture the variability in the probability surface that gives rise to the binary observations, and remedial methods are required. This has generated a substantial literature composed of binary regression models motivated by various applications. We describe an organisation of generalisations to traditional binary regression methods based on the familiar three-part structure of generalised linear models (random component, systematic component and link function). This perspective facilitates both the comparison of existing approaches and the development of flexible models with interpretable parameters that capture application-specific data-generating mechanisms. We use our proposed organisational structure to discuss concerns with certain existing models for binary data based on quantile regression. We then use the framework to develop and compare several binary regression models tailored to occupancy data for European red squirrels (<i>Sciurus vulgaris</i>).</p>\n </div>","PeriodicalId":14479,"journal":{"name":"International Statistical Review","volume":"90 2","pages":"328-345"},"PeriodicalIF":1.7000,"publicationDate":"2022-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Constructing Flexible, Identifiable and Interpretable Statistical Models for Binary Data\",\"authors\":\"Henry R. Scharf, Xinyi Lu, Perry J. Williams, Mevin B. Hooten\",\"doi\":\"10.1111/insr.12485\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>Binary regression models are ubiquitous in virtually every scientific field. Frequently, traditional generalised linear models fail to capture the variability in the probability surface that gives rise to the binary observations, and remedial methods are required. This has generated a substantial literature composed of binary regression models motivated by various applications. We describe an organisation of generalisations to traditional binary regression methods based on the familiar three-part structure of generalised linear models (random component, systematic component and link function). This perspective facilitates both the comparison of existing approaches and the development of flexible models with interpretable parameters that capture application-specific data-generating mechanisms. We use our proposed organisational structure to discuss concerns with certain existing models for binary data based on quantile regression. We then use the framework to develop and compare several binary regression models tailored to occupancy data for European red squirrels (<i>Sciurus vulgaris</i>).</p>\\n </div>\",\"PeriodicalId\":14479,\"journal\":{\"name\":\"International Statistical Review\",\"volume\":\"90 2\",\"pages\":\"328-345\"},\"PeriodicalIF\":1.7000,\"publicationDate\":\"2022-01-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Statistical Review\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/insr.12485\",\"RegionNum\":3,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"STATISTICS & PROBABILITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Statistical Review","FirstCategoryId":"100","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/insr.12485","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
Constructing Flexible, Identifiable and Interpretable Statistical Models for Binary Data
Binary regression models are ubiquitous in virtually every scientific field. Frequently, traditional generalised linear models fail to capture the variability in the probability surface that gives rise to the binary observations, and remedial methods are required. This has generated a substantial literature composed of binary regression models motivated by various applications. We describe an organisation of generalisations to traditional binary regression methods based on the familiar three-part structure of generalised linear models (random component, systematic component and link function). This perspective facilitates both the comparison of existing approaches and the development of flexible models with interpretable parameters that capture application-specific data-generating mechanisms. We use our proposed organisational structure to discuss concerns with certain existing models for binary data based on quantile regression. We then use the framework to develop and compare several binary regression models tailored to occupancy data for European red squirrels (Sciurus vulgaris).
期刊介绍:
International Statistical Review is the flagship journal of the International Statistical Institute (ISI) and of its family of Associations. It publishes papers of broad and general interest in statistics and probability. The term Review is to be interpreted broadly. The types of papers that are suitable for publication include (but are not limited to) the following: reviews/surveys of significant developments in theory, methodology, statistical computing and graphics, statistical education, and application areas; tutorials on important topics; expository papers on emerging areas of research or application; papers describing new developments and/or challenges in relevant areas; papers addressing foundational issues; papers on the history of statistics and probability; white papers on topics of importance to the profession or society; and historical assessment of seminal papers in the field and their impact.