{"title":"利用贝叶斯分层抽样加权零膨胀回归建立县级罕见病患病率模型","authors":"Hui Xie, Deborah B Rolka, Lawrence E Barker","doi":"10.6339/22-JDS1049","DOIUrl":null,"url":null,"abstract":"<p><p>Estimates of county-level disease prevalence have a variety of applications. Such estimation is often done via model-based small-area estimation using survey data. However, for conditions with low prevalence (i.e., rare diseases or newly diagnosed diseases), counties with a high fraction of zero counts in surveys are common. They are often more common than the model used would lead one to expect; such zeros are called 'excess zeros'. The excess zeros can be structural (there are no cases to find) or sampling (there are cases, but none were selected for sampling). These issues are often addressed by combining multiple years of data. However, this approach can obscure trends in annual estimates and prevent estimates from being timely. Using single-year survey data, we proposed a Bayesian weighted Binomial Zero-inflated (BBZ) model to estimate county-level rare diseases prevalence. The BBZ model accounts for excess zero counts, the sampling weights and uses a power prior. We evaluated BBZ with American Community Survey results and simulated data. We showed that BBZ yielded less bias and smaller variance than estimates based on the binomial distribution, a common approach to this problem. Since BBZ uses only a single year of survey data, BBZ produces more timely county-level incidence estimates. These timely estimates help pinpoint the special areas of county-level needs and help medical researchers and public health practitioners promptly evaluate rare diseases trends and associations with other health conditions.</p>","PeriodicalId":47722,"journal":{"name":"COMMUNICATION EDUCATION","volume":"67 1","pages":"145-157"},"PeriodicalIF":0.9000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11119276/pdf/","citationCount":"0","resultStr":"{\"title\":\"Modeling County-Level Rare Disease Prevalence Using Bayesian Hierarchical Sampling Weighted Zero-Inflated Regression.\",\"authors\":\"Hui Xie, Deborah B Rolka, Lawrence E Barker\",\"doi\":\"10.6339/22-JDS1049\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Estimates of county-level disease prevalence have a variety of applications. Such estimation is often done via model-based small-area estimation using survey data. However, for conditions with low prevalence (i.e., rare diseases or newly diagnosed diseases), counties with a high fraction of zero counts in surveys are common. They are often more common than the model used would lead one to expect; such zeros are called 'excess zeros'. The excess zeros can be structural (there are no cases to find) or sampling (there are cases, but none were selected for sampling). These issues are often addressed by combining multiple years of data. However, this approach can obscure trends in annual estimates and prevent estimates from being timely. Using single-year survey data, we proposed a Bayesian weighted Binomial Zero-inflated (BBZ) model to estimate county-level rare diseases prevalence. The BBZ model accounts for excess zero counts, the sampling weights and uses a power prior. We evaluated BBZ with American Community Survey results and simulated data. We showed that BBZ yielded less bias and smaller variance than estimates based on the binomial distribution, a common approach to this problem. Since BBZ uses only a single year of survey data, BBZ produces more timely county-level incidence estimates. These timely estimates help pinpoint the special areas of county-level needs and help medical researchers and public health practitioners promptly evaluate rare diseases trends and associations with other health conditions.</p>\",\"PeriodicalId\":47722,\"journal\":{\"name\":\"COMMUNICATION EDUCATION\",\"volume\":\"67 1\",\"pages\":\"145-157\"},\"PeriodicalIF\":0.9000,\"publicationDate\":\"2023-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11119276/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"COMMUNICATION EDUCATION\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.6339/22-JDS1049\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMMUNICATION\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"COMMUNICATION EDUCATION","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.6339/22-JDS1049","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMMUNICATION","Score":null,"Total":0}
Estimates of county-level disease prevalence have a variety of applications. Such estimation is often done via model-based small-area estimation using survey data. However, for conditions with low prevalence (i.e., rare diseases or newly diagnosed diseases), counties with a high fraction of zero counts in surveys are common. They are often more common than the model used would lead one to expect; such zeros are called 'excess zeros'. The excess zeros can be structural (there are no cases to find) or sampling (there are cases, but none were selected for sampling). These issues are often addressed by combining multiple years of data. However, this approach can obscure trends in annual estimates and prevent estimates from being timely. Using single-year survey data, we proposed a Bayesian weighted Binomial Zero-inflated (BBZ) model to estimate county-level rare diseases prevalence. The BBZ model accounts for excess zero counts, the sampling weights and uses a power prior. We evaluated BBZ with American Community Survey results and simulated data. We showed that BBZ yielded less bias and smaller variance than estimates based on the binomial distribution, a common approach to this problem. Since BBZ uses only a single year of survey data, BBZ produces more timely county-level incidence estimates. These timely estimates help pinpoint the special areas of county-level needs and help medical researchers and public health practitioners promptly evaluate rare diseases trends and associations with other health conditions.
期刊介绍:
Communication Education is a peer-reviewed publication of the National Communication Association. Communication Education publishes original scholarship that advances understanding of the role of communication in the teaching and learning process in diverse spaces, structures, and interactions, within and outside of academia. Communication Education welcomes scholarship from diverse perspectives and methodologies, including quantitative, qualitative, and critical/textual approaches. All submissions must be methodologically rigorous and theoretically grounded and geared toward advancing knowledge production in communication, teaching, and learning. Scholarship in Communication Education addresses the intersections of communication, teaching, and learning related to topics and contexts that include but are not limited to: • student/teacher relationships • student/teacher characteristics • student/teacher identity construction • student learning outcomes • student engagement • diversity, inclusion, and difference • social justice • instructional technology/social media • the basic communication course • service learning • communication across the curriculum • communication instruction in business and the professions • communication instruction in civic arenas In addition to articles, the journal will publish occasional scholarly exchanges on topics related to communication, teaching, and learning, such as: • Analytic review articles: agenda-setting pieces including examinations of key questions about the field • Forum essays: themed pieces for dialogue or debate on current communication, teaching, and learning issues