使用分类和回归树对青少年BMI、身高和体重数据的缺失进行建模。

IF 2.2 4区 医学 Q2 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH
Amanda Doggett, Ashok Chaurasia, Jean-Philippe Chaput, Scott T Leatherdale
{"title":"使用分类和回归树对青少年BMI、身高和体重数据的缺失进行建模。","authors":"Amanda Doggett,&nbsp;Ashok Chaurasia,&nbsp;Jean-Philippe Chaput,&nbsp;Scott T Leatherdale","doi":"10.24095/hpcdp.43.5.03","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Research suggests that there is often a high degree of missingness in youth body mass index (BMI) data derived from self-reported measures, which may have a large effect on research findings. The first step in handling missing data is to examine the levels and patterns of missingness. However, previous studies examining youth BMI missingness used logistic regression, which is limited in its ability to discern subgroups or identify a hierarchy of importance for variables, aspects that may go a long way in helping understand missing data patterns.</p><p><strong>Methods: </strong>This study used sex-stratified classification and regression tree (CART) models to examine missingness in height, body mass and BMI data among 74 501 youth participating in the 2018/19 COMPASS study (a prospective cohort study examining health behaviours among Canadian youth), where 31% of BMI data were missing. Diet, movement, academic, mental health and substance use variables were examined for associations with missingness in height, body mass and BMI.</p><p><strong>Results: </strong>CART models indicated that the combination of being younger, having a selfperception of being overweight, being less physically active and having poorer mental health yielded female and male subgroups highly likely to be missing BMI values. Survey respondents who did not perceive themselves as overweight and who were older were unlikely to be missing BMI values.</p><p><strong>Conclusion: </strong>The subgroups identified by the CART models indicate that a sample that deletes cases with missing BMI would be biased towards physically, emotionally and mentally healthier youth. Given the ability of CART models to identify these subgroups and a hierarchy of variable importance, they are an invaluable tool for examining missing data patterns and appropriate handling of missing data.</p>","PeriodicalId":51316,"journal":{"name":"Health Promotion and Chronic Disease Prevention in Canada-Research Policy and Practice","volume":"43 5","pages":"231-242"},"PeriodicalIF":2.2000,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10237263/pdf/43_5_3.pdf","citationCount":"1","resultStr":"{\"title\":\"Using classification and regression trees to model missingness in youth BMI, height and body mass data.\",\"authors\":\"Amanda Doggett,&nbsp;Ashok Chaurasia,&nbsp;Jean-Philippe Chaput,&nbsp;Scott T Leatherdale\",\"doi\":\"10.24095/hpcdp.43.5.03\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Introduction: </strong>Research suggests that there is often a high degree of missingness in youth body mass index (BMI) data derived from self-reported measures, which may have a large effect on research findings. The first step in handling missing data is to examine the levels and patterns of missingness. However, previous studies examining youth BMI missingness used logistic regression, which is limited in its ability to discern subgroups or identify a hierarchy of importance for variables, aspects that may go a long way in helping understand missing data patterns.</p><p><strong>Methods: </strong>This study used sex-stratified classification and regression tree (CART) models to examine missingness in height, body mass and BMI data among 74 501 youth participating in the 2018/19 COMPASS study (a prospective cohort study examining health behaviours among Canadian youth), where 31% of BMI data were missing. Diet, movement, academic, mental health and substance use variables were examined for associations with missingness in height, body mass and BMI.</p><p><strong>Results: </strong>CART models indicated that the combination of being younger, having a selfperception of being overweight, being less physically active and having poorer mental health yielded female and male subgroups highly likely to be missing BMI values. Survey respondents who did not perceive themselves as overweight and who were older were unlikely to be missing BMI values.</p><p><strong>Conclusion: </strong>The subgroups identified by the CART models indicate that a sample that deletes cases with missing BMI would be biased towards physically, emotionally and mentally healthier youth. Given the ability of CART models to identify these subgroups and a hierarchy of variable importance, they are an invaluable tool for examining missing data patterns and appropriate handling of missing data.</p>\",\"PeriodicalId\":51316,\"journal\":{\"name\":\"Health Promotion and Chronic Disease Prevention in Canada-Research Policy and Practice\",\"volume\":\"43 5\",\"pages\":\"231-242\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2023-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10237263/pdf/43_5_3.pdf\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Health Promotion and Chronic Disease Prevention in Canada-Research Policy and Practice\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.24095/hpcdp.43.5.03\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Health Promotion and Chronic Disease Prevention in Canada-Research Policy and Practice","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.24095/hpcdp.43.5.03","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 1

摘要

导论:研究表明,青少年自我报告的体重指数(BMI)数据往往存在高度缺失,这可能对研究结果产生很大影响。处理缺失数据的第一步是检查缺失的程度和模式。然而,之前研究青少年BMI缺失的研究使用的是逻辑回归,这在辨别亚组或确定变量的重要性层次的能力上是有限的,这些方面可能有助于理解缺失的数据模式。方法:本研究使用性别分层分类和回归树(CART)模型来检查参加2018/19 COMPASS研究(一项研究加拿大青年健康行为的前瞻性队列研究)的74 501名青年的身高、体重和BMI数据缺失,其中31%的BMI数据缺失。研究人员检查了饮食、运动、学业、心理健康和物质使用等变量与身高、体重和体重指数缺失的关系。结果:CART模型表明,年轻、自我感觉超重、身体活动较少和心理健康状况较差的组合,导致女性和男性亚组极有可能失去BMI值。不认为自己超重且年龄较大的受访者不太可能错过BMI值。结论:CART模型确定的亚组表明,删除BMI缺失病例的样本将偏向于身体、情感和心理更健康的年轻人。由于CART模型能够识别这些子组和不同重要性的层次结构,因此它们是检查缺失数据模式和适当处理缺失数据的宝贵工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Using classification and regression trees to model missingness in youth BMI, height and body mass data.

Introduction: Research suggests that there is often a high degree of missingness in youth body mass index (BMI) data derived from self-reported measures, which may have a large effect on research findings. The first step in handling missing data is to examine the levels and patterns of missingness. However, previous studies examining youth BMI missingness used logistic regression, which is limited in its ability to discern subgroups or identify a hierarchy of importance for variables, aspects that may go a long way in helping understand missing data patterns.

Methods: This study used sex-stratified classification and regression tree (CART) models to examine missingness in height, body mass and BMI data among 74 501 youth participating in the 2018/19 COMPASS study (a prospective cohort study examining health behaviours among Canadian youth), where 31% of BMI data were missing. Diet, movement, academic, mental health and substance use variables were examined for associations with missingness in height, body mass and BMI.

Results: CART models indicated that the combination of being younger, having a selfperception of being overweight, being less physically active and having poorer mental health yielded female and male subgroups highly likely to be missing BMI values. Survey respondents who did not perceive themselves as overweight and who were older were unlikely to be missing BMI values.

Conclusion: The subgroups identified by the CART models indicate that a sample that deletes cases with missing BMI would be biased towards physically, emotionally and mentally healthier youth. Given the ability of CART models to identify these subgroups and a hierarchy of variable importance, they are an invaluable tool for examining missing data patterns and appropriate handling of missing data.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
4.10
自引率
0.00%
发文量
65
审稿时长
40 weeks
期刊介绍: Health Promotion and Chronic Disease Prevention in Canada: Research, Policy and Practice (the HPCDP Journal) is the monthly, online scientific journal of the Health Promotion and Chronic Disease Prevention Branch of the Public Health Agency of Canada. The journal publishes articles on disease prevention, health promotion and health equity in the areas of chronic diseases, injuries and life course health. Content includes research from fields such as public/community health, epidemiology, biostatistics, the behavioural and social sciences, and health services or economics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信