A prediction model for classifying maternal pregnancy smoking using California state birth certificate information.

IF 2.7 3区 医学 Q2 OBSTETRICS & GYNECOLOGY
Paediatric and perinatal epidemiology Pub Date : 2024-02-01 Epub Date: 2023-11-15 DOI:10.1111/ppe.13021
Di He, Xiwen Huang, Onyebuchi A Arah, Douglas I Walker, Dean P Jones, Beate Ritz, Julia E Heck
{"title":"A prediction model for classifying maternal pregnancy smoking using California state birth certificate information.","authors":"Di He, Xiwen Huang, Onyebuchi A Arah, Douglas I Walker, Dean P Jones, Beate Ritz, Julia E Heck","doi":"10.1111/ppe.13021","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Systematically recorded smoking data are not always available in vital statistics records, and even when available it can underestimate true smoking rates.</p><p><strong>Objective: </strong>To develop a prediction model for maternal tobacco smoking in late pregnancy based on birth certificate information using a combination of self- or provider-reported smoking and biomarkers (smoking metabolites) in neonatal blood spots as the alloyed gold standard.</p><p><strong>Methods: </strong>We designed a case-control study where childhood cancer cases were identified from the California Cancer Registry and controls were from the California birth rolls between 1983 and 2011 who were cancer-free by the age of six. In this analysis, we included 894 control participants and performed high-resolution metabolomics analyses in their neonatal dried blood spots, where we extracted cotinine [mass-to-charge ratio (m/z) = 177.1023] and hydroxycotinine (m/z = 193.0973). Potential predictors of smoking were selected from California birth certificates. Logistic regression with stepwise backward selection was used to build a prediction model. Model performance was evaluated in a training sample, a bootstrapped sample, and an external validation sample.</p><p><strong>Results: </strong>Out of seven predictor variables entered into the logistic model, five were selected by the stepwise procedure: maternal race/ethnicity, maternal education, child's birth year, parity, and child's birth weight. We calculated an overall discrimination accuracy of 0.72 and an area under the receiver operating characteristic curve (AUC) of 0.81 (95% confidence interval [CI] 0.77, 0.84) in the training set. Similar accuracies were achieved in the internal (AUC 0.81, 95% CI 0.77, 0.84) and external (AUC 0.69, 95% CI 0.64, 0.74) validation sets.</p><p><strong>Conclusions: </strong>This easy-to-apply model may benefit future birth registry-based studies when there is missing maternal smoking information; however, some smoking status misclassification remains a concern when only variables from the birth certificate are used to predict maternal smoking.</p>","PeriodicalId":19698,"journal":{"name":"Paediatric and perinatal epidemiology","volume":null,"pages":null},"PeriodicalIF":2.7000,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10922711/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Paediatric and perinatal epidemiology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/ppe.13021","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/11/15 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"OBSTETRICS & GYNECOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Systematically recorded smoking data are not always available in vital statistics records, and even when available it can underestimate true smoking rates.

Objective: To develop a prediction model for maternal tobacco smoking in late pregnancy based on birth certificate information using a combination of self- or provider-reported smoking and biomarkers (smoking metabolites) in neonatal blood spots as the alloyed gold standard.

Methods: We designed a case-control study where childhood cancer cases were identified from the California Cancer Registry and controls were from the California birth rolls between 1983 and 2011 who were cancer-free by the age of six. In this analysis, we included 894 control participants and performed high-resolution metabolomics analyses in their neonatal dried blood spots, where we extracted cotinine [mass-to-charge ratio (m/z) = 177.1023] and hydroxycotinine (m/z = 193.0973). Potential predictors of smoking were selected from California birth certificates. Logistic regression with stepwise backward selection was used to build a prediction model. Model performance was evaluated in a training sample, a bootstrapped sample, and an external validation sample.

Results: Out of seven predictor variables entered into the logistic model, five were selected by the stepwise procedure: maternal race/ethnicity, maternal education, child's birth year, parity, and child's birth weight. We calculated an overall discrimination accuracy of 0.72 and an area under the receiver operating characteristic curve (AUC) of 0.81 (95% confidence interval [CI] 0.77, 0.84) in the training set. Similar accuracies were achieved in the internal (AUC 0.81, 95% CI 0.77, 0.84) and external (AUC 0.69, 95% CI 0.64, 0.74) validation sets.

Conclusions: This easy-to-apply model may benefit future birth registry-based studies when there is missing maternal smoking information; however, some smoking status misclassification remains a concern when only variables from the birth certificate are used to predict maternal smoking.

使用加州出生证明信息分类孕妇吸烟的预测模型。
背景:在生命统计记录中,系统记录的吸烟数据并不总是可用的,即使可用,也可能低估真实的吸烟率。目的:建立一种基于出生证明信息的孕妇妊娠晚期吸烟预测模型,将新生儿血斑中吸烟代谢物(吸烟代谢物)与自我报告吸烟相结合作为合金金标准。方法:我们设计了一项病例对照研究,从加州癌症登记处确定儿童癌症病例,对照来自1983年至2011年期间6岁前无癌症的加州出生名单。在这项分析中,我们纳入了894名对照参与者,并对他们的新生儿干血斑点进行了高分辨率代谢组学分析,其中我们提取了可替宁[质量电荷比(m/z) = 177.1023]和羟基可替宁(m/z = 193.0973)。吸烟的潜在预测因素是从加州的出生证明中挑选出来的。采用逐步回归选择的逻辑回归方法建立预测模型。模型性能在训练样本、自举样本和外部验证样本中进行评估。结果:在进入logistic模型的7个预测变量中,通过逐步程序选择了5个预测变量:母亲种族/民族、母亲教育程度、孩子出生年份、胎次和孩子出生体重。我们计算出训练集中的总体识别精度为0.72,接收者工作特征曲线下面积(AUC)为0.81(95%置信区间[CI] 0.77, 0.84)。内部验证集(AUC 0.81, 95% CI 0.77, 0.84)和外部验证集(AUC 0.69, 95% CI 0.64, 0.74)的准确度相似。结论:当缺少产妇吸烟信息时,这个易于应用的模型可能有利于未来基于出生登记的研究;然而,当仅使用出生证明中的变量来预测母亲吸烟时,一些吸烟状况的错误分类仍然是一个问题。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
5.40
自引率
7.10%
发文量
84
审稿时长
1 months
期刊介绍: Paediatric and Perinatal Epidemiology crosses the boundaries between the epidemiologist and the paediatrician, obstetrician or specialist in child health, ensuring that important paediatric and perinatal studies reach those clinicians for whom the results are especially relevant. In addition to original research articles, the Journal also includes commentaries, book reviews and annotations.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信