一种计算效率高的多层次数据序列回归插值算法

IF 16.4 1区 化学 Q1 CHEMISTRY, MULTIDISCIPLINARY
Tugba Akkaya Hocagil, Recai M. Yucel
{"title":"一种计算效率高的多层次数据序列回归插值算法","authors":"Tugba Akkaya Hocagil, Recai M. Yucel","doi":"10.1080/02664763.2023.2277669","DOIUrl":null,"url":null,"abstract":"ABSTRACTDue to the computational burden, especially in high-dimensional settings, sequential imputation may not be practical. In this paper, we adopt computationally advantageous methods by sampling the missing data from their perspective predictive distributions, which leads to significantly improved computation time in the class of variable-by-variable imputation algorithms. We assess the computational performance in a comprehensive simulation study. We then compare and contrast the performance of our algorithm with commonly used alternatives. The results show that our method has a significant advantage over the commonly used alternatives with respect to computational efficiency and inferential quality. Finally, we demonstrate our methods in a substantive problem aimed at investigating the effects of area-level behavioral, socioeconomic, and demographic characteristics on poor birth outcomes in New York State among singleton births.KEYWORDS: Sequential regression imputationmultilevel datacomputational efficiencyfast variable by variable imputationmultiple imputation by chained equations AcknowledgmentsWe thank Dr. Tabassum Insaf for providing assistance in accessing the New York State Vital Records Registry data.Disclosure statementNo potential conflict of interest was reported by the author(s).","PeriodicalId":1,"journal":{"name":"Accounts of Chemical Research","volume":null,"pages":null},"PeriodicalIF":16.4000,"publicationDate":"2023-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A computationally efficient sequential regression imputation algorithm for multilevel data\",\"authors\":\"Tugba Akkaya Hocagil, Recai M. Yucel\",\"doi\":\"10.1080/02664763.2023.2277669\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"ABSTRACTDue to the computational burden, especially in high-dimensional settings, sequential imputation may not be practical. In this paper, we adopt computationally advantageous methods by sampling the missing data from their perspective predictive distributions, which leads to significantly improved computation time in the class of variable-by-variable imputation algorithms. We assess the computational performance in a comprehensive simulation study. We then compare and contrast the performance of our algorithm with commonly used alternatives. The results show that our method has a significant advantage over the commonly used alternatives with respect to computational efficiency and inferential quality. Finally, we demonstrate our methods in a substantive problem aimed at investigating the effects of area-level behavioral, socioeconomic, and demographic characteristics on poor birth outcomes in New York State among singleton births.KEYWORDS: Sequential regression imputationmultilevel datacomputational efficiencyfast variable by variable imputationmultiple imputation by chained equations AcknowledgmentsWe thank Dr. Tabassum Insaf for providing assistance in accessing the New York State Vital Records Registry data.Disclosure statementNo potential conflict of interest was reported by the author(s).\",\"PeriodicalId\":1,\"journal\":{\"name\":\"Accounts of Chemical Research\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":16.4000,\"publicationDate\":\"2023-11-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Accounts of Chemical Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1080/02664763.2023.2277669\",\"RegionNum\":1,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accounts of Chemical Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/02664763.2023.2277669","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

摘要由于计算量大,特别是在高维环境下,序贯输入可能不太实用。在本文中,我们采用了计算优势的方法,从缺失数据的预测分布角度对缺失数据进行采样,从而显著提高了变量逐变量插值算法的计算时间。我们在一个全面的模拟研究中评估了计算性能。然后,我们将我们的算法与常用替代算法的性能进行比较和对比。结果表明,我们的方法在计算效率和推理质量方面比常用的替代方法具有显著的优势。最后,我们在一个实质性问题中展示了我们的方法,该问题旨在调查纽约州单胎分娩中区域层面行为、社会经济和人口特征对不良出生结果的影响。关键词:序贯回归、多层次数据、计算效率、逐变量快速、链式方程多元、感谢Tabassum Insaf博士在获取纽约州生命记录登记处数据方面提供的帮助。披露声明作者未报告潜在的利益冲突。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A computationally efficient sequential regression imputation algorithm for multilevel data
ABSTRACTDue to the computational burden, especially in high-dimensional settings, sequential imputation may not be practical. In this paper, we adopt computationally advantageous methods by sampling the missing data from their perspective predictive distributions, which leads to significantly improved computation time in the class of variable-by-variable imputation algorithms. We assess the computational performance in a comprehensive simulation study. We then compare and contrast the performance of our algorithm with commonly used alternatives. The results show that our method has a significant advantage over the commonly used alternatives with respect to computational efficiency and inferential quality. Finally, we demonstrate our methods in a substantive problem aimed at investigating the effects of area-level behavioral, socioeconomic, and demographic characteristics on poor birth outcomes in New York State among singleton births.KEYWORDS: Sequential regression imputationmultilevel datacomputational efficiencyfast variable by variable imputationmultiple imputation by chained equations AcknowledgmentsWe thank Dr. Tabassum Insaf for providing assistance in accessing the New York State Vital Records Registry data.Disclosure statementNo potential conflict of interest was reported by the author(s).
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Accounts of Chemical Research
Accounts of Chemical Research 化学-化学综合
CiteScore
31.40
自引率
1.10%
发文量
312
审稿时长
2 months
期刊介绍: Accounts of Chemical Research presents short, concise and critical articles offering easy-to-read overviews of basic research and applications in all areas of chemistry and biochemistry. These short reviews focus on research from the author’s own laboratory and are designed to teach the reader about a research project. In addition, Accounts of Chemical Research publishes commentaries that give an informed opinion on a current research problem. Special Issues online are devoted to a single topic of unusual activity and significance. Accounts of Chemical Research replaces the traditional article abstract with an article "Conspectus." These entries synopsize the research affording the reader a closer look at the content and significance of an article. Through this provision of a more detailed description of the article contents, the Conspectus enhances the article's discoverability by search engines and the exposure for the research.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信