Semiparametric Recovery of Central Dimension Reduction Space with Nonignorable Nonresponse

Impact factor 1.4; Zone 3 (Mathematics); JCR Q2 (Statistics & Probability)
Siming Zheng, Alan T.K. Wan, Yong Zhou
Statistica Neerlandica, published 2023-09-06. DOI: 10.1111/stan.12321 (https://doi.org/10.1111/stan.12321)

Abstract

Sufficient dimension reduction (SDR) methods are effective tools for handling high-dimensional data. Classical SDR methods are developed under the assumption that the data are completely observed. For incomplete data, SDR has so far been considered only when the data are randomly missing, not when they are non-ignorably missing, a setting that is arguably harder to handle because whether a value is missing depends on the unobserved value itself. This paper fills that gap. We propose an intuitive, easy-to-implement SDR estimator, based on a semiparametric propensity score function, for response data with non-ignorable missing values; we refer to it as the dimension reduction-based imputed estimator. We establish the theoretical properties of this estimator and examine its empirical performance in an extensive numerical study on real and simulated data. We also compare it with two competing estimators: the fusion refined estimator and the cumulative slicing estimator. A distinguishing feature of our method is that it requires no validation sample. Because of the technical challenges posed by non-ignorable missingness, the SDR theory developed in this paper is a non-trivial extension of the existing literature. All technical proofs of the theorems are given in the Online Supplementary Material. This article is protected by copyright. All rights reserved.
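For readers unfamiliar with the competitors named in the abstract, the following is a minimal sketch of cumulative slicing estimation in its classical, complete-data form. It is not the dimension reduction-based imputed estimator proposed in the paper, and it does not handle missing responses. The function name `cume_directions`, the simulated model, and all parameter values are illustrative assumptions.

```python
import numpy as np

def cume_directions(X, y, d):
    """Complete-data cumulative slicing estimate of d SDR directions.

    X is an (n, p) predictor matrix, y a length-n response vector.
    Returns a (p, d) matrix whose columns have unit length.
    """
    n, p = X.shape
    # Standardize the predictors: Z = (X - mean) Sigma^{-1/2}
    Xc = X - X.mean(axis=0)
    Sigma = Xc.T @ Xc / n
    evals, evecs = np.linalg.eigh(Sigma)
    Sigma_inv_sqrt = evecs @ np.diag(evals ** -0.5) @ evecs.T
    Z = Xc @ Sigma_inv_sqrt
    # ind[i, j] = 1 if y_i <= y_j, so each observed y_j acts as a cut point
    ind = (y[:, None] <= y[None, :]).astype(float)
    # Column j of m is the sample mean of Z_i * 1(y_i <= y_j)
    m = Z.T @ ind / n
    # Kernel matrix: average outer product over all cut points
    M = m @ m.T / n
    _, vecs = np.linalg.eigh(M)       # eigenvalues in ascending order
    top = vecs[:, ::-1][:, :d]        # leading d eigenvectors
    B = Sigma_inv_sqrt @ top          # map back to the original X scale
    return B / np.linalg.norm(B, axis=0)

# Illustrative single-index model (assumed, not taken from the paper)
rng = np.random.default_rng(0)
X = rng.standard_normal((1000, 6))
beta = np.array([1.0, 0.0, 0.0, 0.0, 0.0, 0.0])
y = X @ beta + 0.2 * rng.standard_normal(1000)
B = cume_directions(X, y, d=1)
abs_cosine = abs(B[:, 0] @ beta)      # closeness of the estimate to beta
```

Using every observed response as a cut point avoids choosing a number of slices, which is the usual motivation for cumulative slicing over fixed-slice methods such as sliced inverse regression.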
Source journal
Statistica Neerlandica (Mathematics: Statistics & Probability)
CiteScore: 2.60
Self-citation rate: 6.70%
Articles published: 26
Review time: >12 weeks
About the journal: Statistica Neerlandica has been the journal of the Netherlands Society for Statistics and Operations Research since 1946. It covers all areas of statistics, from theoretical to applied, with a special emphasis on mathematical statistics, statistics for the behavioural sciences and biostatistics. This wide scope is reflected by the expertise of the journal’s editors representing these areas. The diverse editorial board is committed to a fast and fair reviewing process, and will judge submissions on quality, correctness, relevance and originality. Statistica Neerlandica encourages transparency and reproducibility, and offers online resources to make data, code, simulation results and other additional materials publicly available.