arXiv: Methodology最新文献_第2页

Statistical Inference on the Cure Time 治愈时间的统计推断

arXiv: Methodology Pub Date : 2020-09-28 DOI: 10.6342/NTU201904221

Yueh Wang, Hung Hung

{"title":"Statistical Inference on the Cure Time","authors":"Yueh Wang, Hung Hung","doi":"10.6342/NTU201904221","DOIUrl":"https://doi.org/10.6342/NTU201904221","url":null,"abstract":"In population-based cancer survival analysis, the net survival is important for government to assess health care programs. For decades, it is observed that the net survival reaches a plateau after long-term follow-up, this is so called ``statistical cure''. Several methods were proposed to address the statistical cure. Besides, the cure time can be used to evaluate the time period of a health care program for a specific patient population, and it also can be helpful for a clinician to explain the prognosis for patients, therefore the cure time is an important health care index. However, those proposed methods assume the cure time to be infinity, thus it is inconvenient to make inference on the cure time. In this dissertation, we define a more general concept of statistical cure via conditional survival. Based on the newly defined statistical cure, the cure time is well defined. We develop cure time model methodologies and show a variety of properties through simulation. In data analysis, cure times are estimated for 22 major cancers in Taiwan, we further use colorectal cancer data as an example to conduct statistical inference via cure time model with covariate sex, age group, and stage. This dissertation provides a methodology to obtain cure time estimate, which can contribute to public health policy making.","PeriodicalId":186390,"journal":{"name":"arXiv: Methodology","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134620431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Parsimonious Feature Extraction Methods: Extending Robust Probabilistic Projections with Generalized Skew-t 简化特征提取方法:扩展广义Skew-t稳健概率投影

arXiv: Methodology Pub Date : 2020-09-24 DOI: 10.2139/ssrn.3678383

Dorota Toczydlowska, G. Peters, P. Shevchenko

引用次数: 1

An Introduction to Proximal Causal Learning 近端因果学习导论

arXiv: Methodology Pub Date : 2020-09-23 DOI: 10.1101/2020.09.21.20198762

E. T. Tchetgen, Andrew Ying, Yifan Cui, Xu Shi, Wang Miao

{"title":"An Introduction to Proximal Causal Learning","authors":"E. T. Tchetgen, Andrew Ying, Yifan Cui, Xu Shi, Wang Miao","doi":"10.1101/2020.09.21.20198762","DOIUrl":"https://doi.org/10.1101/2020.09.21.20198762","url":null,"abstract":"A standard assumption for causal inference from observational data is that one has measured a sufficiently rich set of covariates to ensure that within covariate strata, subjects are exchangeable across observed treatment values. Skepticism about the exchangeability assumption in observational studies is often warranted because it hinges on investigators' ability to accurately measure covariates capturing all potential sources of confounding. Realistically, confounding mechanisms can rarely if ever, be learned with certainty from measured covariates. One can therefore only ever hope that covariate measurements are at best proxies of true underlying confounding mechanisms operating in an observational study, thus invalidating causal claims made on basis of standard exchangeability conditions. Causal learning from proxies is a challenging inverse problem which has to date remained unresolved. In this paper, we introduce a formal potential outcome framework for proximal causal learning, which while explicitly acknowledging covariate measurements as imperfect proxies of confounding mechanisms, offers an opportunity to learn about causal effects in settings where exchangeability on the basis of measured covariates fails. Sufficient conditions for nonparametric identification are given, leading to the proximal g-formula and corresponding proximal g-computation algorithm for estimation. These may be viewed as generalizations of Robins' foundational g-formula and g-computation algorithm, which account explicitly for bias due to unmeasured confounding. Both point treatment and time-varying treatment settings are considered, and an application of proximal g-computation of causal effects is given for illustration.","PeriodicalId":186390,"journal":{"name":"arXiv: Methodology","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123629812","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 116

Bayesian Causal Inference in Probit Graphical Models 概率图模型中的贝叶斯因果推理

arXiv: Methodology Pub Date : 2020-09-10 DOI: 10.1214/21-BA1260

F. Castelletti, G. Consonni

{"title":"Bayesian Causal Inference in Probit Graphical Models","authors":"F. Castelletti, G. Consonni","doi":"10.1214/21-BA1260","DOIUrl":"https://doi.org/10.1214/21-BA1260","url":null,"abstract":"We consider a binary response which is potentially affected by a set of continuous variables. Of special interest is the causal effect on the response due to an intervention on a specific variable. The latter can be meaningfully determined on the basis of observational data through suitable assumptions on the data generating mechanism. In particular we assume that the joint distribution obeys the conditional independencies (Markov properties) inherent in a Directed Acyclic Graph (DAG), and the DAG is given a causal interpretation through the notion of interventional distribution. We propose a DAG-probit model where the response is generated by discretization through a random threshold of a continuous latent variable and the latter, jointly with the remaining continuous variables, has a distribution belonging to a zero-mean Gaussian model whose covariance matrix is constrained to satisfy the Markov properties of the DAG. Our model leads to a natural definition of causal effect conditionally on a given DAG. Since the DAG which generates the observations is unknown, we present an efficient MCMC algorithm whose target is the posterior distribution on the space of DAGs, the Cholesky parameters of the concentration matrix, and the threshold linking the response to the latent. Our end result is a Bayesian Model Averaging estimate of the causal effect which incorporates parameter, as well as model, uncertainty. The methodology is assessed using simulation experiments and applied to a gene expression data set originating from breast cancer stem cells.","PeriodicalId":186390,"journal":{"name":"arXiv: Methodology","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121625492","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

Ensemble Riemannian Data Assimilation over the WassersteinSpace WassersteinSpace上的集合黎曼数据同化

arXiv: Methodology Pub Date : 2020-09-07 DOI: 10.5194/NPG-2021-11

S. Tamang, A. Ebtehaj, P. V. van Leeuwen, Dongmian Zou, Gilad Lerman

引用次数: 4

Evaluating Catchment Models as Multiple Working Hypotheses: on the Role of Error Metrics, Parameter Sampling, Model Structure, and Data Information Content 评价集水区模型作为多个工作假设:关于误差度量、参数抽样、模型结构和数据信息内容的作用

arXiv: Methodology Pub Date : 2020-09-01 DOI: 10.1002/essoar.10504066.1

S. Khatami, T. Peterson, M. Peel, A. Western

{"title":"Evaluating Catchment Models as Multiple Working Hypotheses: on the Role of Error Metrics, Parameter Sampling, Model Structure, and Data Information Content","authors":"S. Khatami, T. Peterson, M. Peel, A. Western","doi":"10.1002/essoar.10504066.1","DOIUrl":"https://doi.org/10.1002/essoar.10504066.1","url":null,"abstract":"To evaluate models as hypotheses, we developed the method of Flux Mapping to construct a hypothesis space based on dominant runoff generating mechanisms. Acceptable model runs, defined as total simulated flow with similar (and minimal) model error, are mapped to the hypothesis space given their simulated runoff components. In each modeling case, the hypothesis space is the result of an interplay of factors: model structure and parameterization, chosen error metric, and data information content. The aim of this study is to disentangle the role of each factor in model evaluation. We used two model structures (SACRAMENTO and SIMHYD), two parameter sampling approaches (Latin Hypercube Sampling of the parameter space and guided-search of the solution space), three widely used error metrics (Nash-Sutcliffe Efficiency - NSE, Kling-Gupta Efficiency skill score - KGEss, and Willmott refined Index of Agreement - WIA), and hydrological data from a large sample of Australian catchments. First, we characterized how the three error metrics behave under different error types and magnitudes independent of any modeling. We then conducted a series of controlled experiments to unpack the role of each factor in runoff generation hypotheses. We show that KGEss is a more reliable metric compared to NSE and WIA for model evaluation. We further demonstrate that only changing the error metric -- while other factors remain constant -- can change the model solution space and hence vary model performance, parameter sampling sufficiency, and or the flux map. We show how unreliable error metrics and insufficient parameter sampling impair model-based inferences, particularly runoff generation hypotheses.","PeriodicalId":186390,"journal":{"name":"arXiv: Methodology","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121393771","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

A Cramér–von Mises Test of Uniformity on the Hypersphere 关于超球均匀性的cram<s:1> - von Mises检验

arXiv: Methodology Pub Date : 2020-08-25 DOI: 10.1007/978-3-030-69944-4_12

Eduardo Garc'ia-Portugu'es, Paula Navarro-Esteban, J. A. Cuesta-Albertos

引用次数: 3

A Maximin $Phi_{p}$-Efficient Design for Multivariate GLM 一个Maximin $Phi_{p}$-高效的多元GLM设计

arXiv: Methodology Pub Date : 2020-08-14 DOI: 10.5705/ss.202020.0278

Yiou Li, Lulu Kang, Xinwei Deng

引用次数: 1

A Bayesian Approach to Spherical Factor Analysis for Binary Data 二值数据球面因子分析的贝叶斯方法

arXiv: Methodology Pub Date : 2020-08-12 DOI: 10.2139/ssrn.3672055

Xingchen Yu, Abel Rodríguez

引用次数: 0

Hypothesis tests for structured rank correlation matrices 结构化秩相关矩阵的假设检验

arXiv: Methodology Pub Date : 2020-07-19 DOI: 10.1080/01621459.2022.2096619

S. Perreault, J. Nešlehová, T. Duchesne

{"title":"Hypothesis tests for structured rank correlation matrices","authors":"S. Perreault, J. Nešlehová, T. Duchesne","doi":"10.1080/01621459.2022.2096619","DOIUrl":"https://doi.org/10.1080/01621459.2022.2096619","url":null,"abstract":"Joint modeling of a large number of variables often requires dimension reduction strategies that lead to structural assumptions of the underlying correlation matrix, such as equal pair-wise correlations within subsets of variables. The underlying correlation matrix is thus of interest for both model specification and model validation. In this paper, we develop tests of the hypothesis that the entries of the Kendall rank correlation matrix are linear combinations of a smaller number of parameters. The asymptotic behaviour of the proposed test statistics is investigated both when the dimension is fixed and when it grows with the sample size. We pay special attention to the restricted hypothesis of partial exchangeability, which contains full exchangeability as a special case. We show that under partial exchangeability, the test statistics and their large-sample distributions simplify, which leads to computational advantages and better performance of the tests. We propose various scalable numerical strategies for implementation of the proposed procedures, investigate their finite sample behaviour through simulations, and demonstrate their use on a real dataset of mean sea levels at various geographical locations.","PeriodicalId":186390,"journal":{"name":"arXiv: Methodology","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115813299","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4