{"title":"Efficient distributed estimation for expectile regression in increasing dimensions","authors":"Xiaoyan Li, Zhimin Zhang","doi":"10.1016/j.apm.2025.115974","DOIUrl":null,"url":null,"abstract":"<div><div>In this paper, we introduce an efficient surrogate loss method for large-scale expectile regression in non-randomly distributed scenarios. Specifically, a Poisson subsampling-based distributed asymmetric least squares estimator is proposed. Our theoretical analysis establishes the consistency and asymptotic normality as the dimensionality tends to infinity, demonstrating that the proposed estimator achieves statistical efficiency comparable to that of the global estimator. A practical three-step algorithm is presented, offering an efficient implementation in practical applications. The proposed estimator exhibits two notable advantages: (i) it is communication-efficient, utilising all the data but only requiring the transmission of a small subsample and the local gradient from each local site; and (ii) it can effectively adapt to unevenly distributed data and non-randomly stored data. Within the Newton-Raphson algorithm, the initial value and the Hessian matrix are computed with enhanced robustness using the Poisson subsampling-derived subsample than using one local dataset or uniform subsampling-derived subsample. Both simulation studies and empirical results confirm that the proposed estimator enhanced estimation efficiency relative to existing methods.</div></div>","PeriodicalId":50980,"journal":{"name":"Applied Mathematical Modelling","volume":"142 ","pages":"Article 115974"},"PeriodicalIF":4.4000,"publicationDate":"2025-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Mathematical Modelling","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0307904X25000496","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, we introduce an efficient surrogate loss method for large-scale expectile regression in non-randomly distributed scenarios. Specifically, a Poisson subsampling-based distributed asymmetric least squares estimator is proposed. Our theoretical analysis establishes the consistency and asymptotic normality as the dimensionality tends to infinity, demonstrating that the proposed estimator achieves statistical efficiency comparable to that of the global estimator. A practical three-step algorithm is presented, offering an efficient implementation in practical applications. The proposed estimator exhibits two notable advantages: (i) it is communication-efficient, utilising all the data but only requiring the transmission of a small subsample and the local gradient from each local site; and (ii) it can effectively adapt to unevenly distributed data and non-randomly stored data. Within the Newton-Raphson algorithm, the initial value and the Hessian matrix are computed with enhanced robustness using the Poisson subsampling-derived subsample than using one local dataset or uniform subsampling-derived subsample. Both simulation studies and empirical results confirm that the proposed estimator enhanced estimation efficiency relative to existing methods.
期刊介绍:
Applied Mathematical Modelling focuses on research related to the mathematical modelling of engineering and environmental processes, manufacturing, and industrial systems. A significant emerging area of research activity involves multiphysics processes, and contributions in this area are particularly encouraged.
This influential publication covers a wide spectrum of subjects including heat transfer, fluid mechanics, CFD, and transport phenomena; solid mechanics and mechanics of metals; electromagnets and MHD; reliability modelling and system optimization; finite volume, finite element, and boundary element procedures; modelling of inventory, industrial, manufacturing and logistics systems for viable decision making; civil engineering systems and structures; mineral and energy resources; relevant software engineering issues associated with CAD and CAE; and materials and metallurgical engineering.
Applied Mathematical Modelling is primarily interested in papers developing increased insights into real-world problems through novel mathematical modelling, novel applications or a combination of these. Papers employing existing numerical techniques must demonstrate sufficient novelty in the solution of practical problems. Papers on fuzzy logic in decision-making or purely financial mathematics are normally not considered. Research on fractional differential equations, bifurcation, and numerical methods needs to include practical examples. Population dynamics must solve realistic scenarios. Papers in the area of logistics and business modelling should demonstrate meaningful managerial insight. Submissions with no real-world application will not be considered.