{"title":"非随机分布数据的隐私保护估计","authors":"Xirui Liu , Ke Yang , Liwen Xu , Mixia Wu","doi":"10.1016/j.jspi.2025.106326","DOIUrl":null,"url":null,"abstract":"<div><div>This paper investigates data distributed across various machines in a non-random manner. We introduce two innovative distributed estimators, tailored to accommodate varying levels of communication cost and data privacy protection. The proposed estimators adeptly addresses the challenges associated with the non-random distribution of data. Both methods are communication-efficient, necessitating only two rounds of communication between the Master and worker machines, and safeguard data privacy by solely sharing summary statistics. Under mild conditions, we establish the <span><math><msub><mrow><mi>ℓ</mi></mrow><mrow><mn>2</mn></mrow></msub></math></span>-error bound and the asymptotic distribution of the estimators. Theoretical analysis confirms that the proposed estimators are statistically efficient. Additionally, numerical simulations and two real-world applications demonstrate the good performance of the proposed methods.</div></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":"241 ","pages":"Article 106326"},"PeriodicalIF":0.8000,"publicationDate":"2025-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Privacy-preserving estimation for non-randomly distributed data\",\"authors\":\"Xirui Liu , Ke Yang , Liwen Xu , Mixia Wu\",\"doi\":\"10.1016/j.jspi.2025.106326\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This paper investigates data distributed across various machines in a non-random manner. We introduce two innovative distributed estimators, tailored to accommodate varying levels of communication cost and data privacy protection. The proposed estimators adeptly addresses the challenges associated with the non-random distribution of data. Both methods are communication-efficient, necessitating only two rounds of communication between the Master and worker machines, and safeguard data privacy by solely sharing summary statistics. Under mild conditions, we establish the <span><math><msub><mrow><mi>ℓ</mi></mrow><mrow><mn>2</mn></mrow></msub></math></span>-error bound and the asymptotic distribution of the estimators. Theoretical analysis confirms that the proposed estimators are statistically efficient. Additionally, numerical simulations and two real-world applications demonstrate the good performance of the proposed methods.</div></div>\",\"PeriodicalId\":50039,\"journal\":{\"name\":\"Journal of Statistical Planning and Inference\",\"volume\":\"241 \",\"pages\":\"Article 106326\"},\"PeriodicalIF\":0.8000,\"publicationDate\":\"2025-07-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Statistical Planning and Inference\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0378375825000643\",\"RegionNum\":4,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"STATISTICS & PROBABILITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Statistical Planning and Inference","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0378375825000643","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
Privacy-preserving estimation for non-randomly distributed data
This paper investigates data distributed across various machines in a non-random manner. We introduce two innovative distributed estimators, tailored to accommodate varying levels of communication cost and data privacy protection. The proposed estimators adeptly addresses the challenges associated with the non-random distribution of data. Both methods are communication-efficient, necessitating only two rounds of communication between the Master and worker machines, and safeguard data privacy by solely sharing summary statistics. Under mild conditions, we establish the -error bound and the asymptotic distribution of the estimators. Theoretical analysis confirms that the proposed estimators are statistically efficient. Additionally, numerical simulations and two real-world applications demonstrate the good performance of the proposed methods.
期刊介绍:
The Journal of Statistical Planning and Inference offers itself as a multifaceted and all-inclusive bridge between classical aspects of statistics and probability, and the emerging interdisciplinary aspects that have a potential of revolutionizing the subject. While we maintain our traditional strength in statistical inference, design, classical probability, and large sample methods, we also have a far more inclusive and broadened scope to keep up with the new problems that confront us as statisticians, mathematicians, and scientists.
We publish high quality articles in all branches of statistics, probability, discrete mathematics, machine learning, and bioinformatics. We also especially welcome well written and up to date review articles on fundamental themes of statistics, probability, machine learning, and general biostatistics. Thoughtful letters to the editors, interesting problems in need of a solution, and short notes carrying an element of elegance or beauty are equally welcome.