{"title":"关于差分私有高斯假设检验","authors":"Kwassi H. Degue, J. L. Ny","doi":"10.1109/ALLERTON.2018.8635911","DOIUrl":null,"url":null,"abstract":"Data analysis for emerging systems such as syndromic surveillance or intelligent transportation systems requires testing statistical models based on privacy-sensitive data collected from individuals, e.g., medical records or location traces. In this paper, we design a differentially private hypothesis test based on the generalized likelihood ratio method to decide if data modeled as a sequence of independent and identically distributed Gaussian random variables has a given mean value. Analytic formulas for decision thresholds and for the test’s receiver operating characteristic curve show explicitly the performance impact of the privacy constraint. We then apply the algorithm to the design of a differentially private anomaly (or fault) detector and study its performance for the analysis of a syndromic surveillance dataset from the Centers for Disease Control and Prevention in the United States.","PeriodicalId":299280,"journal":{"name":"2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton)","volume":"70 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"On Differentially Private Gaussian Hypothesis Testing\",\"authors\":\"Kwassi H. Degue, J. L. Ny\",\"doi\":\"10.1109/ALLERTON.2018.8635911\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Data analysis for emerging systems such as syndromic surveillance or intelligent transportation systems requires testing statistical models based on privacy-sensitive data collected from individuals, e.g., medical records or location traces. In this paper, we design a differentially private hypothesis test based on the generalized likelihood ratio method to decide if data modeled as a sequence of independent and identically distributed Gaussian random variables has a given mean value. Analytic formulas for decision thresholds and for the test’s receiver operating characteristic curve show explicitly the performance impact of the privacy constraint. We then apply the algorithm to the design of a differentially private anomaly (or fault) detector and study its performance for the analysis of a syndromic surveillance dataset from the Centers for Disease Control and Prevention in the United States.\",\"PeriodicalId\":299280,\"journal\":{\"name\":\"2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton)\",\"volume\":\"70 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ALLERTON.2018.8635911\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ALLERTON.2018.8635911","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
On Differentially Private Gaussian Hypothesis Testing
Data analysis for emerging systems such as syndromic surveillance or intelligent transportation systems requires testing statistical models based on privacy-sensitive data collected from individuals, e.g., medical records or location traces. In this paper, we design a differentially private hypothesis test based on the generalized likelihood ratio method to decide if data modeled as a sequence of independent and identically distributed Gaussian random variables has a given mean value. Analytic formulas for decision thresholds and for the test’s receiver operating characteristic curve show explicitly the performance impact of the privacy constraint. We then apply the algorithm to the design of a differentially private anomaly (or fault) detector and study its performance for the analysis of a syndromic surveillance dataset from the Centers for Disease Control and Prevention in the United States.