Transformations for Semi-Continuous Data
Galit Shmueli, Wolfgang Jank, Valerie Hyde
Robert H. Smith School of Business Research Paper Series, 2006-11-01
DOI: 10.2139/ssrn.956938 (https://doi.org/10.2139/ssrn.956938)
Citations: 3
Abstract
Semi-continuous data arise in many applications where naturally continuous data become contaminated by the data-generating mechanism. The resulting data contain several values that are "too frequent", and in that sense are a hybrid between discrete and continuous data. The main problem is that standard statistical methods, which are geared towards either continuous or discrete data, cannot be applied adequately to semi-continuous data. We propose a set of two new transformations for semi-continuous data that "iron out" the too-frequent values, thereby making the data completely continuous. We show that the transformed data maintain the properties of the original data but are suitable for standard analysis. The transformations and their performance are illustrated using simulated data and real auction data from the online auction site eBay.
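To make the notion of "too frequent" values concrete, the following minimal Python sketch simulates data of this kind and flags the over-represented values. It is an illustration only and does not implement the two transformations proposed in the paper, which the abstract does not specify. The contamination mechanism (snapping a fraction of log-normal draws to a few fixed values), the function names, and every parameter value are assumptions chosen for the example.

```python
import random
from collections import Counter


def simulate_semicontinuous(n=5000, spike_values=(10.0, 20.0, 50.0),
                            p_contaminate=0.3, seed=0):
    """Draw continuous values, then snap a fraction of them to fixed values.

    Hypothetical contamination mechanism: each log-normal draw is replaced,
    with probability p_contaminate, by the nearest value in spike_values.
    """
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        x = rng.lognormvariate(3.0, 0.8)  # underlying continuous value
        if rng.random() < p_contaminate:
            # Contamination step: move the value onto the nearest spike.
            x = min(spike_values, key=lambda v: abs(v - x))
        data.append(x)
    return data


def too_frequent_values(data, min_share=0.01):
    """Return {value: share} for values making up at least min_share of data.

    In purely continuous data, exact repeats are essentially absent, so any
    value reaching this (assumed) threshold is a candidate "too frequent" value.
    """
    counts = Counter(data)
    n = len(data)
    return {v: c / n for v, c in counts.items() if c / n >= min_share}


if __name__ == "__main__":
    sample = simulate_semicontinuous()
    for value, share in sorted(too_frequent_values(sample).items()):
        print(f"value {value:g} accounts for {share:.1%} of observations")
```

The threshold `min_share` is an arbitrary cut-off for this sketch; with real data, such as auction bids that cluster on round amounts, the candidate values would more likely be identified by inspecting the frequency table or a histogram.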