{"title":"A Data-Driven Method for Reconstructing a Distribution from a Truncated Sample with an Application to Inferring Car-Sharing Demand","authors":"Evan Fields, C. Osorio, Tianli Zhou","doi":"10.1287/TRSC.2020.1028","DOIUrl":null,"url":null,"abstract":"This paper proposes a method to recover an unknown probability distribution given a censored or truncated sample from that distribution. The proposed method is a novel and conceptually simple detruncation method based on sampling the observed data according to weights learned by solving a simulation-based optimization problem; this method is especially appropriate in cases where little analytic information is available but the truncation process can be simulated. The proposed method is compared with the ubiquitous maximum likelihood estimation (MLE) method in a variety of synthetic validation experiments, where it is found that the proposed method performs slightly worse than perfectly specified MLE and competitively with slightly misspecified MLE. The practical application of this method is then demonstrated via a pair of case studies in which the proposed detruncation method is used alongside a car-sharing service simulator to estimate demand for round-trip car-sharing services in the Boston and New York metropolitan areas.","PeriodicalId":23247,"journal":{"name":"Transp. Sci.","volume":"34 1","pages":"616-636"},"PeriodicalIF":0.0000,"publicationDate":"2021-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transp. Sci.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1287/TRSC.2020.1028","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11
Abstract
This paper proposes a method to recover an unknown probability distribution given a censored or truncated sample from that distribution. The proposed method is a novel and conceptually simple detruncation method based on sampling the observed data according to weights learned by solving a simulation-based optimization problem; this method is especially appropriate in cases where little analytic information is available but the truncation process can be simulated. The proposed method is compared with the ubiquitous maximum likelihood estimation (MLE) method in a variety of synthetic validation experiments, where it is found that the proposed method performs slightly worse than perfectly specified MLE and competitively with slightly misspecified MLE. The practical application of this method is then demonstrated via a pair of case studies in which the proposed detruncation method is used alongside a car-sharing service simulator to estimate demand for round-trip car-sharing services in the Boston and New York metropolitan areas.
本文提出了一种从未知概率分布中恢复截短样本的方法。该方法是一种新颖且概念简单的去截断方法,该方法基于求解基于仿真的优化问题所获得的权重对观测数据进行采样;这种方法特别适用于分析信息很少但可以模拟截断过程的情况。在各种综合验证实验中,将所提方法与泛在极大似然估计(ubiquitous maximum likelihood estimation, MLE)方法进行了比较,发现所提方法的性能略差于完全指定极大似然估计,并与轻微错指定极大似然估计竞争。然后通过一对案例研究证明了该方法的实际应用,其中所提出的去截断方法与汽车共享服务模拟器一起使用,以估计波士顿和纽约大都市地区往返汽车共享服务的需求。