{"title":"Crime Rates Won’t Work: Analyzing Crime for Small Areas Taking into Account More Than Population","authors":"J. Ottensmann","doi":"10.2139/ssrn.3440309","DOIUrl":"https://doi.org/10.2139/ssrn.3440309","url":null,"abstract":"Crime rates — numbers of crimes divided by the population living in an area — have problems when used for small areas. Some small areas include substantial nonresidential areas that contribute to the risk of crime, can be the location of crimes, but that have no populations. Negative binomial models to predict counts of the numbers of crimes in small areas are used to incorporate multiple measures of the risk or exposure to crime that cannot be accomplished using crime rates. Population, several measures of employment, and numbers of students in small areas from a transportation planning dataset all contribute to exposure and the prediction of crime in Indianapolis. Because these data are specific to Indianapolis, models using generally available data from the Census Transportation Planning Products and only data from the census of population are evaluated as alternatives. As the initial exposure data were available for the entire metropolitan area, alternative crime rates using these data are estimated and compared with the traditional population-based crime rates for 14 municipalities in the metropolitan area.","PeriodicalId":433005,"journal":{"name":"Econometrics: Data Collection & Data Estimation Methodology eJournal","volume":"61 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122802597","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Case for Data: Road System of Zimbabwe","authors":"Peter Charambira","doi":"10.2139/ssrn.3415749","DOIUrl":"https://doi.org/10.2139/ssrn.3415749","url":null,"abstract":"Zimbabwe has crumbling infrastructure which needs to be repaired or replaced. Road systems in particular, there is a need to know the current state. To know the current state requires collecting data on the system. The data collected should be all data pertaining to the system and the data should be publicly available in a machine-readable form. Collection of all data pertaining to the system allows for the current state of the system to be known. The publicly available data on the road system is not sufficient enough to know the current state of the road system. The data available is not of the current road system. Up to date data can allow for policy makers to know the current state of the system and allocate resources accordingly.","PeriodicalId":433005,"journal":{"name":"Econometrics: Data Collection & Data Estimation Methodology eJournal","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132745296","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A New Chinese Financial Sentiment Dictionary for Textual Analysis in Accounting and Finance","authors":"Shibo Bian, Dekui Jia, Feng Li, Zhipeng Yan","doi":"10.2139/ssrn.3446388","DOIUrl":"https://doi.org/10.2139/ssrn.3446388","url":null,"abstract":"Using a multi-stage filtering procedure based on both algorithms and human judgement, we develop a Chinese Financial Sentiment Dictionary (CFSD) with the hope to help advance textual analysis of Chinese documents in accounting and finance. There are 1,489 negative words and 1,108 positive words in the CFSD. We translate all 2,597 words into English. We also briefly discuss two unique settings in which researchers can investigate some important and interesting research questions that are rarely studied due to lack of data.","PeriodicalId":433005,"journal":{"name":"Econometrics: Data Collection & Data Estimation Methodology eJournal","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122499322","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Clustering Method for Analysis of Data Subject to Pre-Defined Classifications","authors":"Y. Liu","doi":"10.2139/ssrn.3403864","DOIUrl":"https://doi.org/10.2139/ssrn.3403864","url":null,"abstract":"In this paper, we present a methodology to perform clustering and grouping analysis for dataset with classification constraints or definitions. The discussion is demonstrated with a full example based on real data. We start with the observed difference in the CIA and UN subregional definition of European countries, and consider what the impact is from a subregional house price ratio perspective. As documented in this report, we find that the presented approach useful for clustering analysis of the pre-identified subgroups to address subgroup based clustering problems.","PeriodicalId":433005,"journal":{"name":"Econometrics: Data Collection & Data Estimation Methodology eJournal","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127027368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Bram Lancee, G. Birkelund, M. Coenders, Valentina Di Stasio, Marina Fernandez Reino, Anthony F. Heath, R. Koopmans, Edvard N. Larsen, Javier G. Polavieja, María Ramos, Lex Thijssen, Susanne Veit, Ruta Yemane, Dieuwke Zwier
{"title":"The GEMM Study: A Cross-National Harmonized Field Experiment on Labour Market Discrimination: Technical Report","authors":"Bram Lancee, G. Birkelund, M. Coenders, Valentina Di Stasio, Marina Fernandez Reino, Anthony F. Heath, R. Koopmans, Edvard N. Larsen, Javier G. Polavieja, María Ramos, Lex Thijssen, Susanne Veit, Ruta Yemane, Dieuwke Zwier","doi":"10.2139/ssrn.3398191","DOIUrl":"https://doi.org/10.2139/ssrn.3398191","url":null,"abstract":"This paper describes and documents the data collection of the GEMM project: A cross-national harmonized correspondence study on ethnic discrimination in hiring. Data is collected in Germany, Norway, The Netherlands, Britain, and Spain. The paper describes the research design, the experimental manipulations, the procedure of data collection and the ethical considerations. Also discussed are the occupations included, and the search procedure for vacancies. In sum, the technical report gives a detailed account of the collection of the GEMM data.","PeriodicalId":433005,"journal":{"name":"Econometrics: Data Collection & Data Estimation Methodology eJournal","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130062280","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Unpacking the Valuation of Data in the Data-Driven Economy","authors":"Dan Ciuriak","doi":"10.2139/ssrn.3379133","DOIUrl":"https://doi.org/10.2139/ssrn.3379133","url":null,"abstract":"Data is often said to be the most valuable commodity of our age. It is a curiosity, therefore, that it remains largely invisible on the balance sheets of companies and largely unmeasured in our national economic accounts. This note seeks to unpack what we mean when we refer to data as the “new oil�? or the essential capital of the data-driven economy, how it differs from information in general, how it is transformed into value, and what might be the approximate scale of the value of data in a modern data-driven economy.","PeriodicalId":433005,"journal":{"name":"Econometrics: Data Collection & Data Estimation Methodology eJournal","volume":"134 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131850955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
D. Barnum, Jason Coupet, J. Gleason, Abagail McWilliams, A. Parhankangas
{"title":"Bias in Transport Efficiency Estimates Caused by Misspecified DEA Models","authors":"D. Barnum, Jason Coupet, J. Gleason, Abagail McWilliams, A. Parhankangas","doi":"10.2139/ssrn.3415133","DOIUrl":"https://doi.org/10.2139/ssrn.3415133","url":null,"abstract":"This paper examines transport modes that have employed Data Envelopment Analysis (DEA). It ascertains whether key DEA specifications necessary for estimating valid efficiency scores are present, the impact on the scores when they are not, and methods for correcting the errors. One critical specification is that for the sample data being used, each output must have been produced by various proportions of the inputs, with the substitutions between each pair of inputs shown on an isoquant. And, for the sample data, each input must have produced various proportions of the outputs, with the transformations between each pair of outputs shown on a production frontier. It is essential that these specifications are met by the data sample being used. DEA estimates input weights based solely on the Marginal Rate of Substitutions (MRS) between inputs on the sample data’s efficient frontier (isoquant). And DEA estimates output weights based solely on the Marginal Rate of Transformations (MRT) between outputs on the sample data’s production frontier. If there are no substitutions or transformations for the sample at hand, then there are no valid MRSs and MRTs, so DEA will utilize false weights and thereby produce false efficiency scores. In this paper, we analyze inputs and outputs that have often been used in DEA articles involving airline, urban transit, and freight rail data. Our data samples show that input substitution and output transformation are often not present. And, for our sample data, these misspecifications result in badly biased estimates of both technical efficiencies and second-stage regression parameters. We suggest methods for identifying and correcting for these specification errors, so future transportation DEA articles can avoid these problems.","PeriodicalId":433005,"journal":{"name":"Econometrics: Data Collection & Data Estimation Methodology eJournal","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134319283","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Our Data-Driven Future: Promise, Perils, and Prognoses","authors":"Dana Turjeman, F. Feinberg","doi":"10.2139/ssrn.3440726","DOIUrl":"https://doi.org/10.2139/ssrn.3440726","url":null,"abstract":"In this chapter, we discuss both the great advantages and potential harms ushered in by this era of data collection, as well as ways to mitigate the harms while maintaining the benefits. Specifically, we propose and discuss classes of potential solutions: methods for collecting less data overall, transparency of code and models, federated learning, and identity management tools, among others. Some of these solutions can be implemented now, others require a longer horizon, but all can begin through the advocacy of marketing research. We also discuss possible ways to improve on the benefits of data collection – by developing methods to assist individuals pursue their long-term goals while advocating for privacy in such pursuits.","PeriodicalId":433005,"journal":{"name":"Econometrics: Data Collection & Data Estimation Methodology eJournal","volume":"66 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130436544","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Introduction of the Method of Harmonic Weights and Integrated Economic and Statistical Calculations in the Analysis of Socio-Economic Security","authors":"D. Shvaiba","doi":"10.33619/2414-2948/40/31","DOIUrl":"https://doi.org/10.33619/2414-2948/40/31","url":null,"abstract":"In the implementation of comparative analysis of the application of methods of harmonic weights and integrated economic and statistical calculations in the analysis of socio–economic security characteristics, the priority is given to the method of integrated economic and statistical calculations. When predicting the characteristics of socio–economic security by the method of integrated economic and statistical calculations, their absolute levels or the dynamics of their growth are used. The use of the absolute values of socio–economic security characteristics in forecasting seems more appropriate, as it is possible to solve a number of problems of the method — to identify the levels of factors and factors K_ xi in the lead period.","PeriodicalId":433005,"journal":{"name":"Econometrics: Data Collection & Data Estimation Methodology eJournal","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125927372","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"R&D, Embodied Technological Change and Employment: Evidence from Italian Microdata","authors":"Laura Barbieri, M. Piva, M. Vivarelli","doi":"10.1093/ICC/DTY001","DOIUrl":"https://doi.org/10.1093/ICC/DTY001","url":null,"abstract":"This paper explores the employment impact of innovation activity, taking into account both RD however, this positive effect is barely significant when the sole in-house R&D expenditures are considered and fades away when ETC is included as a proxy for innovation activities. Moreover, the positive employment impacts of innovation activities and R&D expenditures are totally due to firms operating in high-tech industries and large companies, while no job-creation due to technical change is detectable in traditional sectors and SMEs.","PeriodicalId":433005,"journal":{"name":"Econometrics: Data Collection & Data Estimation Methodology eJournal","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131521754","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}