{"title":"困扰商业和金融研究人员的数据质量问题:文献综述和综合分析","authors":"G. Liu","doi":"10.1080/08963568.2020.1847555","DOIUrl":null,"url":null,"abstract":"Abstract The data quality of commercial business and financial databases greatly affects research quality and reliability. The presence of data quality problems can not only distort research results, destroy a research effort but also seriously damage management decisions based upon such research. Although library literature rarely discusses data quality problems, business literature reports a wide range of data quality issues, many of which have been systematically tested with statistical methods. This article reviews a collection of the business literature that provides a critical analysis on the data quality of the most frequently used business and finance databases including the Center for Research in Security Prices (CRSP), Compustat, S&P Capital IQ, I/B/E/S, Datastream, Worldscope, Securities Data Company (SDC) Platinum, and Bureau van Dijk (BvD) Orbis and identifies 11 categories of common data quality problems, including missing values, data errors, discrepancies, biases, inconsistencies, static header data, standardization, changes in historic data, lack of transparency, reporting time issues and misuse of data. Finally, the article provides some practical advice for librarians to facilitate their scholarly communications with researchers on data quality problems.","PeriodicalId":44062,"journal":{"name":"Journal of Business & Finance Librarianship","volume":"25 1","pages":"315 - 371"},"PeriodicalIF":0.8000,"publicationDate":"2020-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/08963568.2020.1847555","citationCount":"9","resultStr":"{\"title\":\"Data quality problems troubling business and financial researchers: A literature review and synthetic analysis\",\"authors\":\"G. Liu\",\"doi\":\"10.1080/08963568.2020.1847555\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract The data quality of commercial business and financial databases greatly affects research quality and reliability. The presence of data quality problems can not only distort research results, destroy a research effort but also seriously damage management decisions based upon such research. Although library literature rarely discusses data quality problems, business literature reports a wide range of data quality issues, many of which have been systematically tested with statistical methods. This article reviews a collection of the business literature that provides a critical analysis on the data quality of the most frequently used business and finance databases including the Center for Research in Security Prices (CRSP), Compustat, S&P Capital IQ, I/B/E/S, Datastream, Worldscope, Securities Data Company (SDC) Platinum, and Bureau van Dijk (BvD) Orbis and identifies 11 categories of common data quality problems, including missing values, data errors, discrepancies, biases, inconsistencies, static header data, standardization, changes in historic data, lack of transparency, reporting time issues and misuse of data. Finally, the article provides some practical advice for librarians to facilitate their scholarly communications with researchers on data quality problems.\",\"PeriodicalId\":44062,\"journal\":{\"name\":\"Journal of Business & Finance Librarianship\",\"volume\":\"25 1\",\"pages\":\"315 - 371\"},\"PeriodicalIF\":0.8000,\"publicationDate\":\"2020-11-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1080/08963568.2020.1847555\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Business & Finance Librarianship\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1080/08963568.2020.1847555\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"INFORMATION SCIENCE & LIBRARY SCIENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Business & Finance Librarianship","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/08963568.2020.1847555","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"INFORMATION SCIENCE & LIBRARY SCIENCE","Score":null,"Total":0}
引用次数: 9
摘要
摘要商业商业和金融数据库的数据质量在很大程度上影响着研究的质量和可靠性。数据质量问题的存在不仅会扭曲研究结果,破坏研究成果,还会严重损害基于此类研究的管理决策。尽管图书馆文献很少讨论数据质量问题,但商业文献报道了广泛的数据质量问题。其中许多问题已经用统计方法进行了系统测试。本文回顾了一系列商业文献,这些文献对最常用的商业和金融数据库的数据质量进行了批判性分析,包括证券价格研究中心(CRSP)、Compustat、标普资本IQ、I/B/E/S、Datastream、Worldscope、证券数据公司(SDC)Platinum,和Bureau van Dijk(BvD)Orbis,确定了11类常见的数据质量问题,包括缺失值、数据错误、差异、偏差、不一致、静态标题数据、标准化、历史数据变化、缺乏透明度、报告时间问题和数据滥用。最后,本文为图书馆员就数据质量问题与研究人员进行学术交流提供了一些实用的建议。
Data quality problems troubling business and financial researchers: A literature review and synthetic analysis
Abstract The data quality of commercial business and financial databases greatly affects research quality and reliability. The presence of data quality problems can not only distort research results, destroy a research effort but also seriously damage management decisions based upon such research. Although library literature rarely discusses data quality problems, business literature reports a wide range of data quality issues, many of which have been systematically tested with statistical methods. This article reviews a collection of the business literature that provides a critical analysis on the data quality of the most frequently used business and finance databases including the Center for Research in Security Prices (CRSP), Compustat, S&P Capital IQ, I/B/E/S, Datastream, Worldscope, Securities Data Company (SDC) Platinum, and Bureau van Dijk (BvD) Orbis and identifies 11 categories of common data quality problems, including missing values, data errors, discrepancies, biases, inconsistencies, static header data, standardization, changes in historic data, lack of transparency, reporting time issues and misuse of data. Finally, the article provides some practical advice for librarians to facilitate their scholarly communications with researchers on data quality problems.
期刊介绍:
The Journal of Business & Finance Librarianship is an innovative quarterly journal that provides you with useful articles about the creation, organization, dissemination, retrieval, and use of business information. This refereed journal covers the business information needs of special libraries, academic libraries, and public libraries, as well as information services and centers outside of the traditional library setting. You"ll find that the journal is international in scope, reflecting the multinational and international scope of the business community today. The immediate focus of the journal is practice-oriented articles, but it also provides an outlet for new empirical studies on business librarianship and business information.