{"title":"重新列入名单的物种遵循本福德定律吗?","authors":"B. Kriström","doi":"10.2139/ssrn.3266483","DOIUrl":null,"url":null,"abstract":"Benford's law describes how different numbers are distributed as first figures in statistics. The law states, for example, that the number 1 should be the first digit in 30.1% of cases, the figure 2 in 17.6% of the cases and the figure 9 in 4.6% of the cases. If Benford's law is violated, it may be an indication that the numbers may be manipulated, or more generally of low quality. Possibly the most high-profile case is the investigation of the Greek macroeconomic accounts. The Stability and Growth Pact of the EU imposes certain constraints on member countries budget deficits, and there were concerns about the Greek economy in the 2000s. According to some observers, it was \"well-known\" among EU-officials that the Greece numbers were \"cooked\". It is then of some interest to note that the Greek macroeconomic data were those that showed the most significant deviation from Benford's law, compared to all other EU-countries, according to the analysis of Rauch et al (2011). Analysis of the law has used different data sets: Sandron (2003) studies the population of 198 countries (good agreement); Ley (1996) finds that one-day returns for some American indexes follow the law; Gonzalez-Garcia and Pastor (2010) shows that macroeconomic data generally follows Benford's law. Nigrini & Mittelmaier (1997) produces a test for accounting fraud analysis. According to Stigler's (1980) law, the name of the discoverer is often different from the name of the law; it is seemingly widely acknowledged that Newcomb (1881) already made the discovery. Fellman (2014) provides a comprehensive review of the literature. Intuitively, the law does not work well for certain types of data, such as length of humans, where the majority of the numbers start with 1 and a few with 2. We must also have a large data base, so that we have \"enough\" variation in the data. It is not a very intuitive law, although there are some attempts to show that the law follows from certain mathematical arguments. Our approach here is just to apply the law to a certain data-set and explore whether or not it holds true. The law does not prove that the quality of data is bad, but it is sufficiently well tested in so many contexts that a deviation from the law merits a closer investigation of the data generating process.","PeriodicalId":107127,"journal":{"name":"Applied Ecology eJournal","volume":"398 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Do Redlisted Species Follow Benford’s Law?\",\"authors\":\"B. Kriström\",\"doi\":\"10.2139/ssrn.3266483\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Benford's law describes how different numbers are distributed as first figures in statistics. The law states, for example, that the number 1 should be the first digit in 30.1% of cases, the figure 2 in 17.6% of the cases and the figure 9 in 4.6% of the cases. If Benford's law is violated, it may be an indication that the numbers may be manipulated, or more generally of low quality. Possibly the most high-profile case is the investigation of the Greek macroeconomic accounts. The Stability and Growth Pact of the EU imposes certain constraints on member countries budget deficits, and there were concerns about the Greek economy in the 2000s. According to some observers, it was \\\"well-known\\\" among EU-officials that the Greece numbers were \\\"cooked\\\". It is then of some interest to note that the Greek macroeconomic data were those that showed the most significant deviation from Benford's law, compared to all other EU-countries, according to the analysis of Rauch et al (2011). Analysis of the law has used different data sets: Sandron (2003) studies the population of 198 countries (good agreement); Ley (1996) finds that one-day returns for some American indexes follow the law; Gonzalez-Garcia and Pastor (2010) shows that macroeconomic data generally follows Benford's law. Nigrini & Mittelmaier (1997) produces a test for accounting fraud analysis. According to Stigler's (1980) law, the name of the discoverer is often different from the name of the law; it is seemingly widely acknowledged that Newcomb (1881) already made the discovery. Fellman (2014) provides a comprehensive review of the literature. Intuitively, the law does not work well for certain types of data, such as length of humans, where the majority of the numbers start with 1 and a few with 2. We must also have a large data base, so that we have \\\"enough\\\" variation in the data. It is not a very intuitive law, although there are some attempts to show that the law follows from certain mathematical arguments. Our approach here is just to apply the law to a certain data-set and explore whether or not it holds true. The law does not prove that the quality of data is bad, but it is sufficiently well tested in so many contexts that a deviation from the law merits a closer investigation of the data generating process.\",\"PeriodicalId\":107127,\"journal\":{\"name\":\"Applied Ecology eJournal\",\"volume\":\"398 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-10-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Ecology eJournal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2139/ssrn.3266483\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Ecology eJournal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.3266483","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
摘要
本福德定律描述了不同的数字如何作为统计学中的第一数字分布。例如,法律规定,30.1%的案件中数字1应该是第一个数字,17.6%的案件中数字2应该是第一个数字,4.6%的案件中数字9应该是第一个数字。如果违反了本福德定律,这可能表明数字可能被操纵了,或者更普遍地说,质量很低。最引人注目的案例可能是对希腊宏观经济账户的调查。欧盟的《稳定与增长公约》(Stability and Growth Pact)对成员国的预算赤字施加了一定的限制,而且在本世纪头十年,人们对希腊的经济感到担忧。根据一些观察人士的说法,欧盟官员“众所周知”希腊的数字是“伪造的”。值得注意的是,根据Rauch等人(2011)的分析,与所有其他欧盟国家相比,希腊的宏观经济数据显示出与本福德定律最显著的偏差。对法律的分析使用了不同的数据集:Sandron(2003)研究了198个国家的人口(一致性很好);Ley(1996)发现一些美国指数的单日收益遵循规律;Gonzalez-Garcia和Pastor(2010)表明宏观经济数据一般遵循本福德定律。Nigrini & Mittelmaier(1997)对会计舞弊分析进行了检验。根据Stigler(1980)定律,发现者的名字通常与定律的名字不同;人们似乎普遍认为纽科姆(1881)已经发现了这一现象。Fellman(2014)提供了一个全面的文献综述。直观地说,该定律不适用于某些类型的数据,比如人类的长度,其中大多数数字以1开头,少数以2开头。我们还必须有一个大的数据库,这样我们才能在数据中有“足够”的变化。这不是一个非常直观的定律,尽管有人试图证明这个定律遵循某些数学论证。我们在这里的方法只是将这个定律应用到特定的数据集,并探索它是否成立。法律并没有证明数据的质量不好,但它在很多情况下都得到了充分的检验,因此对法律的偏离值得对数据生成过程进行更仔细的调查。
Benford's law describes how different numbers are distributed as first figures in statistics. The law states, for example, that the number 1 should be the first digit in 30.1% of cases, the figure 2 in 17.6% of the cases and the figure 9 in 4.6% of the cases. If Benford's law is violated, it may be an indication that the numbers may be manipulated, or more generally of low quality. Possibly the most high-profile case is the investigation of the Greek macroeconomic accounts. The Stability and Growth Pact of the EU imposes certain constraints on member countries budget deficits, and there were concerns about the Greek economy in the 2000s. According to some observers, it was "well-known" among EU-officials that the Greece numbers were "cooked". It is then of some interest to note that the Greek macroeconomic data were those that showed the most significant deviation from Benford's law, compared to all other EU-countries, according to the analysis of Rauch et al (2011). Analysis of the law has used different data sets: Sandron (2003) studies the population of 198 countries (good agreement); Ley (1996) finds that one-day returns for some American indexes follow the law; Gonzalez-Garcia and Pastor (2010) shows that macroeconomic data generally follows Benford's law. Nigrini & Mittelmaier (1997) produces a test for accounting fraud analysis. According to Stigler's (1980) law, the name of the discoverer is often different from the name of the law; it is seemingly widely acknowledged that Newcomb (1881) already made the discovery. Fellman (2014) provides a comprehensive review of the literature. Intuitively, the law does not work well for certain types of data, such as length of humans, where the majority of the numbers start with 1 and a few with 2. We must also have a large data base, so that we have "enough" variation in the data. It is not a very intuitive law, although there are some attempts to show that the law follows from certain mathematical arguments. Our approach here is just to apply the law to a certain data-set and explore whether or not it holds true. The law does not prove that the quality of data is bad, but it is sufficiently well tested in so many contexts that a deviation from the law merits a closer investigation of the data generating process.