移动网络大数据对发展的行为洞察:对政策制定者的最新启示

Sriganesh Lokanathan, Roshanthi Lucas Gunaratne
{"title":"移动网络大数据对发展的行为洞察:对政策制定者的最新启示","authors":"Sriganesh Lokanathan, Roshanthi Lucas Gunaratne","doi":"10.2139/ssrn.2522814","DOIUrl":null,"url":null,"abstract":"The discipline of Information and Communication Technologies for Development (ICT4D) gained traction against the exponential growth in mobile phone connectivity. There has been a multitude of projects, services, applications and even policies that aim to leverage the mobile phone to contribute to the broader development of society. This has gone hand in hand with much academic interest in understanding the effects of mobile phone connectivity on development. However it is only of late that attention is being paid to posing development related questions to the basic data artifacts that are left behind by society when consuming mobile phone services. These artifacts come under the class of Transaction Generated Data (TGD) having been recorded by mobile phone operators when certain events (for e.g. when one makes a call) occur for the purposes of billing and network optimization. Given the volumes of TGD that is produced it also falls under the category of Big Data. Big data is an amorphous category that could, for instance, include data from an astronomical observatory or the full text of all the digitized books from the 20th century. Like many others, the 2011 McKinsey Global Institute report on Big Data focuses solely on the \"big\" in defining the term: \"Big data refers to datasets whose size is beyond the ability of typical database software tools to capture, store, manage, and analyze\" (Manyika et al., 2011). This definition is intentionally subjective and incorporates a moving definition of how big a dataset needs to be in order to be considered big data with the implicit assumption that as technology advances over time, the size of datasets that qualify as big data will also increase. Also note that the definition can vary by sector, depending on what kinds of software tools are commonly available and what sizes of datasets are common in a particular industry. With those caveats, big data in many sectors today will range from a few dozen terabytes to multiple petabytes (thousands of terabytes). Gartner (2011) introduced additional important definitional characteristics in addition to volume, namely velocity and variety. Velocity refers to the speed at which data is generated, assessed and analyzed. The term \"Variety\" encompasses the fact that data can exist as different media (text, audio, video) and come in different format (structured and unstructured). Value is a fourth definitional characteristic that acknowledges the potential high socio-economic value that may be generated by Big Data (Jones, 2012). Included within its scope is the category of transaction-generated data (TGD), also sometimes described as \"data exhaust.\" This category was first discussed in 1991, though the term then used was transaction-generated information. The value of this subset of big data is that it is directly connected to human behavior and its accuracy is generally high because the data is generated for a purpose, such as the completion of telephone call or a commercial transaction.TGD has great potential for broader development and is already being leveraged to predict flu trends, forecast unemployment, understand societal ties and overall socio-economic well-being, etc.However unlike in developed countries, the only streams of comprehensive big data with wide socio-economic coverage in developing countries are those generated by telecom networks, because commercial banks and supermarkets, for example, do not reach a majority of people. Even whilst internet access is growing fast in developing economies, as noted in the 2013 Measuring the Information Society report by ITU, overall household internet penetration in developing economies was expected to be only 28% as of end 2013, as opposed to almost 80% in developed economies. Basic mobile subscriptions however have almost peaked at 96% globally (ITU, 2013). Therefore in the near term, it is non-Internet related mobile network big data that has the widest socioeconomic coverage. Such data is already being utilized for development and monitoring not just in developed economies but also in developing economies. Therefore the focus of this paper is mainly on mobile network big data for development.This policy paper serves to enlighten policy makers in developing economies, as to the range of behavioral insights on mobility, connectivity and consumption that can be extracted from mobile network TGD. Importantly, this paper also addresses how these insights can be leveraged by multiple policy domains inter alia transport, health, and economic development.","PeriodicalId":114907,"journal":{"name":"Global Business Issues eJournal","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Behavioral Insights for Development from Mobile Network Big Data: Enlightening Policy Makers on the State of the Art\",\"authors\":\"Sriganesh Lokanathan, Roshanthi Lucas Gunaratne\",\"doi\":\"10.2139/ssrn.2522814\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The discipline of Information and Communication Technologies for Development (ICT4D) gained traction against the exponential growth in mobile phone connectivity. There has been a multitude of projects, services, applications and even policies that aim to leverage the mobile phone to contribute to the broader development of society. This has gone hand in hand with much academic interest in understanding the effects of mobile phone connectivity on development. However it is only of late that attention is being paid to posing development related questions to the basic data artifacts that are left behind by society when consuming mobile phone services. These artifacts come under the class of Transaction Generated Data (TGD) having been recorded by mobile phone operators when certain events (for e.g. when one makes a call) occur for the purposes of billing and network optimization. Given the volumes of TGD that is produced it also falls under the category of Big Data. Big data is an amorphous category that could, for instance, include data from an astronomical observatory or the full text of all the digitized books from the 20th century. Like many others, the 2011 McKinsey Global Institute report on Big Data focuses solely on the \\\"big\\\" in defining the term: \\\"Big data refers to datasets whose size is beyond the ability of typical database software tools to capture, store, manage, and analyze\\\" (Manyika et al., 2011). This definition is intentionally subjective and incorporates a moving definition of how big a dataset needs to be in order to be considered big data with the implicit assumption that as technology advances over time, the size of datasets that qualify as big data will also increase. Also note that the definition can vary by sector, depending on what kinds of software tools are commonly available and what sizes of datasets are common in a particular industry. With those caveats, big data in many sectors today will range from a few dozen terabytes to multiple petabytes (thousands of terabytes). Gartner (2011) introduced additional important definitional characteristics in addition to volume, namely velocity and variety. Velocity refers to the speed at which data is generated, assessed and analyzed. The term \\\"Variety\\\" encompasses the fact that data can exist as different media (text, audio, video) and come in different format (structured and unstructured). Value is a fourth definitional characteristic that acknowledges the potential high socio-economic value that may be generated by Big Data (Jones, 2012). Included within its scope is the category of transaction-generated data (TGD), also sometimes described as \\\"data exhaust.\\\" This category was first discussed in 1991, though the term then used was transaction-generated information. The value of this subset of big data is that it is directly connected to human behavior and its accuracy is generally high because the data is generated for a purpose, such as the completion of telephone call or a commercial transaction.TGD has great potential for broader development and is already being leveraged to predict flu trends, forecast unemployment, understand societal ties and overall socio-economic well-being, etc.However unlike in developed countries, the only streams of comprehensive big data with wide socio-economic coverage in developing countries are those generated by telecom networks, because commercial banks and supermarkets, for example, do not reach a majority of people. Even whilst internet access is growing fast in developing economies, as noted in the 2013 Measuring the Information Society report by ITU, overall household internet penetration in developing economies was expected to be only 28% as of end 2013, as opposed to almost 80% in developed economies. Basic mobile subscriptions however have almost peaked at 96% globally (ITU, 2013). Therefore in the near term, it is non-Internet related mobile network big data that has the widest socioeconomic coverage. Such data is already being utilized for development and monitoring not just in developed economies but also in developing economies. Therefore the focus of this paper is mainly on mobile network big data for development.This policy paper serves to enlighten policy makers in developing economies, as to the range of behavioral insights on mobility, connectivity and consumption that can be extracted from mobile network TGD. Importantly, this paper also addresses how these insights can be leveraged by multiple policy domains inter alia transport, health, and economic development.\",\"PeriodicalId\":114907,\"journal\":{\"name\":\"Global Business Issues eJournal\",\"volume\":\"17 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-09-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Global Business Issues eJournal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2139/ssrn.2522814\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Global Business Issues eJournal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.2522814","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

摘要

信息和通信技术促进发展(ICT4D)学科在移动电话连接呈指数增长的情况下获得了牵引力。有大量的项目、服务、应用程序甚至政策旨在利用手机为社会的更广泛发展做出贡献。这与学术界对理解移动电话连接对发展的影响的兴趣密切相关。然而,直到最近,人们才开始注意到,在使用移动电话服务时,社会遗留下来的基本数据工件会带来与发展相关的问题。这些工件属于交易生成数据(Transaction Generated Data, TGD)一类,当某些事件(例如,当一个人打电话时)发生时,由移动电话运营商记录下来,用于计费和网络优化。考虑到产生的TGD的数量,它也属于大数据的范畴。大数据是一个无定形的类别,例如,它可以包括来自天文台的数据或20世纪以来所有数字化书籍的全文。与许多其他报告一样,2011年麦肯锡全球研究所关于大数据的报告在定义术语时只关注“大”:“大数据是指其规模超出典型数据库软件工具捕获、存储、管理和分析能力的数据集”(Manyika et al., 2011)。这个定义是主观的,它包含了一个不断变化的定义,即数据集需要多大才能被认为是大数据,同时隐含的假设是,随着技术的进步,符合大数据标准的数据集的规模也会增加。还需要注意的是,定义可能因行业而异,这取决于什么类型的软件工具是常用的,以及特定行业中常见的数据集的大小。有了这些注意事项,今天许多领域的大数据将从几十太字节到几千太字节不等。Gartner(2011)介绍了除了数量之外的其他重要定义特征,即速度和多样性。速度是指数据生成、评估和分析的速度。“多样性”一词包含了这样一个事实,即数据可以作为不同的媒体(文本、音频、视频)存在,并以不同的格式(结构化和非结构化)出现。价值是第四个定义特征,它承认大数据可能产生的潜在的高社会经济价值(Jones, 2012)。它的范围包括事务生成数据(TGD)类别,有时也被描述为“数据耗尽”。这个类别最早是在1991年讨论的,不过当时使用的术语是事务生成的信息。这个大数据子集的价值在于它与人类行为直接相关,而且由于数据是为了某种目的而产生的,例如完成电话呼叫或商业交易,因此其准确性通常很高。TGD具有更广泛发展的巨大潜力,并已被用于预测流感趋势、预测失业、了解社会关系和整体社会经济福利等。然而,与发达国家不同,发展中国家唯一具有广泛社会经济覆盖的综合大数据流是电信网络产生的数据流,因为商业银行和超市等无法触及大多数人。正如国际电联在《2013年衡量信息社会》报告中所指出的那样,尽管互联网接入在发展中经济体增长迅速,但截至2013年底,发展中经济体的总体家庭互联网普及率预计仅为28%,而发达经济体则接近80%。然而,全球基本移动用户几乎达到96%的峰值(ITU, 2013年)。因此,在短期内,与互联网无关的移动网络大数据是社会经济覆盖范围最广的。不仅在发达经济体,而且在发展中经济体,这种数据已经被用于发展和监测。因此本文的重点主要是针对移动网络大数据进行开发。本政策文件旨在启发发展中经济体的政策制定者,从移动网络TGD中提取有关移动性、连接性和消费的一系列行为见解。重要的是,本文还讨论了如何在交通、卫生和经济发展等多个政策领域利用这些见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Behavioral Insights for Development from Mobile Network Big Data: Enlightening Policy Makers on the State of the Art
The discipline of Information and Communication Technologies for Development (ICT4D) gained traction against the exponential growth in mobile phone connectivity. There has been a multitude of projects, services, applications and even policies that aim to leverage the mobile phone to contribute to the broader development of society. This has gone hand in hand with much academic interest in understanding the effects of mobile phone connectivity on development. However it is only of late that attention is being paid to posing development related questions to the basic data artifacts that are left behind by society when consuming mobile phone services. These artifacts come under the class of Transaction Generated Data (TGD) having been recorded by mobile phone operators when certain events (for e.g. when one makes a call) occur for the purposes of billing and network optimization. Given the volumes of TGD that is produced it also falls under the category of Big Data. Big data is an amorphous category that could, for instance, include data from an astronomical observatory or the full text of all the digitized books from the 20th century. Like many others, the 2011 McKinsey Global Institute report on Big Data focuses solely on the "big" in defining the term: "Big data refers to datasets whose size is beyond the ability of typical database software tools to capture, store, manage, and analyze" (Manyika et al., 2011). This definition is intentionally subjective and incorporates a moving definition of how big a dataset needs to be in order to be considered big data with the implicit assumption that as technology advances over time, the size of datasets that qualify as big data will also increase. Also note that the definition can vary by sector, depending on what kinds of software tools are commonly available and what sizes of datasets are common in a particular industry. With those caveats, big data in many sectors today will range from a few dozen terabytes to multiple petabytes (thousands of terabytes). Gartner (2011) introduced additional important definitional characteristics in addition to volume, namely velocity and variety. Velocity refers to the speed at which data is generated, assessed and analyzed. The term "Variety" encompasses the fact that data can exist as different media (text, audio, video) and come in different format (structured and unstructured). Value is a fourth definitional characteristic that acknowledges the potential high socio-economic value that may be generated by Big Data (Jones, 2012). Included within its scope is the category of transaction-generated data (TGD), also sometimes described as "data exhaust." This category was first discussed in 1991, though the term then used was transaction-generated information. The value of this subset of big data is that it is directly connected to human behavior and its accuracy is generally high because the data is generated for a purpose, such as the completion of telephone call or a commercial transaction.TGD has great potential for broader development and is already being leveraged to predict flu trends, forecast unemployment, understand societal ties and overall socio-economic well-being, etc.However unlike in developed countries, the only streams of comprehensive big data with wide socio-economic coverage in developing countries are those generated by telecom networks, because commercial banks and supermarkets, for example, do not reach a majority of people. Even whilst internet access is growing fast in developing economies, as noted in the 2013 Measuring the Information Society report by ITU, overall household internet penetration in developing economies was expected to be only 28% as of end 2013, as opposed to almost 80% in developed economies. Basic mobile subscriptions however have almost peaked at 96% globally (ITU, 2013). Therefore in the near term, it is non-Internet related mobile network big data that has the widest socioeconomic coverage. Such data is already being utilized for development and monitoring not just in developed economies but also in developing economies. Therefore the focus of this paper is mainly on mobile network big data for development.This policy paper serves to enlighten policy makers in developing economies, as to the range of behavioral insights on mobility, connectivity and consumption that can be extracted from mobile network TGD. Importantly, this paper also addresses how these insights can be leveraged by multiple policy domains inter alia transport, health, and economic development.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信