A Balanced Perspective on Prediction and Inference for Data Science in Industry

Nathan Sanders
{"title":"A Balanced Perspective on Prediction and Inference for Data Science in Industry","authors":"Nathan Sanders","doi":"10.1162/99608F92.644EF4A4","DOIUrl":null,"url":null,"abstract":"The strategic role of data science teams in industry is fundamentally to help businesses to make smarter decisions. This includes decisions on minuscule scales, such as what fraction of a cent to bid on an ad placement displayed in a web browser, whose importance is only manifest when scaled by orders of magnitude through machine automation. But it also extends to singular, monumental decisions made by businesses, such as how to position a new entrant within a competitive market. In both regimes, the potential impact of data science is only realized when both humans and machine actors are learning from data and when data scientists communicate effectively to decision makers throughout the business. I examine this dynamic through the instructive lens of the duality between inference and prediction. I define these concepts, which have varied use across many fields, in practical terms for the industrial data scientist. Through a series of descriptions, illustrations, contrasting concepts, and examples from the entertainment industry (box office prediction and advertising attribution), I offer perspectives on how the concepts of inference and prediction manifest in the business setting. From a balanced perspective, prediction and inference are integral components of the process by which models are compared to data. However, through a textual analysis of research abstracts from the literature, I demonstrate that an imbalanced, prediction-oriented perspective prevails in industry and has likewise become increasingly dominant among quantitative academic disciplines. I argue that, despite these trends, data scientists in industry must not overlook the valuable, generalizable insights that can be extracted through statistical inference. I conclude by exploring the implications of this strategic choice for how data science teams are integrated in businesses.KeywordsIndustry, Entertainment, Communication, Inference, Bibliometrics","PeriodicalId":23712,"journal":{"name":"Volume 4 Issue 1","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2019-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Volume 4 Issue 1","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1162/99608F92.644EF4A4","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

Abstract

The strategic role of data science teams in industry is fundamentally to help businesses to make smarter decisions. This includes decisions on minuscule scales, such as what fraction of a cent to bid on an ad placement displayed in a web browser, whose importance is only manifest when scaled by orders of magnitude through machine automation. But it also extends to singular, monumental decisions made by businesses, such as how to position a new entrant within a competitive market. In both regimes, the potential impact of data science is only realized when both humans and machine actors are learning from data and when data scientists communicate effectively to decision makers throughout the business. I examine this dynamic through the instructive lens of the duality between inference and prediction. I define these concepts, which have varied use across many fields, in practical terms for the industrial data scientist. Through a series of descriptions, illustrations, contrasting concepts, and examples from the entertainment industry (box office prediction and advertising attribution), I offer perspectives on how the concepts of inference and prediction manifest in the business setting. From a balanced perspective, prediction and inference are integral components of the process by which models are compared to data. However, through a textual analysis of research abstracts from the literature, I demonstrate that an imbalanced, prediction-oriented perspective prevails in industry and has likewise become increasingly dominant among quantitative academic disciplines. I argue that, despite these trends, data scientists in industry must not overlook the valuable, generalizable insights that can be extracted through statistical inference. I conclude by exploring the implications of this strategic choice for how data science teams are integrated in businesses.KeywordsIndustry, Entertainment, Communication, Inference, Bibliometrics
工业数据科学预测与推理的平衡视角
数据科学团队在行业中的战略角色是从根本上帮助企业做出更明智的决策。这包括极小尺度上的决策,例如在网页浏览器上展示的广告位置出价多少美分,只有通过机器自动化按数量级缩放时,其重要性才会显现出来。但它也延伸到企业做出的重大决策,比如如何在竞争激烈的市场中定位新进入者。在这两种情况下,只有当人类和机器参与者都从数据中学习,以及数据科学家与整个企业的决策者进行有效沟通时,数据科学的潜在影响才会实现。我通过推理和预测之间的二元性的有益镜头来研究这种动态。我为工业数据科学家定义了这些概念,它们在许多领域都有不同的用途。通过一系列的描述、插图、对比概念和娱乐行业的例子(票房预测和广告归因),我提供了关于推理和预测概念如何在商业环境中体现的观点。从平衡的角度来看,预测和推理是将模型与数据进行比较的过程的组成部分。然而,通过对文献研究摘要的文本分析,我证明了一种不平衡的、以预测为导向的观点在工业界盛行,同样在定量学术学科中也越来越占主导地位。我认为,尽管有这些趋势,但行业中的数据科学家绝不能忽视通过统计推断可以提取的有价值的、可推广的见解。最后,我将探讨这一战略选择对数据科学团队如何融入企业的影响。关键词:工业,娱乐,传播,推理,文献计量学
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信