Best Practices in Accelerating the Data Science Process in Python

D. Larson
{"title":"Best Practices in Accelerating the Data Science Process in Python","authors":"D. Larson","doi":"10.5772/INTECHOPEN.84784","DOIUrl":null,"url":null,"abstract":"The number of data science and big data projects is growing, and current software development approaches are challenged to support and contribute to the success and frequency of these projects. Much has been researched on how data science algorithm is used and the benefits of big data, but very little has been written about what best practices can be leveraged to accelerate and effectively deliver data science and big data projects. Big data characteristics such as volume, variety, velocity, and veracity complicate these projects. The proliferation of open-source technologies available to data scientists can also complicate the landscape. With the increase in data science and big data projects, organizations are struggling to deliver successfully. This paper addresses the data science and big data project process, the gaps in the process, best practices, and how these best practices are being applied in Python, one of the common data science open-source programming languages.","PeriodicalId":224487,"journal":{"name":"Introduction to Data Science and Machine Learning","volume":"18 5","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Introduction to Data Science and Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5772/INTECHOPEN.84784","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The number of data science and big data projects is growing, and current software development approaches are challenged to support and contribute to the success and frequency of these projects. Much has been researched on how data science algorithm is used and the benefits of big data, but very little has been written about what best practices can be leveraged to accelerate and effectively deliver data science and big data projects. Big data characteristics such as volume, variety, velocity, and veracity complicate these projects. The proliferation of open-source technologies available to data scientists can also complicate the landscape. With the increase in data science and big data projects, organizations are struggling to deliver successfully. This paper addresses the data science and big data project process, the gaps in the process, best practices, and how these best practices are being applied in Python, one of the common data science open-source programming languages.
用Python加速数据科学过程的最佳实践
数据科学和大数据项目的数量正在增长,当前的软件开发方法面临着支持和促进这些项目的成功和频率的挑战。关于如何使用数据科学算法和大数据的好处已经有了很多研究,但关于如何利用最佳实践来加速和有效地交付数据科学和大数据项目的文章却很少。大数据的数量、种类、速度和准确性等特征使这些项目复杂化。数据科学家可以使用的开源技术的激增也可能使前景复杂化。随着数据科学和大数据项目的增加,组织正在努力成功交付。本文介绍了数据科学和大数据项目过程、过程中的差距、最佳实践,以及如何将这些最佳实践应用于Python(常见的数据科学开源编程语言之一)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信