Predictive Analysis and Research Of Python Usage Rate Based on Polynomial Regression Model

Yang Gong, P. Zhang
{"title":"Predictive Analysis and Research Of Python Usage Rate Based on Polynomial Regression Model","authors":"Yang Gong, P. Zhang","doi":"10.1109/AIAM54119.2021.00061","DOIUrl":null,"url":null,"abstract":"Nowadays, more and more people will choose Python to help them accomplish some things, in order to better predict the proportion of Python usage. This paper proposes a polynomial regression analysis model. First, crawl the historical usage data of the python language from the official website; then clean the analysis, use a scatter plot to visualize the relationship between tags and features; then use the training set data to train the polynomial regression Model; Finally, the generalization ability of the model is tested through the test set. After many experiments, it can be known that when the highest number is 9 times, the entire training set score is 0.912862, and the test set score is 0.886600, which achieves a better fitting effect and has a certain practical value, which can be used for popularization.","PeriodicalId":227320,"journal":{"name":"2021 3rd International Conference on Artificial Intelligence and Advanced Manufacture (AIAM)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 3rd International Conference on Artificial Intelligence and Advanced Manufacture (AIAM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AIAM54119.2021.00061","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Nowadays, more and more people will choose Python to help them accomplish some things, in order to better predict the proportion of Python usage. This paper proposes a polynomial regression analysis model. First, crawl the historical usage data of the python language from the official website; then clean the analysis, use a scatter plot to visualize the relationship between tags and features; then use the training set data to train the polynomial regression Model; Finally, the generalization ability of the model is tested through the test set. After many experiments, it can be known that when the highest number is 9 times, the entire training set score is 0.912862, and the test set score is 0.886600, which achieves a better fitting effect and has a certain practical value, which can be used for popularization.
基于多项式回归模型的Python使用率预测分析与研究
现在,越来越多的人会选择Python来帮助他们完成一些事情,以便更好地预测Python的使用比例。本文提出了一个多项式回归分析模型。首先,从官网抓取python语言的历史使用数据;然后进行清理分析,利用散点图将标签与特征之间的关系可视化;然后利用训练集数据训练多项式回归模型;最后,通过测试集检验模型的泛化能力。经过多次实验可以知道,当最高次数为9次时,整个训练集得分为0.912862,测试集得分为0.886600,达到了较好的拟合效果,具有一定的实用价值,可以推广使用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信