Python’s evolution on Stack Overflow: An empirical analysis of topic trends

IF 1.8 3区 计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING
Fengqi Hu, Weihao Xue, Siyuan Zhou, Ye Wang, Bo Jiang, Qiao Huang, Hua Zhang
{"title":"Python’s evolution on Stack Overflow: An empirical analysis of topic trends","authors":"Fengqi Hu,&nbsp;Weihao Xue,&nbsp;Siyuan Zhou,&nbsp;Ye Wang,&nbsp;Bo Jiang,&nbsp;Qiao Huang,&nbsp;Hua Zhang","doi":"10.1016/j.cola.2025.101340","DOIUrl":null,"url":null,"abstract":"<div><div>With the rapid development of information technology and changing programming practices, the demand for programming discussions on online Q&amp;A platforms is growing. This study analyzes over two million Python-related posts on Stack Overflow to identify core topics and challenges over fifteen years. By using a Gradient Boosting Decision Tree (GBDT) model to quantify post popularity, we objectively show what the hottest as well as the most disturbing topics related to Python are to users at different times. We find that: The domains most closely associated with Python are data processing and machine learning, while development environments as well as automation and testing are gradually increasing in popularity. Machine learning is the area that bothers users the most. Moreover, we found that some questions that confuse users can increase the popularity of related topics. These findings can help developers grasp the direction of the Python language so that they can better plan their personal learning and project development. Enterprises and organizations can also optimize resource allocation based on trends in hot topics for training, tool development, and technical support.</div></div>","PeriodicalId":48552,"journal":{"name":"Journal of Computer Languages","volume":"84 ","pages":"Article 101340"},"PeriodicalIF":1.8000,"publicationDate":"2025-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computer Languages","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2590118425000267","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0

Abstract

With the rapid development of information technology and changing programming practices, the demand for programming discussions on online Q&A platforms is growing. This study analyzes over two million Python-related posts on Stack Overflow to identify core topics and challenges over fifteen years. By using a Gradient Boosting Decision Tree (GBDT) model to quantify post popularity, we objectively show what the hottest as well as the most disturbing topics related to Python are to users at different times. We find that: The domains most closely associated with Python are data processing and machine learning, while development environments as well as automation and testing are gradually increasing in popularity. Machine learning is the area that bothers users the most. Moreover, we found that some questions that confuse users can increase the popularity of related topics. These findings can help developers grasp the direction of the Python language so that they can better plan their personal learning and project development. Enterprises and organizations can also optimize resource allocation based on trends in hot topics for training, tool development, and technical support.
Python在Stack Overflow上的演变:主题趋势的实证分析
随着信息技术的快速发展和编程实践的变化,在线问答平台对编程讨论的需求越来越大。这项研究分析了Stack Overflow上超过200万个与python相关的帖子,以确定15年来的核心主题和挑战。通过使用梯度提升决策树(GBDT)模型来量化帖子受欢迎程度,我们客观地显示了在不同时间与Python相关的最热门和最令人不安的话题是什么。我们发现:与Python最密切相关的领域是数据处理和机器学习,而开发环境以及自动化和测试正在逐渐普及。机器学习是最困扰用户的领域。此外,我们发现一些让用户困惑的问题可以增加相关话题的受欢迎程度。这些发现可以帮助开发人员掌握Python语言的发展方向,从而更好地规划个人学习和项目开发。企业和组织也可以根据培训、工具开发和技术支持的热门话题趋势来优化资源分配。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Computer Languages
Journal of Computer Languages Computer Science-Computer Networks and Communications
CiteScore
5.00
自引率
13.60%
发文量
36
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信