Yang Zhang, Yiwen Wu, Tao Wang, Bo Ding, Huaimin Wang
{"title":"MLOps从业者在谈论什么问题?Stack Overflow论坛和GitHub项目讨论的研究","authors":"Yang Zhang, Yiwen Wu, Tao Wang, Bo Ding, Huaimin Wang","doi":"10.1016/j.infsof.2025.107768","DOIUrl":null,"url":null,"abstract":"<div><h3>Context:</h3><div>Machine Learning Operations (MLOps) has emerged as a crucial technology for addressing the challenges of designing and maintaining productive ML applications. The widespread adoption of MLOps makes it essential to identify the problems faced by MLOps practitioners. However, there has been relatively little research in this area.</div></div><div><h3>Objectives:</h3><div>To fill this research gap and gain an understanding of the interests and difficulties encountered by MLOps practitioners.</div></div><div><h3>Methods:</h3><div>We mine discussion data from the online Q&A forum, Stack Overflow, and GitHub projects, and analyze 6345 posts and 2103 issues.</div></div><div><h3>Results:</h3><div>We construct the first taxonomy of MLOps problems in practice, consisting of 5 categories and 19 topics. We also investigate the evolution and characteristics (difficulty and sentiment) of these topics, distill 12 frequent solutions for different MLOps problems, and design an MLOps knowledge exploration tool, MLOps-KET.</div></div><div><h3>Conclusion:</h3><div>We find that practitioners face diverse challenges when performing MLOps practices and that the focus of their discussions changed over time. Our study contributes to the MLOps research and development community by providing implications for different audiences and guidance for future support of relevant techniques and tools.</div></div>","PeriodicalId":54983,"journal":{"name":"Information and Software Technology","volume":"185 ","pages":"Article 107768"},"PeriodicalIF":4.3000,"publicationDate":"2025-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"What problems are MLOps practitioners talking about? A study of discussions in Stack Overflow forum and GitHub projects\",\"authors\":\"Yang Zhang, Yiwen Wu, Tao Wang, Bo Ding, Huaimin Wang\",\"doi\":\"10.1016/j.infsof.2025.107768\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Context:</h3><div>Machine Learning Operations (MLOps) has emerged as a crucial technology for addressing the challenges of designing and maintaining productive ML applications. The widespread adoption of MLOps makes it essential to identify the problems faced by MLOps practitioners. However, there has been relatively little research in this area.</div></div><div><h3>Objectives:</h3><div>To fill this research gap and gain an understanding of the interests and difficulties encountered by MLOps practitioners.</div></div><div><h3>Methods:</h3><div>We mine discussion data from the online Q&A forum, Stack Overflow, and GitHub projects, and analyze 6345 posts and 2103 issues.</div></div><div><h3>Results:</h3><div>We construct the first taxonomy of MLOps problems in practice, consisting of 5 categories and 19 topics. We also investigate the evolution and characteristics (difficulty and sentiment) of these topics, distill 12 frequent solutions for different MLOps problems, and design an MLOps knowledge exploration tool, MLOps-KET.</div></div><div><h3>Conclusion:</h3><div>We find that practitioners face diverse challenges when performing MLOps practices and that the focus of their discussions changed over time. Our study contributes to the MLOps research and development community by providing implications for different audiences and guidance for future support of relevant techniques and tools.</div></div>\",\"PeriodicalId\":54983,\"journal\":{\"name\":\"Information and Software Technology\",\"volume\":\"185 \",\"pages\":\"Article 107768\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2025-05-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information and Software Technology\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0950584925001077\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information and Software Technology","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950584925001077","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
What problems are MLOps practitioners talking about? A study of discussions in Stack Overflow forum and GitHub projects
Context:
Machine Learning Operations (MLOps) has emerged as a crucial technology for addressing the challenges of designing and maintaining productive ML applications. The widespread adoption of MLOps makes it essential to identify the problems faced by MLOps practitioners. However, there has been relatively little research in this area.
Objectives:
To fill this research gap and gain an understanding of the interests and difficulties encountered by MLOps practitioners.
Methods:
We mine discussion data from the online Q&A forum, Stack Overflow, and GitHub projects, and analyze 6345 posts and 2103 issues.
Results:
We construct the first taxonomy of MLOps problems in practice, consisting of 5 categories and 19 topics. We also investigate the evolution and characteristics (difficulty and sentiment) of these topics, distill 12 frequent solutions for different MLOps problems, and design an MLOps knowledge exploration tool, MLOps-KET.
Conclusion:
We find that practitioners face diverse challenges when performing MLOps practices and that the focus of their discussions changed over time. Our study contributes to the MLOps research and development community by providing implications for different audiences and guidance for future support of relevant techniques and tools.
期刊介绍:
Information and Software Technology is the international archival journal focusing on research and experience that contributes to the improvement of software development practices. The journal''s scope includes methods and techniques to better engineer software and manage its development. Articles submitted for review should have a clear component of software engineering or address ways to improve the engineering and management of software development. Areas covered by the journal include:
• Software management, quality and metrics,
• Software processes,
• Software architecture, modelling, specification, design and programming
• Functional and non-functional software requirements
• Software testing and verification & validation
• Empirical studies of all aspects of engineering and managing software development
Short Communications is a new section dedicated to short papers addressing new ideas, controversial opinions, "Negative" results and much more. Read the Guide for authors for more information.
The journal encourages and welcomes submissions of systematic literature studies (reviews and maps) within the scope of the journal. Information and Software Technology is the premiere outlet for systematic literature studies in software engineering.