Research on cloud computing resource scheduling based on machine learning

Yansong Li
{"title":"Research on cloud computing resource scheduling based on machine learning","authors":"Yansong Li","doi":"10.1109/MLISE57402.2022.00090","DOIUrl":null,"url":null,"abstract":"In the cloud computing environment, concurrent training of multiple machine learning models will cause serious competition for shared cluster resources and affect the execution efficiency. Aiming at this problem, this paper proposes a cloud computing resource scheduling method for distributed machine learning. Based on historical monitoring data, a model between the number of iterations and model quality improvement is established, the impact of resource allocation on model quality improvement is predicted online, resource optimization scheduling strategies are formulated, and a resource scheduling framework is designed. Experimental results show that the proposed method can quickly adapt to the dynamic changes of tasks and loads and maximize the overall performance of multiple model training jobs.","PeriodicalId":350291,"journal":{"name":"2022 International Conference on Machine Learning and Intelligent Systems Engineering (MLISE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Machine Learning and Intelligent Systems Engineering (MLISE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MLISE57402.2022.00090","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In the cloud computing environment, concurrent training of multiple machine learning models will cause serious competition for shared cluster resources and affect the execution efficiency. Aiming at this problem, this paper proposes a cloud computing resource scheduling method for distributed machine learning. Based on historical monitoring data, a model between the number of iterations and model quality improvement is established, the impact of resource allocation on model quality improvement is predicted online, resource optimization scheduling strategies are formulated, and a resource scheduling framework is designed. Experimental results show that the proposed method can quickly adapt to the dynamic changes of tasks and loads and maximize the overall performance of multiple model training jobs.
基于机器学习的云计算资源调度研究
在云计算环境下,多个机器学习模型的并发训练会对共享集群资源造成严重的竞争,影响执行效率。针对这一问题,本文提出了一种用于分布式机器学习的云计算资源调度方法。基于历史监测数据,建立了迭代次数与模型质量改进之间的模型,在线预测了资源分配对模型质量改进的影响,制定了资源优化调度策略,设计了资源调度框架。实验结果表明,该方法能够快速适应任务和负荷的动态变化,并最大限度地提高多个模型训练任务的综合性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信