{"title":"基于机器学习的云计算资源调度研究","authors":"Yansong Li","doi":"10.1109/MLISE57402.2022.00090","DOIUrl":null,"url":null,"abstract":"In the cloud computing environment, concurrent training of multiple machine learning models will cause serious competition for shared cluster resources and affect the execution efficiency. Aiming at this problem, this paper proposes a cloud computing resource scheduling method for distributed machine learning. Based on historical monitoring data, a model between the number of iterations and model quality improvement is established, the impact of resource allocation on model quality improvement is predicted online, resource optimization scheduling strategies are formulated, and a resource scheduling framework is designed. Experimental results show that the proposed method can quickly adapt to the dynamic changes of tasks and loads and maximize the overall performance of multiple model training jobs.","PeriodicalId":350291,"journal":{"name":"2022 International Conference on Machine Learning and Intelligent Systems Engineering (MLISE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Research on cloud computing resource scheduling based on machine learning\",\"authors\":\"Yansong Li\",\"doi\":\"10.1109/MLISE57402.2022.00090\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the cloud computing environment, concurrent training of multiple machine learning models will cause serious competition for shared cluster resources and affect the execution efficiency. Aiming at this problem, this paper proposes a cloud computing resource scheduling method for distributed machine learning. Based on historical monitoring data, a model between the number of iterations and model quality improvement is established, the impact of resource allocation on model quality improvement is predicted online, resource optimization scheduling strategies are formulated, and a resource scheduling framework is designed. Experimental results show that the proposed method can quickly adapt to the dynamic changes of tasks and loads and maximize the overall performance of multiple model training jobs.\",\"PeriodicalId\":350291,\"journal\":{\"name\":\"2022 International Conference on Machine Learning and Intelligent Systems Engineering (MLISE)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Machine Learning and Intelligent Systems Engineering (MLISE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MLISE57402.2022.00090\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Machine Learning and Intelligent Systems Engineering (MLISE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MLISE57402.2022.00090","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Research on cloud computing resource scheduling based on machine learning
In the cloud computing environment, concurrent training of multiple machine learning models will cause serious competition for shared cluster resources and affect the execution efficiency. Aiming at this problem, this paper proposes a cloud computing resource scheduling method for distributed machine learning. Based on historical monitoring data, a model between the number of iterations and model quality improvement is established, the impact of resource allocation on model quality improvement is predicted online, resource optimization scheduling strategies are formulated, and a resource scheduling framework is designed. Experimental results show that the proposed method can quickly adapt to the dynamic changes of tasks and loads and maximize the overall performance of multiple model training jobs.