{"title":"IDL-BiGRU:集成深度学习辅助云环境下大数据智能调度","authors":"Rama Satish K V , Vibha M B , Lovely Sasidharan","doi":"10.1016/j.datak.2025.102489","DOIUrl":null,"url":null,"abstract":"<div><div>The rapid expansion of Internet of Things (IoT) applications generates a continuous and massive flow of data, creating significant challenges in both data processing and storage management. Cloud computing offers scalable infrastructure to handle such data intensive workloads, but optimal task scheduling remains critical to ensure performance and resource efficiency. Traditional scheduling algorithms often fall short due to limited adaptability and consideration of only a few system parameters. In this paper, a novel integrated deep learning-assisted scheduling framework is utilized for scheduling big data over a cloud environment. The proposed framework integrated deep reinforcement learning with the bidirectional gated recurrent unit (IDL-BiGRU) model to intelligently schedule tasks based on real-time system states. The IDL-BiGRU model leverages the advantage of deep Q-learning for decision making and BiGRU's ability to capture bidirectional temporal dependencies in task and resource usage patterns. In this work, RAM, CPU, bandwidth utilization of the network, and disk storage are considered for scheduling purposes. The suggested method is to shorten the makespan and increase resource utilization. The Java tool is utilized for conducting the experimental verifications. Analysis and comparison of the suggested deep learning framework's performance with current methods are done. For 1000 tasks, the proposed method attains 0.90 degrees of imbalance, 291.17 ms downtime, 1050 ms throughput, and 721.58 makespan. The performance analysis demonstrates that the suggested strategy outperforms previous methods.</div></div>","PeriodicalId":55184,"journal":{"name":"Data & Knowledge Engineering","volume":"160 ","pages":"Article 102489"},"PeriodicalIF":2.7000,"publicationDate":"2025-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"IDL-BiGRU: Integrated deep learning assisted smart scheduling of big data over cloud environment\",\"authors\":\"Rama Satish K V , Vibha M B , Lovely Sasidharan\",\"doi\":\"10.1016/j.datak.2025.102489\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The rapid expansion of Internet of Things (IoT) applications generates a continuous and massive flow of data, creating significant challenges in both data processing and storage management. Cloud computing offers scalable infrastructure to handle such data intensive workloads, but optimal task scheduling remains critical to ensure performance and resource efficiency. Traditional scheduling algorithms often fall short due to limited adaptability and consideration of only a few system parameters. In this paper, a novel integrated deep learning-assisted scheduling framework is utilized for scheduling big data over a cloud environment. The proposed framework integrated deep reinforcement learning with the bidirectional gated recurrent unit (IDL-BiGRU) model to intelligently schedule tasks based on real-time system states. The IDL-BiGRU model leverages the advantage of deep Q-learning for decision making and BiGRU's ability to capture bidirectional temporal dependencies in task and resource usage patterns. In this work, RAM, CPU, bandwidth utilization of the network, and disk storage are considered for scheduling purposes. The suggested method is to shorten the makespan and increase resource utilization. The Java tool is utilized for conducting the experimental verifications. Analysis and comparison of the suggested deep learning framework's performance with current methods are done. For 1000 tasks, the proposed method attains 0.90 degrees of imbalance, 291.17 ms downtime, 1050 ms throughput, and 721.58 makespan. The performance analysis demonstrates that the suggested strategy outperforms previous methods.</div></div>\",\"PeriodicalId\":55184,\"journal\":{\"name\":\"Data & Knowledge Engineering\",\"volume\":\"160 \",\"pages\":\"Article 102489\"},\"PeriodicalIF\":2.7000,\"publicationDate\":\"2025-07-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Data & Knowledge Engineering\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0169023X25000849\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data & Knowledge Engineering","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169023X25000849","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
IDL-BiGRU: Integrated deep learning assisted smart scheduling of big data over cloud environment
The rapid expansion of Internet of Things (IoT) applications generates a continuous and massive flow of data, creating significant challenges in both data processing and storage management. Cloud computing offers scalable infrastructure to handle such data intensive workloads, but optimal task scheduling remains critical to ensure performance and resource efficiency. Traditional scheduling algorithms often fall short due to limited adaptability and consideration of only a few system parameters. In this paper, a novel integrated deep learning-assisted scheduling framework is utilized for scheduling big data over a cloud environment. The proposed framework integrated deep reinforcement learning with the bidirectional gated recurrent unit (IDL-BiGRU) model to intelligently schedule tasks based on real-time system states. The IDL-BiGRU model leverages the advantage of deep Q-learning for decision making and BiGRU's ability to capture bidirectional temporal dependencies in task and resource usage patterns. In this work, RAM, CPU, bandwidth utilization of the network, and disk storage are considered for scheduling purposes. The suggested method is to shorten the makespan and increase resource utilization. The Java tool is utilized for conducting the experimental verifications. Analysis and comparison of the suggested deep learning framework's performance with current methods are done. For 1000 tasks, the proposed method attains 0.90 degrees of imbalance, 291.17 ms downtime, 1050 ms throughput, and 721.58 makespan. The performance analysis demonstrates that the suggested strategy outperforms previous methods.
期刊介绍:
Data & Knowledge Engineering (DKE) stimulates the exchange of ideas and interaction between these two related fields of interest. DKE reaches a world-wide audience of researchers, designers, managers and users. The major aim of the journal is to identify, investigate and analyze the underlying principles in the design and effective use of these systems.