{"title":"Queuing-based energy-efficient processing algorithm for smart transportation through V2V communication","authors":"Laya Mohammadi, Vahid Khajehvand","doi":"10.1002/cpe.8235","DOIUrl":"10.1002/cpe.8235","url":null,"abstract":"<div>\u0000 \u0000 <p>Applications of intelligent systems installed in vehicles require substantial computational processing for various tasks. These intensive computations result in high energy consumption and power demands within vehicles. Computational offloading based on Vehicle-to-Vehicle (V2V) communication in vehicular fog computing (VFC) has been proposed as a promising solution to enhance energy efficiency in transportation applications. In this paper, the primary objective is addressing this concern by identifying the optimal nearby vehicle that minimizes energy consumption for the offloading and execution of computational tasks. Therefore, a decision-making and intelligent task offloading mechanism based on queueing theory is proposed. By modeling the problem environment based on queueing theory and modeling the behavior of distributed tasks with discrete-time Markov chain, the proposed solution can predict the future behavior of vehicles in selecting the most energy-efficient processing node. Therefore, this paper investigates three energy decision parameters based on queueing theory extracted from the Markov model to enhance the performance of the proposed algorithm. Experimental results demonstrate that the computational energy parameter achieves the most significant improvement. The proposed algorithm outperforms previous methods, improving energy-efficient system performance by 6.25% and 2.67%, and reducing delivery failure rate by 6.52% and 2.72%. It also decreases overall transportation system processing energy consumption by 0.05% for 100–500 vehicle arrival rates, resulting in an average total processing energy consumption of 0.48%.</p>\u0000 </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"36 23","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141870626","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Updated delegated proof of stake and Nash equilibrium: A mining pool game model for blockchain network","authors":"Namratha M, Kunwar Singh","doi":"10.1002/cpe.8224","DOIUrl":"10.1002/cpe.8224","url":null,"abstract":"<div>\u0000 \u0000 <p>In the present era, blockchain technology has an extensive variety of uses in the financial, marketing, and so forth. The performance and security of the blockchain system are directly affected by the consensus algorithm. Since there are 30 consensus techniques like proof of stake, delegated proof of stake, and proof of work, and so forth in the previous years, the operating efficiency, safety, and stability still lag behind the basic requirements. To this concern, this research manuscript introduces a novel consensus mechanism-based game model using upgraded delegated proof of stake (UDPoS) and Nash equilibrium (NE) that is UDPoS-NE. In a blockchain network powered by UDPoS, blockchain miners provide computing power to publish blocks. Due to a single miner's limited Computing Power (CP), miners frequently join mining pools and divide the pool's profits in accordance with their respective contributions. However, certain miners launch block-withholding attacks that lead to loss the computer power and endanger the blockchain network's effectiveness. Therefore, NE is used to calculate cooperative game solutions and optimize policy decisions of the miners to frustrate the block withholding attack by the influence of stakes and computing power on producing blocks. Using this mechanism, efficiency, and stability in the consensus process is achieved through the combined influence of computing resources and stakes on block generation. Finally, the suggested model's applicability and validity are confirmed with a throughput (600 TPS), delay (3 s), and energy consumption (<100 KWh) for 10 nodes and processing time (0.9 s) for 200 nodes.</p>\u0000 </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"36 23","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141870627","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Optimization design of supply chain network based on BP neural network performance evaluation and feedback mechanism","authors":"Yao Wu, Weiwei Liu","doi":"10.1002/cpe.8233","DOIUrl":"10.1002/cpe.8233","url":null,"abstract":"<div>\u0000 \u0000 <p>This paper proposes a supply chain network design method suitable for multi-product and multi-inventory models, and uses the improved BP neural network to evaluate and provide feedback on the collaborative performance of the supply chain, adjusting the supply chain network design scheme on time. In the context of the Internet of Things (IoT) in manufacturing, it has been found that supply chain operations are difficult to meet personalized customer needs with high precision and quality. Therefore, we adopted a dynamic library strategy, supply chain network optimization model, hybrid algorithm, and the improved BP neural network to solve the above problems. First, this paper designs a corresponding inventory strategy selection mechanism for the various ordering methods of retailers in the manufacturing IoT environment. Based on this, we have constructed a dual objective model for a sustainable supply chain network to minimize total cost and maximize customer satisfaction. Second, we have developed a hybrid improved Grey Wolf and Whale Algorithm (OLDGWOA) that can accurately solve the above model. The hybrid algorithm divides the population into two parts through opposition-based learning, and then we use the improved grey wolf algorithm and whale algorithm to solve the two populations, and seek the optimal solution in the results, resulting in a hybrid algorithm. Finally, we constructed a supply chain performance evaluation model and feedback mechanism based on the improved BP neural network to adjust inventory strategies and network design at any time. We also validated the developed model and algorithm through numerical examples, and the results showed that: (1) the hybrid algorithm has certain advantages in search and solution speed, (2) the advantages of supply chain network design based on supply chain performance evaluation and feedback mechanisms, and (3) the trade-off between ordering methods and inventory strategies, as well as the trade-off between location and inventory strategies.</p>\u0000 </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"36 23","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141779637","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jinsheng Fang, Hanjiang Lin, Jianglong Zhao, Kun Zeng
{"title":"An efficient multi-scale large asymmetric-kernel network for lightweight image super-resolution","authors":"Jinsheng Fang, Hanjiang Lin, Jianglong Zhao, Kun Zeng","doi":"10.1002/cpe.8240","DOIUrl":"10.1002/cpe.8240","url":null,"abstract":"<div>\u0000 \u0000 <p>Recently, lightweight convolutional neural networks (CNNs) on single image super-resolution (SISR) tasks have received impressive improvement with delicate structures. However, numerous lightweight methods may reduce the representation capacity of the network due to decreasing the model size and computational complexities, leading to unsatisfactory performance. In this paper, we propose an efficient multi-scale large asymmetric-kernel network (MLAN) for lightweight SISR. Specifically, MLAN is built with a succession of feature cross extraction blocks (FCEBs), which better models local and long-range interactive information of features for SR. Each of the FCEB contains a multi-scale asymmetric large-kernel attention block (MACAB) by using multiple convolutional kernels to extract features in different receptive fields and a gated mechanism to preserve the useful information for SR. Extensive experimental results on five public benchmark datasets demonstrate the superiority of MLAN over the other advanced lightweight SISR competitors. The average PSNR values are about 0.12, 0.17 and 0.11 dB greater than the second-best competitors under scaling factors of ×2, ×3 and ×4, respectively. The proposed efficient blocks enable our MLAN to make a better balance between model size and performance and achieve comparable performance with Transformer-based methods at a similar level of parameters.</p>\u0000 </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"36 23","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141779638","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Secure paths based trustworthy fault-tolerant routing in data center networks","authors":"Kaiyun Liu, Weibei Fan, Fu Xiao, Haolin Mao, Huipeng Huang, Yizhou Zhao","doi":"10.1002/cpe.8229","DOIUrl":"10.1002/cpe.8229","url":null,"abstract":"<div>\u0000 \u0000 <p>With the continuous expansion scale of data center networks (DCNs), the probability of network failures becomes high. Trustworthy fault-tolerant routing is extremely significant for reliable communication in data centers. In this article, we tackle the challenge by proposing a novel fault-tolerant routing scheme for a torus-based DCN. First, we present a multipath information transmission model based on the trust degree of reachable paths and propose a novel Hamiltonian odd–even turning model without deadlock. Second, we design an efficient deadlock-free fault-routing algorithm by constructing the longest fault-free path between any two fault-free nodes in DCN. Extensive simulation results show that the proposed fault-tolerant routing outperforms the previous algorithms. Compared with the most advanced fault-tolerant routing algorithms, the proposed algorithm has a 21.5% to 25.3% increase in throughput and packet arrival rate. Moreover, it can reduce the average delay of 18.6% and the maximum delay of 23.7% in the network respectively.</p>\u0000 </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"36 23","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141779641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Taha Abdelazziz Rahmani, Ghalem Belalem, Sidi Ahmed Mahmoudi, Omar Rafik Merad-Boudia
{"title":"Equalizer: Energy-efficient machine learning-based heterogeneous cluster load balancer","authors":"Taha Abdelazziz Rahmani, Ghalem Belalem, Sidi Ahmed Mahmoudi, Omar Rafik Merad-Boudia","doi":"10.1002/cpe.8230","DOIUrl":"10.1002/cpe.8230","url":null,"abstract":"<div>\u0000 \u0000 <p>Heterogeneous systems deliver high computing performance when effectively utilized. It is crucial to execute each application on the most suitable device while maintaining system balance. However, achieving equal distribution of the computing load is challenging due to variations in computing power and device architectures within the system. Moreover, scheduling applications at real-time further complicates this task, as prior information about the submitted applications is absent. In this context, we introduce “Equalizer,” a real-time load balancer for heterogeneous systems. “Equalizer” leverages machine learning to continuously monitor the system's state, predicting optimal devices for application execution at runtime. It assigns applications to devices that minimize system imbalance. To quantify system imbalance, we propose a novel metric that reflects the disparity in computing loads across the system's devices. This metric is calculated using predicted execution times of applications. To validate the performance of “Equalizer,” we conducted a comparative study against widely adopted approaches, namely Round Robin and Device Suitability. The experiments were performed on a heterogeneous cluster comprising a master host and three slave servers, equipped with a total of 4 central processing units (CPUs) and 4 graphics processing units (GPUs). All approaches were deployed on the cluster and evaluated using three distinct workloads categorized by their computing intensity: medium intensity, heavy intensity, and a combination of heavy and medium intensity, simulating real-world scenarios. Each workload consisted of a set of 80 OpenCL applications with varying input data sizes. The experimental results demonstrate that “Equalizer” effectively minimized the system's imbalance, reduced the idle time of devices, and eliminated overloads. Moreover, “Equalizer” exhibited significant improvements in workload execution time, resource utilization, throughput, and energy consumption. Across all tested scenarios, “Equalizer” consistently outperformed alternative approaches, showcasing its robustness, adaptability to dynamic environments, and applicability in real-world practice.</p>\u0000 </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"36 23","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141779640","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Glenn K. Lockwood, Alberto Chiusole, Lisa Gerhardt, Kirill Lozinskiy, David Paul, Nicholas J. Wright
{"title":"Architecture and performance of Perlmutter's 35 PB ClusterStor E1000 all-flash file system","authors":"Glenn K. Lockwood, Alberto Chiusole, Lisa Gerhardt, Kirill Lozinskiy, David Paul, Nicholas J. Wright","doi":"10.1002/cpe.8143","DOIUrl":"10.1002/cpe.8143","url":null,"abstract":"<div>\u0000 \u0000 <p>NERSC's newest system, Perlmutter, features a 35 PB all-flash Lustre file system built on HPE Cray ClusterStor E1000. We present its architecture, early performance figures, and performance considerations unique to this architecture. We demonstrate the performance of E1000 OSSes through low-level Lustre tests that achieve over 90% of the theoretical bandwidth of the SSDs at the OST and LNet levels. We also show end-to-end performance for both traditional dimensions of I/O performance (peak bulk-synchronous bandwidth) and nonoptimal workloads endemic to production computing (small, incoherent I/Os at random offsets) and compare them to NERSC's previous system, Cori, to illustrate that Perlmutter achieves the performance of a burst buffer and the resilience of a scratch file system. Finally, we discuss performance considerations unique to all-flash Lustre and present ways in which users and HPC facilities can adjust their I/O patterns and operations to make optimal use of such architectures.</p>\u0000 </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"36 23","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141779642","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The quality evaluation system of ideological and political classroom teaching in universities based on GA-BP algorithm","authors":"Guohua Jing","doi":"10.1002/cpe.8228","DOIUrl":"10.1002/cpe.8228","url":null,"abstract":"<div>\u0000 \u0000 <p>The advancement of teaching quality is an indispensable section of the reform and growth of universities, and ideological and political education has critical impact on ideological education. The quality of classroom education can provide data support for efficient development, and has crucial influence on achieving scientific, reasonable, and accurate evaluation of ideological and political teaching performance. Thus, a performance assessment system for ideological and political education in universities with genetic algorithm optimized neural network algorithm is put forward. First, based on existing teaching evaluation indicators and combined with actual situations, a targeted teaching quality evaluation system is proposed. Then, based on BP, an adaptive genetic algorithm is proposed for improvement, and the output results are improved using entropy method. The results indicated that the proposed model could reach its optimal state after 81 iterations in this study. In the fitting test, it reached 0.971. In actual testing, the average error was only 2.68, which was much bigger than the other three algorithms. Its accuracy was 2%–3.2% higher than that of the best existing algorithms. These results indicated that the method put forward in this study had better practical significance, lower error, more accurate evaluation results, and offered scientific data support for the education reform work of universities, which can better accelerate the development and construction of universities.</p>\u0000 </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"36 23","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141779643","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Label distribution feature selection based on neighborhood rough set","authors":"Yilin Wu, Wenzhong Guo, Yaojin Lin","doi":"10.1002/cpe.8236","DOIUrl":"10.1002/cpe.8236","url":null,"abstract":"<div>\u0000 \u0000 <p>In label distribution learning (LDL), an instance is involved with many labels in different importance degrees, and the feature space of instances is accompanied with thousands of redundant and/or irrelevant features. Therefore, the main characteristic of feature selection in LDL is to evaluate the ability of each feature. Motivated by neighborhood rough set (NRS), which can be used to measure the dependency degree of feature via constructing neighborhood relations on feature space and label space, respectively, this article proposes a novel label distribution feature selection method. In this article, the neighborhood class of instance in label distribution space is defined, which is beneficial to recognize the logical class of target instance. Then, a new NRS model for LDL is proposed. Specially, the dependency degree of feature combining label weight is defined. Finally, a label distribution feature selection based on NRS is presented. Extensive experiments on 12 data sets show the effectiveness of the proposed algorithm.</p>\u0000 </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"36 23","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141779634","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Amir Shehata, Thomas Naughton, David E. Bernholdt, Howard Pritchard
{"title":"Bringing HPE Slingshot 11 support to Open MPI","authors":"Amir Shehata, Thomas Naughton, David E. Bernholdt, Howard Pritchard","doi":"10.1002/cpe.8203","DOIUrl":"10.1002/cpe.8203","url":null,"abstract":"<div>\u0000 \u0000 <p>The Cray HPE Slingshot 11 network is used on the new exascale systems arriving at the U.S. Department of Energy (DoE) laboratories (e.g., <i>Frontier</i>, <i>Aurora</i>, <i>Perlmutter</i>). As such, the support of this network is an important capability to meet the needs of exascale applications. This article highlights recent work to develop supporting infrastructure to enable Open MPI to efficiently support these new platforms. A key component of this effort involves development of a new Open Fabrics Interface (OFI) provider, <i>LinkX</i>. We discuss the design and development of enhancements that take advantage of the new Slingshot 11 network and AMD GPUs. We include performance data from tests on the <i>Frontier</i> supercomputer using synthetic communication benchmarks, and the vendor provided MPI as a baseline for comparison. The tests demonstrate full functionality of Open MPI on the system and initial results show favorable performance when compared to the highly tuned vendor implementation.</p>\u0000 </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"36 22","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141743571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}