Evaluating Machine Learning Models for Disparate Computer Systems Performance Prediction

2020 IEEE International Conference on Electronics, Computing and Communication Technologies (CONECCT) Pub Date : 2020-07-01 DOI:10.1109/CONECCT50063.2020.9198512

Amit Mankodi, Amit Bhatt, B. Chaudhury, Rajat Kumar, Aditya Amrutiya

{"title":"Evaluating Machine Learning Models for Disparate Computer Systems Performance Prediction","authors":"Amit Mankodi, Amit Bhatt, B. Chaudhury, Rajat Kumar, Aditya Amrutiya","doi":"10.1109/CONECCT50063.2020.9198512","DOIUrl":null,"url":null,"abstract":"Performance prediction is an active area of research due to its applicability in the advancements of hardware-software co-development. Several empirical machine-learning models, such as linear models, non-linear models, probabilistic models, tree-based models and, neural networks, are used for performance prediction. Furthermore, the prediction model’s accuracy may vary depending on performance data gathered for different software types (compute-bound, memory-bound) and different hardware (simulation-based or physical systems). We have examined fourteen machine-learning models on simulation-based hardware and physical systems by executing several benchmark programs with different computation and data access patterns. Our results show that the tree-based machine-learning models outperform all other models with median absolute percentage error (MedAPE) of less than 5% followed by bagging and boosting models that help to improve weak learners. We have also observed that prediction accuracy is higher on simulation-based hardware due to its deterministic nature as compared to physical systems. Moreover, in physical systems, the prediction accuracy of memory-bound algorithms is higher as compared to compute-bound algorithms due to manufacturer variability in processors.","PeriodicalId":261794,"journal":{"name":"2020 IEEE International Conference on Electronics, Computing and Communication Technologies (CONECCT)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE International Conference on Electronics, Computing and Communication Technologies (CONECCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CONECCT50063.2020.9198512","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

Performance prediction is an active area of research due to its applicability in the advancements of hardware-software co-development. Several empirical machine-learning models, such as linear models, non-linear models, probabilistic models, tree-based models and, neural networks, are used for performance prediction. Furthermore, the prediction model’s accuracy may vary depending on performance data gathered for different software types (compute-bound, memory-bound) and different hardware (simulation-based or physical systems). We have examined fourteen machine-learning models on simulation-based hardware and physical systems by executing several benchmark programs with different computation and data access patterns. Our results show that the tree-based machine-learning models outperform all other models with median absolute percentage error (MedAPE) of less than 5% followed by bagging and boosting models that help to improve weak learners. We have also observed that prediction accuracy is higher on simulation-based hardware due to its deterministic nature as compared to physical systems. Moreover, in physical systems, the prediction accuracy of memory-bound algorithms is higher as compared to compute-bound algorithms due to manufacturer variability in processors.

查看原文本刊更多论文

评估不同计算机系统性能预测的机器学习模型

性能预测由于其在软硬件协同开发中的适用性而成为一个活跃的研究领域。一些经验机器学习模型，如线性模型、非线性模型、概率模型、基于树的模型和神经网络，被用于性能预测。此外，预测模型的准确性可能会因不同软件类型(计算型、内存型)和不同硬件(基于模拟或物理系统)收集的性能数据而有所不同。我们通过执行几个具有不同计算和数据访问模式的基准程序，在基于仿真的硬件和物理系统上检查了14个机器学习模型。我们的研究结果表明，基于树的机器学习模型的表现优于所有其他模型，其中位数绝对百分比误差(MedAPE)小于5%，其次是有助于改善弱学习者的bagging和boosting模型。我们还观察到，与物理系统相比，基于仿真的硬件的预测准确性更高，因为它具有确定性。此外，在物理系统中，由于处理器的制造商可变性，与计算绑定算法相比，内存绑定算法的预测精度更高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 IEEE International Conference on Electronics, Computing and Communication Technologies (CONECCT)

自引率

0.00%

发文量