{"title":"ARTS:一种近似简化树和基于分割的乘法器","authors":"Mahla Salehi Sheikhali Kelayeh , Sahand Divsalar , Shaghayegh Vahdat , Nima TaheriNejad","doi":"10.1016/j.future.2025.108098","DOIUrl":null,"url":null,"abstract":"<div><div>Due to the increasing use of machine learning applications in daily human life, efficient hardware implementation of these applications has turned into a serious challenge recently. Multipliers are one of the most prevalent, and at the same time expensive (from a hardware perspective), arithmetic units used in such applications. In this paper, a novel approximate multiplier, called ARTS, is designed based on the idea of dividing input operands into different static segments and completing certain steps of the calculation approximately by using simplified reduction trees to sum up the partial products. ARTS manifests significant improvements in hardware characteristics. Namely, 68.6%, 16.5%, and 60% improvements in power, delay, and area are achieved with respect to an exact 8-bit Wallace multiplier, while up to 59.8%, 37.2%, and 52.8% improvements are obtained compared to the other start-of-the-art (SoTA) approximate multipliers. The efficiency of ARTS is assessed in image processing and DNN applications. ARTS shows up to 91.4% and 28.3% better PSNR and 52.4% and 20.5% better SSIM in image multiplication and Sobel edge detection applications, respectively, compared to the other SoTA approximate multipliers. In DNN applications, ARTS exhibits outstanding performance, achieving up to 84.8% higher classification accuracy compared to SoTA approximate designs with similar hardware characteristics. Additionally, when compared to SoTA designs offering comparable accuracy, ARTS achieves this performance with up to 191% lower energy consumption.</div></div>","PeriodicalId":55132,"journal":{"name":"Future Generation Computer Systems-The International Journal of Escience","volume":"175 ","pages":"Article 108098"},"PeriodicalIF":6.2000,"publicationDate":"2025-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"ARTS: An approximate reduced tree and segmentation-based multiplier\",\"authors\":\"Mahla Salehi Sheikhali Kelayeh , Sahand Divsalar , Shaghayegh Vahdat , Nima TaheriNejad\",\"doi\":\"10.1016/j.future.2025.108098\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Due to the increasing use of machine learning applications in daily human life, efficient hardware implementation of these applications has turned into a serious challenge recently. Multipliers are one of the most prevalent, and at the same time expensive (from a hardware perspective), arithmetic units used in such applications. In this paper, a novel approximate multiplier, called ARTS, is designed based on the idea of dividing input operands into different static segments and completing certain steps of the calculation approximately by using simplified reduction trees to sum up the partial products. ARTS manifests significant improvements in hardware characteristics. Namely, 68.6%, 16.5%, and 60% improvements in power, delay, and area are achieved with respect to an exact 8-bit Wallace multiplier, while up to 59.8%, 37.2%, and 52.8% improvements are obtained compared to the other start-of-the-art (SoTA) approximate multipliers. The efficiency of ARTS is assessed in image processing and DNN applications. ARTS shows up to 91.4% and 28.3% better PSNR and 52.4% and 20.5% better SSIM in image multiplication and Sobel edge detection applications, respectively, compared to the other SoTA approximate multipliers. In DNN applications, ARTS exhibits outstanding performance, achieving up to 84.8% higher classification accuracy compared to SoTA approximate designs with similar hardware characteristics. Additionally, when compared to SoTA designs offering comparable accuracy, ARTS achieves this performance with up to 191% lower energy consumption.</div></div>\",\"PeriodicalId\":55132,\"journal\":{\"name\":\"Future Generation Computer Systems-The International Journal of Escience\",\"volume\":\"175 \",\"pages\":\"Article 108098\"},\"PeriodicalIF\":6.2000,\"publicationDate\":\"2025-08-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Future Generation Computer Systems-The International Journal of Escience\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0167739X25003929\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, THEORY & METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Future Generation Computer Systems-The International Journal of Escience","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167739X25003929","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
ARTS: An approximate reduced tree and segmentation-based multiplier
Due to the increasing use of machine learning applications in daily human life, efficient hardware implementation of these applications has turned into a serious challenge recently. Multipliers are one of the most prevalent, and at the same time expensive (from a hardware perspective), arithmetic units used in such applications. In this paper, a novel approximate multiplier, called ARTS, is designed based on the idea of dividing input operands into different static segments and completing certain steps of the calculation approximately by using simplified reduction trees to sum up the partial products. ARTS manifests significant improvements in hardware characteristics. Namely, 68.6%, 16.5%, and 60% improvements in power, delay, and area are achieved with respect to an exact 8-bit Wallace multiplier, while up to 59.8%, 37.2%, and 52.8% improvements are obtained compared to the other start-of-the-art (SoTA) approximate multipliers. The efficiency of ARTS is assessed in image processing and DNN applications. ARTS shows up to 91.4% and 28.3% better PSNR and 52.4% and 20.5% better SSIM in image multiplication and Sobel edge detection applications, respectively, compared to the other SoTA approximate multipliers. In DNN applications, ARTS exhibits outstanding performance, achieving up to 84.8% higher classification accuracy compared to SoTA approximate designs with similar hardware characteristics. Additionally, when compared to SoTA designs offering comparable accuracy, ARTS achieves this performance with up to 191% lower energy consumption.
期刊介绍:
Computing infrastructures and systems are constantly evolving, resulting in increasingly complex and collaborative scientific applications. To cope with these advancements, there is a growing need for collaborative tools that can effectively map, control, and execute these applications.
Furthermore, with the explosion of Big Data, there is a requirement for innovative methods and infrastructures to collect, analyze, and derive meaningful insights from the vast amount of data generated. This necessitates the integration of computational and storage capabilities, databases, sensors, and human collaboration.
Future Generation Computer Systems aims to pioneer advancements in distributed systems, collaborative environments, high-performance computing, and Big Data analytics. It strives to stay at the forefront of developments in grids, clouds, and the Internet of Things (IoT) to effectively address the challenges posed by these wide-area, fully distributed sensing and computing systems.