基于移动异构多核架构的能量感知HEVC软件解码

PARMA-DITAM@HiPEAC Pub Date : 1900-01-01 DOI:10.4230/OASIcs.PARMA-DITAM.2022.4

Mohammed Bey Ahmed Khernache, Jalil Boukhobza, Yahia Benmoussa, D. Ménard

{"title":"基于移动异构多核架构的能量感知HEVC软件解码","authors":"Mohammed Bey Ahmed Khernache, Jalil Boukhobza, Yahia Benmoussa, D. Ménard","doi":"10.4230/OASIcs.PARMA-DITAM.2022.4","DOIUrl":null,"url":null,"abstract":"Video content is becoming increasingly omnipresent on mobile platforms thanks to advances in mobile heterogeneous architectures. These platforms typically include limited rechargeable batteries which do not improve as fast as video content. Most state-of-the-art studies proposed solutions based on parallelism to exploit the GPP heterogeneity and DVFS to scale up/down the GPP frequency based on the video workload. However, some studies assume to have information about the workload before to start decoding. Others do not exploit the asymmetry character of recent mobile architectures. To address these two challenges, we propose a solution based on classification and frequency scaling. First, a model to classify frames based on their type and size is built during design-time. Second, this model is applied for each frame to decide which GPP cores will decode it. Third, the frequency of the chosen GPP cores is dynamically adjusted based on the output buffer size. Experiments on real-world mobile platforms show that the proposed solution can save more than 20% of energy (mJ/Frame) compared to the Ondemand Linux governor with less than 5% of miss-rate. Moreover, it needs less than one second of decoding to enter the stable state and the overhead represents less than 1% of the frame decoding time.","PeriodicalId":436349,"journal":{"name":"PARMA-DITAM@HiPEAC","volume":"216 1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Energy-Aware HEVC Software Decoding On Mobile Heterogeneous Multi-Cores Architectures\",\"authors\":\"Mohammed Bey Ahmed Khernache, Jalil Boukhobza, Yahia Benmoussa, D. Ménard\",\"doi\":\"10.4230/OASIcs.PARMA-DITAM.2022.4\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Video content is becoming increasingly omnipresent on mobile platforms thanks to advances in mobile heterogeneous architectures. These platforms typically include limited rechargeable batteries which do not improve as fast as video content. Most state-of-the-art studies proposed solutions based on parallelism to exploit the GPP heterogeneity and DVFS to scale up/down the GPP frequency based on the video workload. However, some studies assume to have information about the workload before to start decoding. Others do not exploit the asymmetry character of recent mobile architectures. To address these two challenges, we propose a solution based on classification and frequency scaling. First, a model to classify frames based on their type and size is built during design-time. Second, this model is applied for each frame to decide which GPP cores will decode it. Third, the frequency of the chosen GPP cores is dynamically adjusted based on the output buffer size. Experiments on real-world mobile platforms show that the proposed solution can save more than 20% of energy (mJ/Frame) compared to the Ondemand Linux governor with less than 5% of miss-rate. Moreover, it needs less than one second of decoding to enter the stable state and the overhead represents less than 1% of the frame decoding time.\",\"PeriodicalId\":436349,\"journal\":{\"name\":\"PARMA-DITAM@HiPEAC\",\"volume\":\"216 1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"PARMA-DITAM@HiPEAC\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.4230/OASIcs.PARMA-DITAM.2022.4\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"PARMA-DITAM@HiPEAC","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4230/OASIcs.PARMA-DITAM.2022.4","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

由于移动异构架构的进步，视频内容在移动平台上变得越来越无所不在。这些平台通常包括有限的可充电电池，其改进速度不如视频内容快。大多数最新的研究提出了基于并行的解决方案来利用GPP的异构性和DVFS来根据视频工作负载放大/缩小GPP频率。然而，一些研究假设在开始解码之前有关于工作量的信息。另一些则没有利用最近移动架构的不对称特征。为了解决这两个问题，我们提出了一种基于分类和频率缩放的解决方案。首先，在设计阶段根据框架的类型和尺寸建立框架分类模型。其次，将该模型应用于每一帧，以确定哪些GPP核心将对其进行解码。第三，根据输出缓冲区大小动态调整所选GPP核心的频率。在实际移动平台上的实验表明，与Ondemand Linux调控器相比，所提出的解决方案可以节省20%以上的能量(mJ/Frame)，失误率不到5%。此外，它只需要不到1秒的解码就可以进入稳定状态，开销不到帧解码时间的1%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Energy-Aware HEVC Software Decoding On Mobile Heterogeneous Multi-Cores Architectures

Video content is becoming increasingly omnipresent on mobile platforms thanks to advances in mobile heterogeneous architectures. These platforms typically include limited rechargeable batteries which do not improve as fast as video content. Most state-of-the-art studies proposed solutions based on parallelism to exploit the GPP heterogeneity and DVFS to scale up/down the GPP frequency based on the video workload. However, some studies assume to have information about the workload before to start decoding. Others do not exploit the asymmetry character of recent mobile architectures. To address these two challenges, we propose a solution based on classification and frequency scaling. First, a model to classify frames based on their type and size is built during design-time. Second, this model is applied for each frame to decide which GPP cores will decode it. Third, the frequency of the chosen GPP cores is dynamically adjusted based on the output buffer size. Experiments on real-world mobile platforms show that the proposed solution can save more than 20% of energy (mJ/Frame) compared to the Ondemand Linux governor with less than 5% of miss-rate. Moreover, it needs less than one second of decoding to enter the stable state and the overhead represents less than 1% of the frame decoding time.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

PARMA-DITAM@HiPEAC

自引率

0.00%

发文量