基于兴趣域的H.264低功耗视频通信编码参数分配

Minghui Wang, T. Zhang, Chen Liu, S. Goto
{"title":"基于兴趣域的H.264低功耗视频通信编码参数分配","authors":"Minghui Wang, T. Zhang, Chen Liu, S. Goto","doi":"10.1109/CSPA.2009.5069223","DOIUrl":null,"url":null,"abstract":"H.264 is the state-of-the-art in modern video compression standards. Its extremely high compression ratio meets the requirements the video communication between portable devices. Since the power is limited in portable devices, the huge computation of H.264 is a critical problem for hardware design. According to the human visual system (HVS) research, human vision is only able to focus on one area in a frame, which is defined as region-of-interest (ROI). In most cases, human face attracts the most attention of the device user. This phenomenon gives a chance to code all macroblocks unequally. In this work, the ROI is detected with an encoder-oriented fast algorithm, using chrominance and texture contrast features. After the ROI is detected, the encoder will allocated the coding parameters respectively in ROI and non-ROI. As a result, it keeps fine quality in ROI, saves much throughput in non-ROI, and greatly reduces the computation. The ROI detector and the encoder are also designed to be decoding-friendly and hardware-friendly.","PeriodicalId":338469,"journal":{"name":"2009 5th International Colloquium on Signal Processing & Its Applications","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":"{\"title\":\"Region-of-interest based H.264 encoding parameter allocation for low power video communication\",\"authors\":\"Minghui Wang, T. Zhang, Chen Liu, S. Goto\",\"doi\":\"10.1109/CSPA.2009.5069223\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"H.264 is the state-of-the-art in modern video compression standards. Its extremely high compression ratio meets the requirements the video communication between portable devices. Since the power is limited in portable devices, the huge computation of H.264 is a critical problem for hardware design. According to the human visual system (HVS) research, human vision is only able to focus on one area in a frame, which is defined as region-of-interest (ROI). In most cases, human face attracts the most attention of the device user. This phenomenon gives a chance to code all macroblocks unequally. In this work, the ROI is detected with an encoder-oriented fast algorithm, using chrominance and texture contrast features. After the ROI is detected, the encoder will allocated the coding parameters respectively in ROI and non-ROI. As a result, it keeps fine quality in ROI, saves much throughput in non-ROI, and greatly reduces the computation. The ROI detector and the encoder are also designed to be decoding-friendly and hardware-friendly.\",\"PeriodicalId\":338469,\"journal\":{\"name\":\"2009 5th International Colloquium on Signal Processing & Its Applications\",\"volume\":\"27 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-03-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"12\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 5th International Colloquium on Signal Processing & Its Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSPA.2009.5069223\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 5th International Colloquium on Signal Processing & Its Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSPA.2009.5069223","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12

摘要

H.264是现代视频压缩标准中最先进的。其极高的压缩比满足了便携式设备间视频通信的要求。由于便携式设备的功率有限,H.264的巨大计算量是硬件设计的关键问题。根据人类视觉系统(HVS)的研究,人类的视觉只能聚焦在一帧图像中的一个区域,这个区域被定义为兴趣区域(ROI)。在大多数情况下,人脸最能吸引设备用户的注意力。这种现象给所有宏块的编码不平等提供了机会。在这项工作中,利用色度和纹理对比度特征,使用面向编码器的快速算法检测感兴趣区域。检测到感兴趣区域后,编码器将在感兴趣区域和非感兴趣区域分别分配编码参数。结果表明,该方法在ROI中保持了良好的质量,在非ROI中节省了大量吞吐量,大大减少了计算量。感兴趣检测器和编码器也被设计为解码友好和硬件友好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Region-of-interest based H.264 encoding parameter allocation for low power video communication
H.264 is the state-of-the-art in modern video compression standards. Its extremely high compression ratio meets the requirements the video communication between portable devices. Since the power is limited in portable devices, the huge computation of H.264 is a critical problem for hardware design. According to the human visual system (HVS) research, human vision is only able to focus on one area in a frame, which is defined as region-of-interest (ROI). In most cases, human face attracts the most attention of the device user. This phenomenon gives a chance to code all macroblocks unequally. In this work, the ROI is detected with an encoder-oriented fast algorithm, using chrominance and texture contrast features. After the ROI is detected, the encoder will allocated the coding parameters respectively in ROI and non-ROI. As a result, it keeps fine quality in ROI, saves much throughput in non-ROI, and greatly reduces the computation. The ROI detector and the encoder are also designed to be decoding-friendly and hardware-friendly.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信