{"title":"一种高效的AVS高清视频编码器模式决策算法和体系结构","authors":"Shuai Li, Chuang Zhu, Fei Wang, Huizhu Jia, Xiaodong Xie, Wen Gao","doi":"10.1109/IST.2012.6295561","DOIUrl":null,"url":null,"abstract":"In Advanced Audio Video coding Standard (AVS), the utilization of variable block size ranging from 16×16 to 8×8 in inter frame encoding improves the coding efficiency significantly compared with a fixed MB partition. Rate distortion optimization (RDO) is the best known mode decision method, but the corresponding extremely high computational complexity limits its application. This paper proposes an algorithm based on the visual perception model and Sobel operator edge detection model to quickly select the best inter mode from 16×16, 16×8, 8×16 and 8×8 just by using the original pixels. We further analyze and redesign the MB level pipeline structure, and give the optimized hardware structure of the encoder. We tested different sequences including cif, 720p and 1080p, and the experimental results show that the coding efficiency is comparable with the traditional RDO method. The proposed hardware structure saves fractional motion estimation (FME) by 60% in areas and reduces the processing time by 200 cycles. Our proposed mode decision architecture can support the real time processing of 1080P@30fps.","PeriodicalId":213330,"journal":{"name":"2012 IEEE International Conference on Imaging Systems and Techniques Proceedings","volume":"242 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A highly efficient mode decision algorithm and architecture for AVS HD Video Encoder\",\"authors\":\"Shuai Li, Chuang Zhu, Fei Wang, Huizhu Jia, Xiaodong Xie, Wen Gao\",\"doi\":\"10.1109/IST.2012.6295561\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In Advanced Audio Video coding Standard (AVS), the utilization of variable block size ranging from 16×16 to 8×8 in inter frame encoding improves the coding efficiency significantly compared with a fixed MB partition. Rate distortion optimization (RDO) is the best known mode decision method, but the corresponding extremely high computational complexity limits its application. This paper proposes an algorithm based on the visual perception model and Sobel operator edge detection model to quickly select the best inter mode from 16×16, 16×8, 8×16 and 8×8 just by using the original pixels. We further analyze and redesign the MB level pipeline structure, and give the optimized hardware structure of the encoder. We tested different sequences including cif, 720p and 1080p, and the experimental results show that the coding efficiency is comparable with the traditional RDO method. The proposed hardware structure saves fractional motion estimation (FME) by 60% in areas and reduces the processing time by 200 cycles. Our proposed mode decision architecture can support the real time processing of 1080P@30fps.\",\"PeriodicalId\":213330,\"journal\":{\"name\":\"2012 IEEE International Conference on Imaging Systems and Techniques Proceedings\",\"volume\":\"242 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-07-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 IEEE International Conference on Imaging Systems and Techniques Proceedings\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IST.2012.6295561\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE International Conference on Imaging Systems and Techniques Proceedings","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IST.2012.6295561","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A highly efficient mode decision algorithm and architecture for AVS HD Video Encoder
In Advanced Audio Video coding Standard (AVS), the utilization of variable block size ranging from 16×16 to 8×8 in inter frame encoding improves the coding efficiency significantly compared with a fixed MB partition. Rate distortion optimization (RDO) is the best known mode decision method, but the corresponding extremely high computational complexity limits its application. This paper proposes an algorithm based on the visual perception model and Sobel operator edge detection model to quickly select the best inter mode from 16×16, 16×8, 8×16 and 8×8 just by using the original pixels. We further analyze and redesign the MB level pipeline structure, and give the optimized hardware structure of the encoder. We tested different sequences including cif, 720p and 1080p, and the experimental results show that the coding efficiency is comparable with the traditional RDO method. The proposed hardware structure saves fractional motion estimation (FME) by 60% in areas and reduces the processing time by 200 cycles. Our proposed mode decision architecture can support the real time processing of 1080P@30fps.