Pei-Jun Lee, Ho-Ju Lin, Shun-Hsing Huang, Wen-June Wang
{"title":"A fast mode determination algorithm for multi-view video coding","authors":"Pei-Jun Lee, Ho-Ju Lin, Shun-Hsing Huang, Wen-June Wang","doi":"10.1109/ICCE.2011.5722811","DOIUrl":null,"url":null,"abstract":"Multi-view video coding by using the hierarchical B picture structure utilizes intra-view and inter-view predictions to reduce redundant information. The best coding mode is determined by exhaustively searching over all possible partition modes using rate distortion comparison in motion estimation for each macroblock. However, there is a high computational complexity in exhaustive searching. This study analyzes the statistics of coding mode distribution in the inter- and intra-view and then a fast mode decision algorithm is proposed to select a proper coding mode, in which the probability density function of RD cost and motion homogeneity degree are set as the multi-threshold in the algorithm to achieve the mode determination. For multi-view, the mode correlation of the neighboring views is utilized to select the coding mode from interview or the intra-view prediction. Experimental results show that the encoding time for the basic view and multi-view is saved up to 75% and 65%, respectively, and the quality of the reconstruction video is almost remained.","PeriodicalId":256368,"journal":{"name":"2011 IEEE International Conference on Consumer Electronics (ICCE)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE International Conference on Consumer Electronics (ICCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCE.2011.5722811","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Multi-view video coding by using the hierarchical B picture structure utilizes intra-view and inter-view predictions to reduce redundant information. The best coding mode is determined by exhaustively searching over all possible partition modes using rate distortion comparison in motion estimation for each macroblock. However, there is a high computational complexity in exhaustive searching. This study analyzes the statistics of coding mode distribution in the inter- and intra-view and then a fast mode decision algorithm is proposed to select a proper coding mode, in which the probability density function of RD cost and motion homogeneity degree are set as the multi-threshold in the algorithm to achieve the mode determination. For multi-view, the mode correlation of the neighboring views is utilized to select the coding mode from interview or the intra-view prediction. Experimental results show that the encoding time for the basic view and multi-view is saved up to 75% and 65%, respectively, and the quality of the reconstruction video is almost remained.