P. Subramanian, D. Blasiak, Jiandong Shen, W. Chan
{"title":"扩展H.263框架中的编码器优化","authors":"P. Subramanian, D. Blasiak, Jiandong Shen, W. Chan","doi":"10.1109/ACSSC.1997.680147","DOIUrl":null,"url":null,"abstract":"In a motion-compensated discrete-cosine-transform (MC/DCT) coding framework similar to the H.263-standard video coder, we explore the performance gain furnished by rate-distortion (R-D) optimization. We show that exact global R-D optimization is a very complex task. We seek good performance-complexity tradeoffs instead. We explore an encoding scheme that has several complexity reduction features: noniterative Lagrangian motion estimation, optimized table-based estimators of DCT coding bit rate and distortion performance, and bottom-up propagation of block matching results. Our scheme is compared with the advanced prediction mode (APM) of H.263, as instrumented by the popular Telenor test model. For typical QCIF video test sequences, our scheme furnishes PSNR gains ranging from 0.35 to 1.1 dB. Visual quality improvement is highly palpable. The complexity of our scheme is roughly 30% higher than the Telenor implementation.","PeriodicalId":240431,"journal":{"name":"Conference Record of the Thirty-First Asilomar Conference on Signals, Systems and Computers (Cat. No.97CB36136)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1997-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Encoder optimization in an extended H.263 framework\",\"authors\":\"P. Subramanian, D. Blasiak, Jiandong Shen, W. Chan\",\"doi\":\"10.1109/ACSSC.1997.680147\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In a motion-compensated discrete-cosine-transform (MC/DCT) coding framework similar to the H.263-standard video coder, we explore the performance gain furnished by rate-distortion (R-D) optimization. We show that exact global R-D optimization is a very complex task. We seek good performance-complexity tradeoffs instead. We explore an encoding scheme that has several complexity reduction features: noniterative Lagrangian motion estimation, optimized table-based estimators of DCT coding bit rate and distortion performance, and bottom-up propagation of block matching results. Our scheme is compared with the advanced prediction mode (APM) of H.263, as instrumented by the popular Telenor test model. For typical QCIF video test sequences, our scheme furnishes PSNR gains ranging from 0.35 to 1.1 dB. Visual quality improvement is highly palpable. The complexity of our scheme is roughly 30% higher than the Telenor implementation.\",\"PeriodicalId\":240431,\"journal\":{\"name\":\"Conference Record of the Thirty-First Asilomar Conference on Signals, Systems and Computers (Cat. No.97CB36136)\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1997-11-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Conference Record of the Thirty-First Asilomar Conference on Signals, Systems and Computers (Cat. No.97CB36136)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ACSSC.1997.680147\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference Record of the Thirty-First Asilomar Conference on Signals, Systems and Computers (Cat. No.97CB36136)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACSSC.1997.680147","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Encoder optimization in an extended H.263 framework
In a motion-compensated discrete-cosine-transform (MC/DCT) coding framework similar to the H.263-standard video coder, we explore the performance gain furnished by rate-distortion (R-D) optimization. We show that exact global R-D optimization is a very complex task. We seek good performance-complexity tradeoffs instead. We explore an encoding scheme that has several complexity reduction features: noniterative Lagrangian motion estimation, optimized table-based estimators of DCT coding bit rate and distortion performance, and bottom-up propagation of block matching results. Our scheme is compared with the advanced prediction mode (APM) of H.263, as instrumented by the popular Telenor test model. For typical QCIF video test sequences, our scheme furnishes PSNR gains ranging from 0.35 to 1.1 dB. Visual quality improvement is highly palpable. The complexity of our scheme is roughly 30% higher than the Telenor implementation.