Jiacong Chen , Qingyu Mao , Shuai Liu , Fanyang Meng , Shuangyan Yi , Yongsheng Liang
{"title":"Exploring Lottery Ticket Hypothesis in Neural Video Representations","authors":"Jiacong Chen , Qingyu Mao , Shuai Liu , Fanyang Meng , Shuangyan Yi , Yongsheng Liang","doi":"10.1016/j.displa.2025.102982","DOIUrl":null,"url":null,"abstract":"<div><div>Neural Video Representations (NVR) have emerged as a novel method for video compression, which encode video content into network parameters, transforming video compression into a model compression issue. In this paper, we attempt to introduce Lottery Ticket Hypothesis (LTH) into NVR, aiming to identify optimal sub-networks (<span><math><mrow><mi>i</mi><mo>.</mo><mi>e</mi><mo>.</mo></mrow></math></span>, winning tickets) capable of effective representing corresponding videos. Firstly, we validate the existence of winning tickets within NVR and reveal that these winning tickets exhibit non-transferability across different videos. To balance heavy searching cost with training efficiency, we devise an Identifying Winning Tickets (IWT) method tailored for NVR. Additionally, leveraging the non-transferability of winning tickets and video frames redundancy, we design a Progressive Cyclic Learning (PCL) strategy to accelerate the searching winning tickets phase. Finally, comprehensive experiments are conducted to evaluate the performance and general properties of winning tickets across various NVR architectures and videos. The results demonstrate that our proposed method significantly outperforms the original pruning approach, achieving performance gains of 1.81 dB, 4.19 dB, and 3.5 dB at 90 % sparsity in NeRV, E-NeRV, and HNeRV, respectively.</div></div>","PeriodicalId":50570,"journal":{"name":"Displays","volume":"87 ","pages":"Article 102982"},"PeriodicalIF":3.7000,"publicationDate":"2025-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Displays","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0141938225000198","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0
Abstract
Neural Video Representations (NVR) have emerged as a novel method for video compression, which encode video content into network parameters, transforming video compression into a model compression issue. In this paper, we attempt to introduce Lottery Ticket Hypothesis (LTH) into NVR, aiming to identify optimal sub-networks (, winning tickets) capable of effective representing corresponding videos. Firstly, we validate the existence of winning tickets within NVR and reveal that these winning tickets exhibit non-transferability across different videos. To balance heavy searching cost with training efficiency, we devise an Identifying Winning Tickets (IWT) method tailored for NVR. Additionally, leveraging the non-transferability of winning tickets and video frames redundancy, we design a Progressive Cyclic Learning (PCL) strategy to accelerate the searching winning tickets phase. Finally, comprehensive experiments are conducted to evaluate the performance and general properties of winning tickets across various NVR architectures and videos. The results demonstrate that our proposed method significantly outperforms the original pruning approach, achieving performance gains of 1.81 dB, 4.19 dB, and 3.5 dB at 90 % sparsity in NeRV, E-NeRV, and HNeRV, respectively.
期刊介绍:
Displays is the international journal covering the research and development of display technology, its effective presentation and perception of information, and applications and systems including display-human interface.
Technical papers on practical developments in Displays technology provide an effective channel to promote greater understanding and cross-fertilization across the diverse disciplines of the Displays community. Original research papers solving ergonomics issues at the display-human interface advance effective presentation of information. Tutorial papers covering fundamentals intended for display technologies and human factor engineers new to the field will also occasionally featured.