{"title":"基于自监督学习的无范例类增量动作识别","authors":"Chunyu Hou, Yonghong Hou, Jinyin Jiang, Gunel Abdullayeva","doi":"10.1016/j.imavis.2025.105544","DOIUrl":null,"url":null,"abstract":"<div><div>Class incremental action recognition faces the persistent challenge of balancing stability and plasticity, as models must learn new classes without forgetting previously acquired knowledge. Existing methods often rely on storing original samples, which significantly increases storage demands and risks overfitting to past data. To address these issues, an exemplar-free framework based on self-supervised learning and Pseudo-Feature Generation (PFG) mechanism is proposed. At each incremental step, PFG generates pseudo features for previously learned classes by using the mean and variance for each class. This framework enables effective joint training on new class data while keeping the feature extractor frozen, eliminating the need to store original data. It preserves past knowledge and dynamically adapts to new categories, striking a balance between stability and plasticity. Experiments on four extensively used datasets: UCF101, HMDB51, Kinetics, and SSV2 validate the effectiveness of the proposed framework.</div></div>","PeriodicalId":50374,"journal":{"name":"Image and Vision Computing","volume":"159 ","pages":"Article 105544"},"PeriodicalIF":4.2000,"publicationDate":"2025-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Exemplar-free class incremental action recognition based on self-supervised learning\",\"authors\":\"Chunyu Hou, Yonghong Hou, Jinyin Jiang, Gunel Abdullayeva\",\"doi\":\"10.1016/j.imavis.2025.105544\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Class incremental action recognition faces the persistent challenge of balancing stability and plasticity, as models must learn new classes without forgetting previously acquired knowledge. Existing methods often rely on storing original samples, which significantly increases storage demands and risks overfitting to past data. To address these issues, an exemplar-free framework based on self-supervised learning and Pseudo-Feature Generation (PFG) mechanism is proposed. At each incremental step, PFG generates pseudo features for previously learned classes by using the mean and variance for each class. This framework enables effective joint training on new class data while keeping the feature extractor frozen, eliminating the need to store original data. It preserves past knowledge and dynamically adapts to new categories, striking a balance between stability and plasticity. Experiments on four extensively used datasets: UCF101, HMDB51, Kinetics, and SSV2 validate the effectiveness of the proposed framework.</div></div>\",\"PeriodicalId\":50374,\"journal\":{\"name\":\"Image and Vision Computing\",\"volume\":\"159 \",\"pages\":\"Article 105544\"},\"PeriodicalIF\":4.2000,\"publicationDate\":\"2025-04-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Image and Vision Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0262885625001325\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Image and Vision Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0262885625001325","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Exemplar-free class incremental action recognition based on self-supervised learning
Class incremental action recognition faces the persistent challenge of balancing stability and plasticity, as models must learn new classes without forgetting previously acquired knowledge. Existing methods often rely on storing original samples, which significantly increases storage demands and risks overfitting to past data. To address these issues, an exemplar-free framework based on self-supervised learning and Pseudo-Feature Generation (PFG) mechanism is proposed. At each incremental step, PFG generates pseudo features for previously learned classes by using the mean and variance for each class. This framework enables effective joint training on new class data while keeping the feature extractor frozen, eliminating the need to store original data. It preserves past knowledge and dynamically adapts to new categories, striking a balance between stability and plasticity. Experiments on four extensively used datasets: UCF101, HMDB51, Kinetics, and SSV2 validate the effectiveness of the proposed framework.
期刊介绍:
Image and Vision Computing has as a primary aim the provision of an effective medium of interchange for the results of high quality theoretical and applied research fundamental to all aspects of image interpretation and computer vision. The journal publishes work that proposes new image interpretation and computer vision methodology or addresses the application of such methods to real world scenes. It seeks to strengthen a deeper understanding in the discipline by encouraging the quantitative comparison and performance evaluation of the proposed methodology. The coverage includes: image interpretation, scene modelling, object recognition and tracking, shape analysis, monitoring and surveillance, active vision and robotic systems, SLAM, biologically-inspired computer vision, motion analysis, stereo vision, document image understanding, character and handwritten text recognition, face and gesture recognition, biometrics, vision-based human-computer interaction, human activity and behavior understanding, data fusion from multiple sensor inputs, image databases.