Jian Li , Chen Du , Yuliang Zhao , Peng Shan , Xingqi Wang , Huawei Zhang , Ying Wang
{"title":"多时相图像融合增强卷积神经网络识别9种常见的小鼠动作","authors":"Jian Li , Chen Du , Yuliang Zhao , Peng Shan , Xingqi Wang , Huawei Zhang , Ying Wang","doi":"10.1016/j.knosys.2025.113628","DOIUrl":null,"url":null,"abstract":"<div><div>The study of complex behaviors and social interactions necessitates precise and efficient methodologies for the recognition and tracking of animal actions. However, existing methods such as depth perception and wearable devices for mice behavior recognition pose risks of physical harm to the subjects and exhibit limited applicability across species with low precision. To redress these deficiencies, this paper proposes the multi-temporal image fusion empowered Convolutional Neural Networks (CNN), aimed at achieving accurate and efficient recognition of nine common mice actions. In this study, we employ mice at various time points as subjects and employ a multi-temporal approach to process image sequences, integrating various frame difference extraction techniques to address the limitations inherent in single-frame prediction for capturing dynamic changes in actions. Subsequently, we utilize a Deformable Convolution Network (DCN) in conjunction with multi-stacked residual units to enhance the feature extraction capacity of the CNN, particularly focusing on mice action contours, while mitigating the risk of overfitting. Furthermore, we investigate the efficacy of fused images derived from varying frame differences in representing the nine actions, culminating in the establishment of a robust mice action recognition model through ensemble learning techniques. Experimental findings demonstrate an impressive precision rate of 92.9% in recognizing mice actions. The proposed method effectively eliminates background interference and exhibits superior generalization and adaptability properties.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"320 ","pages":"Article 113628"},"PeriodicalIF":7.2000,"publicationDate":"2025-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-temporal image fusion empowered convolutional neural networks for recognition of 9 common mice actions\",\"authors\":\"Jian Li , Chen Du , Yuliang Zhao , Peng Shan , Xingqi Wang , Huawei Zhang , Ying Wang\",\"doi\":\"10.1016/j.knosys.2025.113628\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The study of complex behaviors and social interactions necessitates precise and efficient methodologies for the recognition and tracking of animal actions. However, existing methods such as depth perception and wearable devices for mice behavior recognition pose risks of physical harm to the subjects and exhibit limited applicability across species with low precision. To redress these deficiencies, this paper proposes the multi-temporal image fusion empowered Convolutional Neural Networks (CNN), aimed at achieving accurate and efficient recognition of nine common mice actions. In this study, we employ mice at various time points as subjects and employ a multi-temporal approach to process image sequences, integrating various frame difference extraction techniques to address the limitations inherent in single-frame prediction for capturing dynamic changes in actions. Subsequently, we utilize a Deformable Convolution Network (DCN) in conjunction with multi-stacked residual units to enhance the feature extraction capacity of the CNN, particularly focusing on mice action contours, while mitigating the risk of overfitting. Furthermore, we investigate the efficacy of fused images derived from varying frame differences in representing the nine actions, culminating in the establishment of a robust mice action recognition model through ensemble learning techniques. Experimental findings demonstrate an impressive precision rate of 92.9% in recognizing mice actions. The proposed method effectively eliminates background interference and exhibits superior generalization and adaptability properties.</div></div>\",\"PeriodicalId\":49939,\"journal\":{\"name\":\"Knowledge-Based Systems\",\"volume\":\"320 \",\"pages\":\"Article 113628\"},\"PeriodicalIF\":7.2000,\"publicationDate\":\"2025-05-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Knowledge-Based Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0950705125006744\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125006744","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Multi-temporal image fusion empowered convolutional neural networks for recognition of 9 common mice actions
The study of complex behaviors and social interactions necessitates precise and efficient methodologies for the recognition and tracking of animal actions. However, existing methods such as depth perception and wearable devices for mice behavior recognition pose risks of physical harm to the subjects and exhibit limited applicability across species with low precision. To redress these deficiencies, this paper proposes the multi-temporal image fusion empowered Convolutional Neural Networks (CNN), aimed at achieving accurate and efficient recognition of nine common mice actions. In this study, we employ mice at various time points as subjects and employ a multi-temporal approach to process image sequences, integrating various frame difference extraction techniques to address the limitations inherent in single-frame prediction for capturing dynamic changes in actions. Subsequently, we utilize a Deformable Convolution Network (DCN) in conjunction with multi-stacked residual units to enhance the feature extraction capacity of the CNN, particularly focusing on mice action contours, while mitigating the risk of overfitting. Furthermore, we investigate the efficacy of fused images derived from varying frame differences in representing the nine actions, culminating in the establishment of a robust mice action recognition model through ensemble learning techniques. Experimental findings demonstrate an impressive precision rate of 92.9% in recognizing mice actions. The proposed method effectively eliminates background interference and exhibits superior generalization and adaptability properties.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.