{"title":"Deep Learning Based Motion Target Detection Algorithm","authors":"Xizhou Wang","doi":"10.1109/ICPECA60615.2024.10471116","DOIUrl":null,"url":null,"abstract":"With the dramatic growth of video data, the storage and computational resources required to process this huge amount of data have increased significantly. In order to cope with this challenge, it is necessary to extract the key information in the video in a more intelligent and efficient way, while filtering out a large amount of redundant content. In this paper, the traditional CNN model and Transformer model are constructed respectively using video frames of car motion process from video viewpoint as a dataset. The model performance is improved by advanced data preprocessing operations. The bilateral filtering technique is introduced in this study, aiming to improve the image quality and enhance the image processing effect through denoising operations, making it more applicable to the subsequent processing steps. Finally, the Transformer model is verified by the model and the recognition accuracy of the Transformer model is up to about 90%.","PeriodicalId":518671,"journal":{"name":"2024 IEEE 4th International Conference on Power, Electronics and Computer Applications (ICPECA)","volume":"13 13","pages":"943-948"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2024 IEEE 4th International Conference on Power, Electronics and Computer Applications (ICPECA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICPECA60615.2024.10471116","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
With the dramatic growth of video data, the storage and computational resources required to process this huge amount of data have increased significantly. In order to cope with this challenge, it is necessary to extract the key information in the video in a more intelligent and efficient way, while filtering out a large amount of redundant content. In this paper, the traditional CNN model and Transformer model are constructed respectively using video frames of car motion process from video viewpoint as a dataset. The model performance is improved by advanced data preprocessing operations. The bilateral filtering technique is introduced in this study, aiming to improve the image quality and enhance the image processing effect through denoising operations, making it more applicable to the subsequent processing steps. Finally, the Transformer model is verified by the model and the recognition accuracy of the Transformer model is up to about 90%.