{"title":"用于视频事件检测的自定义和迁移学习CNN架构的快速回顾和性能分析","authors":"Susmitha Alamuru, S. Jain","doi":"10.1109/ICDSIS55133.2022.9915866","DOIUrl":null,"url":null,"abstract":"Event/Action detection in videos is a growing research interest as it has numerous applications such as patient monitoring in health care, anomaly detection in surveillance systems, retrieval of video, human and computer interactions, gaming environment, entertainment environment etc. This is all because of one and only Deep learning due to its capability to outperform the conventional hand-crafted feature extraction algorithms. Transfer learning is significant in training deep neural networks with small datasets. The objective of this paper is to compare popular pretrained CNN models (in Top-1 accuracy) with custom CNN, built from scratch to detect human actions in UCF11 dataset.","PeriodicalId":178360,"journal":{"name":"2022 IEEE International Conference on Data Science and Information System (ICDSIS)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Quick Review and Performance Analysis of Custom and Transfer Learning CNN Architectures for Event Detection in Videos\",\"authors\":\"Susmitha Alamuru, S. Jain\",\"doi\":\"10.1109/ICDSIS55133.2022.9915866\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Event/Action detection in videos is a growing research interest as it has numerous applications such as patient monitoring in health care, anomaly detection in surveillance systems, retrieval of video, human and computer interactions, gaming environment, entertainment environment etc. This is all because of one and only Deep learning due to its capability to outperform the conventional hand-crafted feature extraction algorithms. Transfer learning is significant in training deep neural networks with small datasets. The objective of this paper is to compare popular pretrained CNN models (in Top-1 accuracy) with custom CNN, built from scratch to detect human actions in UCF11 dataset.\",\"PeriodicalId\":178360,\"journal\":{\"name\":\"2022 IEEE International Conference on Data Science and Information System (ICDSIS)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference on Data Science and Information System (ICDSIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDSIS55133.2022.9915866\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Data Science and Information System (ICDSIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDSIS55133.2022.9915866","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Quick Review and Performance Analysis of Custom and Transfer Learning CNN Architectures for Event Detection in Videos
Event/Action detection in videos is a growing research interest as it has numerous applications such as patient monitoring in health care, anomaly detection in surveillance systems, retrieval of video, human and computer interactions, gaming environment, entertainment environment etc. This is all because of one and only Deep learning due to its capability to outperform the conventional hand-crafted feature extraction algorithms. Transfer learning is significant in training deep neural networks with small datasets. The objective of this paper is to compare popular pretrained CNN models (in Top-1 accuracy) with custom CNN, built from scratch to detect human actions in UCF11 dataset.