Russo Mohammad Ashraf Uddin, Laksono Kurnianggoro, K. Jo
{"title":"结合深度学习模型和迁移学习的体育视频分类","authors":"Russo Mohammad Ashraf Uddin, Laksono Kurnianggoro, K. Jo","doi":"10.1109/ECACE.2019.8679371","DOIUrl":null,"url":null,"abstract":"Sports classification has considerable importance for digital content archiving in broadcasting companies. It is also a subdivision of human action recognition, which further contributes to understand the context of video scenes. In this work, deep neural networks are used, combining convolutional and recurrent networks to classify 15 individual sports classes. The sports dataset is hand-crafted to focus on sports action-based classification. CNN extracted features are combined with temporal information from RNN to formulate the general model to solve the problem. Later, transfer learning is applied with the VGG-16 model which was able to achieve 94% and 92 % test accuracy for 10 and 15 sports classes respectively.","PeriodicalId":226060,"journal":{"name":"2019 International Conference on Electrical, Computer and Communication Engineering (ECCE)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"30","resultStr":"{\"title\":\"Classification of sports videos with combination of deep learning models and transfer learning\",\"authors\":\"Russo Mohammad Ashraf Uddin, Laksono Kurnianggoro, K. Jo\",\"doi\":\"10.1109/ECACE.2019.8679371\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Sports classification has considerable importance for digital content archiving in broadcasting companies. It is also a subdivision of human action recognition, which further contributes to understand the context of video scenes. In this work, deep neural networks are used, combining convolutional and recurrent networks to classify 15 individual sports classes. The sports dataset is hand-crafted to focus on sports action-based classification. CNN extracted features are combined with temporal information from RNN to formulate the general model to solve the problem. Later, transfer learning is applied with the VGG-16 model which was able to achieve 94% and 92 % test accuracy for 10 and 15 sports classes respectively.\",\"PeriodicalId\":226060,\"journal\":{\"name\":\"2019 International Conference on Electrical, Computer and Communication Engineering (ECCE)\",\"volume\":\"14 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"30\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 International Conference on Electrical, Computer and Communication Engineering (ECCE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ECACE.2019.8679371\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Electrical, Computer and Communication Engineering (ECCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ECACE.2019.8679371","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Classification of sports videos with combination of deep learning models and transfer learning
Sports classification has considerable importance for digital content archiving in broadcasting companies. It is also a subdivision of human action recognition, which further contributes to understand the context of video scenes. In this work, deep neural networks are used, combining convolutional and recurrent networks to classify 15 individual sports classes. The sports dataset is hand-crafted to focus on sports action-based classification. CNN extracted features are combined with temporal information from RNN to formulate the general model to solve the problem. Later, transfer learning is applied with the VGG-16 model which was able to achieve 94% and 92 % test accuracy for 10 and 15 sports classes respectively.