Sehrish Khursheed, S. Khalid, Farzana Riaz, Tehmina Shehryar
{"title":"A Review of Activity Detection Methods Used in Videos Streaming","authors":"Sehrish Khursheed, S. Khalid, Farzana Riaz, Tehmina Shehryar","doi":"10.1109/ICRAI57502.2023.10089567","DOIUrl":null,"url":null,"abstract":"Activity detection in videos embraces many recent sub-research directions, such as Action Recognition (what type of activity is being performed), Temporal Action Detection (that states the time of action occurred in a large video), Spatio-Temporal Action Detection (localizing activities both in space and time). New advances in convolutional Neural Network Architectures and increased computing resources have made it possible. 3 Dimensional CNNs outperform 2D CNNs in balancing both spatial and temporal information in activity recognition while working with videos. Various approaches which incorporate these networks have been discussed in the paper. Locating a specific action in a video is an advanced and more complex task. Our focus of the paper is to give a summary of methods and advances used in the domain of action recognition and action localization.","PeriodicalId":447565,"journal":{"name":"2023 International Conference on Robotics and Automation in Industry (ICRAI)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Robotics and Automation in Industry (ICRAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICRAI57502.2023.10089567","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Activity detection in videos embraces many recent sub-research directions, such as Action Recognition (what type of activity is being performed), Temporal Action Detection (that states the time of action occurred in a large video), Spatio-Temporal Action Detection (localizing activities both in space and time). New advances in convolutional Neural Network Architectures and increased computing resources have made it possible. 3 Dimensional CNNs outperform 2D CNNs in balancing both spatial and temporal information in activity recognition while working with videos. Various approaches which incorporate these networks have been discussed in the paper. Locating a specific action in a video is an advanced and more complex task. Our focus of the paper is to give a summary of methods and advances used in the domain of action recognition and action localization.