Sehrish Khursheed, S. Khalid, Farzana Riaz, Tehmina Shehryar
{"title":"视频流中的活动检测方法综述","authors":"Sehrish Khursheed, S. Khalid, Farzana Riaz, Tehmina Shehryar","doi":"10.1109/ICRAI57502.2023.10089567","DOIUrl":null,"url":null,"abstract":"Activity detection in videos embraces many recent sub-research directions, such as Action Recognition (what type of activity is being performed), Temporal Action Detection (that states the time of action occurred in a large video), Spatio-Temporal Action Detection (localizing activities both in space and time). New advances in convolutional Neural Network Architectures and increased computing resources have made it possible. 3 Dimensional CNNs outperform 2D CNNs in balancing both spatial and temporal information in activity recognition while working with videos. Various approaches which incorporate these networks have been discussed in the paper. Locating a specific action in a video is an advanced and more complex task. Our focus of the paper is to give a summary of methods and advances used in the domain of action recognition and action localization.","PeriodicalId":447565,"journal":{"name":"2023 International Conference on Robotics and Automation in Industry (ICRAI)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Review of Activity Detection Methods Used in Videos Streaming\",\"authors\":\"Sehrish Khursheed, S. Khalid, Farzana Riaz, Tehmina Shehryar\",\"doi\":\"10.1109/ICRAI57502.2023.10089567\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Activity detection in videos embraces many recent sub-research directions, such as Action Recognition (what type of activity is being performed), Temporal Action Detection (that states the time of action occurred in a large video), Spatio-Temporal Action Detection (localizing activities both in space and time). New advances in convolutional Neural Network Architectures and increased computing resources have made it possible. 3 Dimensional CNNs outperform 2D CNNs in balancing both spatial and temporal information in activity recognition while working with videos. Various approaches which incorporate these networks have been discussed in the paper. Locating a specific action in a video is an advanced and more complex task. Our focus of the paper is to give a summary of methods and advances used in the domain of action recognition and action localization.\",\"PeriodicalId\":447565,\"journal\":{\"name\":\"2023 International Conference on Robotics and Automation in Industry (ICRAI)\",\"volume\":\"40 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 International Conference on Robotics and Automation in Industry (ICRAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICRAI57502.2023.10089567\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Robotics and Automation in Industry (ICRAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICRAI57502.2023.10089567","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Review of Activity Detection Methods Used in Videos Streaming
Activity detection in videos embraces many recent sub-research directions, such as Action Recognition (what type of activity is being performed), Temporal Action Detection (that states the time of action occurred in a large video), Spatio-Temporal Action Detection (localizing activities both in space and time). New advances in convolutional Neural Network Architectures and increased computing resources have made it possible. 3 Dimensional CNNs outperform 2D CNNs in balancing both spatial and temporal information in activity recognition while working with videos. Various approaches which incorporate these networks have been discussed in the paper. Locating a specific action in a video is an advanced and more complex task. Our focus of the paper is to give a summary of methods and advances used in the domain of action recognition and action localization.