{"title":"Action segmentation based on Bag-of-Visual-Words models","authors":"Guan-Jhih Chen, I-Cheng Chang, Hung-Yu Yeh","doi":"10.1109/UMEDIA.2017.8074105","DOIUrl":null,"url":null,"abstract":"In this paper, we propose an action segmentation system which can segment a video into a number of video shots according to the human action. At the same time, the proposed system can also recognize the action types of those segmented shots. Our system estimates the action number of the video by using the distribution of visual word vectors without any predefined information and automatically adjusts the window step size instead of manual selection. In the proposed approach, the bag of visual word (BOVW) together with self-growing and self-organized neural gas (SGONG) are used to construct the action word dictionary to recognize the video shots. The experimental results show that the performance of our segmentation method can perform well for the test videos.","PeriodicalId":440018,"journal":{"name":"2017 10th International Conference on Ubi-media Computing and Workshops (Ubi-Media)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 10th International Conference on Ubi-media Computing and Workshops (Ubi-Media)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/UMEDIA.2017.8074105","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
In this paper, we propose an action segmentation system which can segment a video into a number of video shots according to the human action. At the same time, the proposed system can also recognize the action types of those segmented shots. Our system estimates the action number of the video by using the distribution of visual word vectors without any predefined information and automatically adjusts the window step size instead of manual selection. In the proposed approach, the bag of visual word (BOVW) together with self-growing and self-organized neural gas (SGONG) are used to construct the action word dictionary to recognize the video shots. The experimental results show that the performance of our segmentation method can perform well for the test videos.