{"title":"Fuzzy clustering of lecture videos based on topic modeling","authors":"Subhasree Basu, Yi Yu, Roger Zimmermann","doi":"10.1109/CBMI.2016.7500264","DOIUrl":null,"url":null,"abstract":"Lecture videos constitute an important part of the e-learning paradigm. These online video-lectures contain multimedia materials aimed at explaining complex concepts in a more effective way. The videos are mostly grouped by their subjects. However, often there are overlaps between the subjects, e.g. Mathematics and Statistics. Hence, educational content-wise, some of the lecture videos can belong to more than one subject. When they are labeled by only one subject, students searching for the content of the lecture might miss some of these videos. To solve this problem, we aim to provide a clustering of these lecture videos based on their educational content rather than their titles so that such lectures will not be missed out based on the subject labels. Our novel algorithm uses topic modeling on video transcripts generated by automatic captions to extract the contents of these videos. We choose representative text documents for each of the clusters from the Wikipedia. Then we calculate a similarity between the topics extracted from the videos and those of the representative documents of the clusters. Finally we apply fuzzy clustering based on these similarity values and provide a lecture-content based clustering for these lecture videos. The initial results are plausible and confirm the effectiveness of the proposed scheme.","PeriodicalId":356608,"journal":{"name":"2016 14th International Workshop on Content-Based Multimedia Indexing (CBMI)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"24","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 14th International Workshop on Content-Based Multimedia Indexing (CBMI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CBMI.2016.7500264","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 24
Abstract
Lecture videos constitute an important part of the e-learning paradigm. These online video-lectures contain multimedia materials aimed at explaining complex concepts in a more effective way. The videos are mostly grouped by their subjects. However, often there are overlaps between the subjects, e.g. Mathematics and Statistics. Hence, educational content-wise, some of the lecture videos can belong to more than one subject. When they are labeled by only one subject, students searching for the content of the lecture might miss some of these videos. To solve this problem, we aim to provide a clustering of these lecture videos based on their educational content rather than their titles so that such lectures will not be missed out based on the subject labels. Our novel algorithm uses topic modeling on video transcripts generated by automatic captions to extract the contents of these videos. We choose representative text documents for each of the clusters from the Wikipedia. Then we calculate a similarity between the topics extracted from the videos and those of the representative documents of the clusters. Finally we apply fuzzy clustering based on these similarity values and provide a lecture-content based clustering for these lecture videos. The initial results are plausible and confirm the effectiveness of the proposed scheme.