{"title":"Analysis of Punctuation Prediction Models for Automated Transcript Generation in MOOC Videos","authors":"Bhrigu Garg, Anika","doi":"10.1109/MITE.2018.8747063","DOIUrl":null,"url":null,"abstract":"In today’s e-learning based educational scenarios, lot of efforts in terms of time and manpower are required by the MOOC instructors for the generation of transcripts. This research study is focused on the efficient and correct punctuation prediction in the process of automated generation of these transcripts. Various deep learning based and other commonly used punctuation prediction techniques and models existing in the literature have been identified and analyzed for the educational domain videos. The hybrid model of Convolution Neural Networks and Bidirectional Long Short Term Memory ensembled with the acoustic model outperformed other models. It yielded an accuracy of 93.56 percent, recall of 56.15 percent and precision of 63.69 percent. This study also proposed a generalized architecture for efficient punctuation prediction.","PeriodicalId":426754,"journal":{"name":"2018 IEEE 6th International Conference on MOOCs, Innovation and Technology in Education (MITE)","volume":"80 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 6th International Conference on MOOCs, Innovation and Technology in Education (MITE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MITE.2018.8747063","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
In today’s e-learning based educational scenarios, lot of efforts in terms of time and manpower are required by the MOOC instructors for the generation of transcripts. This research study is focused on the efficient and correct punctuation prediction in the process of automated generation of these transcripts. Various deep learning based and other commonly used punctuation prediction techniques and models existing in the literature have been identified and analyzed for the educational domain videos. The hybrid model of Convolution Neural Networks and Bidirectional Long Short Term Memory ensembled with the acoustic model outperformed other models. It yielded an accuracy of 93.56 percent, recall of 56.15 percent and precision of 63.69 percent. This study also proposed a generalized architecture for efficient punctuation prediction.