{"title":"MOOC视频自动生成文本的标点符号预测模型分析","authors":"Bhrigu Garg, Anika","doi":"10.1109/MITE.2018.8747063","DOIUrl":null,"url":null,"abstract":"In today’s e-learning based educational scenarios, lot of efforts in terms of time and manpower are required by the MOOC instructors for the generation of transcripts. This research study is focused on the efficient and correct punctuation prediction in the process of automated generation of these transcripts. Various deep learning based and other commonly used punctuation prediction techniques and models existing in the literature have been identified and analyzed for the educational domain videos. The hybrid model of Convolution Neural Networks and Bidirectional Long Short Term Memory ensembled with the acoustic model outperformed other models. It yielded an accuracy of 93.56 percent, recall of 56.15 percent and precision of 63.69 percent. This study also proposed a generalized architecture for efficient punctuation prediction.","PeriodicalId":426754,"journal":{"name":"2018 IEEE 6th International Conference on MOOCs, Innovation and Technology in Education (MITE)","volume":"80 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Analysis of Punctuation Prediction Models for Automated Transcript Generation in MOOC Videos\",\"authors\":\"Bhrigu Garg, Anika\",\"doi\":\"10.1109/MITE.2018.8747063\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In today’s e-learning based educational scenarios, lot of efforts in terms of time and manpower are required by the MOOC instructors for the generation of transcripts. This research study is focused on the efficient and correct punctuation prediction in the process of automated generation of these transcripts. Various deep learning based and other commonly used punctuation prediction techniques and models existing in the literature have been identified and analyzed for the educational domain videos. The hybrid model of Convolution Neural Networks and Bidirectional Long Short Term Memory ensembled with the acoustic model outperformed other models. It yielded an accuracy of 93.56 percent, recall of 56.15 percent and precision of 63.69 percent. This study also proposed a generalized architecture for efficient punctuation prediction.\",\"PeriodicalId\":426754,\"journal\":{\"name\":\"2018 IEEE 6th International Conference on MOOCs, Innovation and Technology in Education (MITE)\",\"volume\":\"80 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE 6th International Conference on MOOCs, Innovation and Technology in Education (MITE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MITE.2018.8747063\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 6th International Conference on MOOCs, Innovation and Technology in Education (MITE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MITE.2018.8747063","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Analysis of Punctuation Prediction Models for Automated Transcript Generation in MOOC Videos
In today’s e-learning based educational scenarios, lot of efforts in terms of time and manpower are required by the MOOC instructors for the generation of transcripts. This research study is focused on the efficient and correct punctuation prediction in the process of automated generation of these transcripts. Various deep learning based and other commonly used punctuation prediction techniques and models existing in the literature have been identified and analyzed for the educational domain videos. The hybrid model of Convolution Neural Networks and Bidirectional Long Short Term Memory ensembled with the acoustic model outperformed other models. It yielded an accuracy of 93.56 percent, recall of 56.15 percent and precision of 63.69 percent. This study also proposed a generalized architecture for efficient punctuation prediction.