Transforming online class recording into useful information repositories using NLP methods: An Empirical Study

Deepa Fernandes, R. Wagh
{"title":"Transforming online class recording into useful information repositories using NLP methods: An Empirical Study","authors":"Deepa Fernandes, R. Wagh","doi":"10.1109/ICAC3N56670.2022.10074025","DOIUrl":null,"url":null,"abstract":"Most educational institutions have adapted to the mode of online teaching which has resulted in an increase of online video recordings. Learner community can be benefited with the ability to retrieve required information from the online class recordings. In this paper, we propose a methodology for converting video transcript data into useful information repositories for the purpose of retrieving class transcripts relevant to user's information needs. We focus on the online video recording transcript data. We also discuss challenges in transcribing which are crucial to understand preliminary processing. Our dataset consists of transcripts from diverse subject domains deeper experimental insights. We use interactive transcripts obtained from ASR (automatic speech recognition) services and non-interactive human generated transcripts. State-of-the-art methods for keyword retrieval: Latent Dirichlet Topic Modelling (LDA), Term Frequency (TF.IDF) and Text Rank (graph based) are applied on the video transcript data. Further, cosine similarity metric is applied to obtain the similarity measure between the transcript documents and keywords.","PeriodicalId":342573,"journal":{"name":"2022 4th International Conference on Advances in Computing, Communication Control and Networking (ICAC3N)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 4th International Conference on Advances in Computing, Communication Control and Networking (ICAC3N)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAC3N56670.2022.10074025","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Most educational institutions have adapted to the mode of online teaching which has resulted in an increase of online video recordings. Learner community can be benefited with the ability to retrieve required information from the online class recordings. In this paper, we propose a methodology for converting video transcript data into useful information repositories for the purpose of retrieving class transcripts relevant to user's information needs. We focus on the online video recording transcript data. We also discuss challenges in transcribing which are crucial to understand preliminary processing. Our dataset consists of transcripts from diverse subject domains deeper experimental insights. We use interactive transcripts obtained from ASR (automatic speech recognition) services and non-interactive human generated transcripts. State-of-the-art methods for keyword retrieval: Latent Dirichlet Topic Modelling (LDA), Term Frequency (TF.IDF) and Text Rank (graph based) are applied on the video transcript data. Further, cosine similarity metric is applied to obtain the similarity measure between the transcript documents and keywords.
利用NLP方法将在线课堂记录转化为有用的信息库:一项实证研究
大多数教育机构已经适应了在线教学的模式,这导致了在线视频的增加。学习者社区可以从在线课堂录音中检索所需的信息。在本文中,我们提出了一种将视频记录数据转换为有用的信息库的方法,用于检索与用户信息需求相关的课堂记录。我们专注于在线视频记录文本数据。我们还讨论了转录中的挑战,这对理解初步处理至关重要。我们的数据集包括来自不同学科领域的文本,更深入的实验见解。我们使用从ASR(自动语音识别)服务获得的交互式文本和非交互式人工生成的文本。最先进的关键字检索方法:Latent Dirichlet Topic modeling (LDA), Term Frequency (TF.IDF)和Text Rank(基于图)应用于视频文本数据。在此基础上,利用余弦相似度度量获得文本文档与关键词之间的相似度度量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信