Video Stabilization and Text-Based Sentence Identification for Video-Based Answering Systems

Hendy Gunawan, Chin-Chuan Han, Chang-Hsing Lee
{"title":"Video Stabilization and Text-Based Sentence Identification for Video-Based Answering Systems","authors":"Hendy Gunawan, Chin-Chuan Han, Chang-Hsing Lee","doi":"10.1109/CCOMS.2018.8463237","DOIUrl":null,"url":null,"abstract":"Recently, teaching using video clips is a simple and direct method instead of the text-based approaches. On the E-learning platform, students ask a question, and teachers answer the question using video clips especially the derivation of mathematical equations. Teachers write their answers down on an A4-size paper, use an APP to record the whole process, and upload the video clips to the platform. However, reading the sentences or equations from videos is very uncomfortable due to the paper movements when teacher wrote the sentences. In addition, teacher's hand frequently occludes the answers during the recording process. In the study, a video stabilizer with hand removal is proposed for comfortably reading. There are two key components: Video stabilization and sentence identification. Video stabilization estimates the smooth camera paths using SURF -based corner detection, hand part removal, image registration, and frame calibration. In sentence identification, sentences are segmented and extracted using contour detection and the convex hull generation algorithm. After the calibration process, users won't fell dizzy in reading sentences from calibrated and un-shaky videos. In addition, time stamps are marked at the beginning and the end of each sentence in the processed videos. When users click a specified sentence, a stabilized clip with good quality will be played.","PeriodicalId":405664,"journal":{"name":"2018 3rd International Conference on Computer and Communication Systems (ICCCS)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 3rd International Conference on Computer and Communication Systems (ICCCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCOMS.2018.8463237","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Recently, teaching using video clips is a simple and direct method instead of the text-based approaches. On the E-learning platform, students ask a question, and teachers answer the question using video clips especially the derivation of mathematical equations. Teachers write their answers down on an A4-size paper, use an APP to record the whole process, and upload the video clips to the platform. However, reading the sentences or equations from videos is very uncomfortable due to the paper movements when teacher wrote the sentences. In addition, teacher's hand frequently occludes the answers during the recording process. In the study, a video stabilizer with hand removal is proposed for comfortably reading. There are two key components: Video stabilization and sentence identification. Video stabilization estimates the smooth camera paths using SURF -based corner detection, hand part removal, image registration, and frame calibration. In sentence identification, sentences are segmented and extracted using contour detection and the convex hull generation algorithm. After the calibration process, users won't fell dizzy in reading sentences from calibrated and un-shaky videos. In addition, time stamps are marked at the beginning and the end of each sentence in the processed videos. When users click a specified sentence, a stabilized clip with good quality will be played.
基于视频应答系统的视频稳定和基于文本的句子识别
最近,视频教学取代了基于文本的教学方法,成为一种简单直接的教学方法。在E-learning平台上,学生提出问题,教师通过视频片段进行回答,特别是数学方程的推导。教师将答案写在a4纸大小的纸上,用APP记录整个过程,并将视频片段上传到平台。然而,阅读视频中的句子或方程是非常不舒服的,因为老师写句子时纸张会移动。此外,在录音过程中,老师的手经常遮挡答案。在研究中,提出了一种去除手的视频稳定器,用于舒适地阅读。有两个关键组成部分:视频稳定和句子识别。视频稳定使用基于SURF的角点检测、手部去除、图像配准和帧校准来估计平滑的相机路径。在句子识别中,使用轮廓检测和凸包生成算法对句子进行分割和提取。经过校准过程后,用户在阅读校准和不抖动视频中的句子时不会感到头晕。此外,在处理后的视频中,每个句子的开头和结尾都标记了时间戳。当用户点击指定的句子时,将播放质量较好的稳定片段。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信