{"title":"基于视频应答系统的视频稳定和基于文本的句子识别","authors":"Hendy Gunawan, Chin-Chuan Han, Chang-Hsing Lee","doi":"10.1109/CCOMS.2018.8463237","DOIUrl":null,"url":null,"abstract":"Recently, teaching using video clips is a simple and direct method instead of the text-based approaches. On the E-learning platform, students ask a question, and teachers answer the question using video clips especially the derivation of mathematical equations. Teachers write their answers down on an A4-size paper, use an APP to record the whole process, and upload the video clips to the platform. However, reading the sentences or equations from videos is very uncomfortable due to the paper movements when teacher wrote the sentences. In addition, teacher's hand frequently occludes the answers during the recording process. In the study, a video stabilizer with hand removal is proposed for comfortably reading. There are two key components: Video stabilization and sentence identification. Video stabilization estimates the smooth camera paths using SURF -based corner detection, hand part removal, image registration, and frame calibration. In sentence identification, sentences are segmented and extracted using contour detection and the convex hull generation algorithm. After the calibration process, users won't fell dizzy in reading sentences from calibrated and un-shaky videos. In addition, time stamps are marked at the beginning and the end of each sentence in the processed videos. When users click a specified sentence, a stabilized clip with good quality will be played.","PeriodicalId":405664,"journal":{"name":"2018 3rd International Conference on Computer and Communication Systems (ICCCS)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Video Stabilization and Text-Based Sentence Identification for Video-Based Answering Systems\",\"authors\":\"Hendy Gunawan, Chin-Chuan Han, Chang-Hsing Lee\",\"doi\":\"10.1109/CCOMS.2018.8463237\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recently, teaching using video clips is a simple and direct method instead of the text-based approaches. On the E-learning platform, students ask a question, and teachers answer the question using video clips especially the derivation of mathematical equations. Teachers write their answers down on an A4-size paper, use an APP to record the whole process, and upload the video clips to the platform. However, reading the sentences or equations from videos is very uncomfortable due to the paper movements when teacher wrote the sentences. In addition, teacher's hand frequently occludes the answers during the recording process. In the study, a video stabilizer with hand removal is proposed for comfortably reading. There are two key components: Video stabilization and sentence identification. Video stabilization estimates the smooth camera paths using SURF -based corner detection, hand part removal, image registration, and frame calibration. In sentence identification, sentences are segmented and extracted using contour detection and the convex hull generation algorithm. After the calibration process, users won't fell dizzy in reading sentences from calibrated and un-shaky videos. In addition, time stamps are marked at the beginning and the end of each sentence in the processed videos. When users click a specified sentence, a stabilized clip with good quality will be played.\",\"PeriodicalId\":405664,\"journal\":{\"name\":\"2018 3rd International Conference on Computer and Communication Systems (ICCCS)\",\"volume\":\"18 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 3rd International Conference on Computer and Communication Systems (ICCCS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CCOMS.2018.8463237\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 3rd International Conference on Computer and Communication Systems (ICCCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCOMS.2018.8463237","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Video Stabilization and Text-Based Sentence Identification for Video-Based Answering Systems
Recently, teaching using video clips is a simple and direct method instead of the text-based approaches. On the E-learning platform, students ask a question, and teachers answer the question using video clips especially the derivation of mathematical equations. Teachers write their answers down on an A4-size paper, use an APP to record the whole process, and upload the video clips to the platform. However, reading the sentences or equations from videos is very uncomfortable due to the paper movements when teacher wrote the sentences. In addition, teacher's hand frequently occludes the answers during the recording process. In the study, a video stabilizer with hand removal is proposed for comfortably reading. There are two key components: Video stabilization and sentence identification. Video stabilization estimates the smooth camera paths using SURF -based corner detection, hand part removal, image registration, and frame calibration. In sentence identification, sentences are segmented and extracted using contour detection and the convex hull generation algorithm. After the calibration process, users won't fell dizzy in reading sentences from calibrated and un-shaky videos. In addition, time stamps are marked at the beginning and the end of each sentence in the processed videos. When users click a specified sentence, a stabilized clip with good quality will be played.