{"title":"教育4.0中视频文本识别的几个方面","authors":"G. Buddhawar, K. Jariwala, C. Chattopadhyay","doi":"10.1109/ETI4.051663.2021.9619427","DOIUrl":null,"url":null,"abstract":"Optical character recognition in the video document or videos from is a challenging task because flipping in the unconstrained environment and at any speed can cause difficulties specially in selecting a frame that contains the required image (OBI) for readability. In this paper, we stress on the problem of recognizing the readable and whole frame by selecting the frames with optical flow. Fundamentally, optical flow is used here for segmentation of the required area from the frame. Due to the unavailability of such a dataset, we have also created a dataset of video document containing book-flipping videos. Experiments are executed on the sample video documents recorded using mobile camera. The use of optical flow on video document gives proper track of motion as it detects the boundary of the document. Here in this case a normal Indian publication textbook is used. The need to develop this technique is to help various educational reforms in Indian universities related to paper evaluation. A light-weight model and accurate results makes this technique is cost-efficient and sustainable.","PeriodicalId":129682,"journal":{"name":"2021 Emerging Trends in Industry 4.0 (ETI 4.0)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Some Aspects of Text Recognition from Video Document in Education 4.0\",\"authors\":\"G. Buddhawar, K. Jariwala, C. Chattopadhyay\",\"doi\":\"10.1109/ETI4.051663.2021.9619427\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Optical character recognition in the video document or videos from is a challenging task because flipping in the unconstrained environment and at any speed can cause difficulties specially in selecting a frame that contains the required image (OBI) for readability. In this paper, we stress on the problem of recognizing the readable and whole frame by selecting the frames with optical flow. Fundamentally, optical flow is used here for segmentation of the required area from the frame. Due to the unavailability of such a dataset, we have also created a dataset of video document containing book-flipping videos. Experiments are executed on the sample video documents recorded using mobile camera. The use of optical flow on video document gives proper track of motion as it detects the boundary of the document. Here in this case a normal Indian publication textbook is used. The need to develop this technique is to help various educational reforms in Indian universities related to paper evaluation. A light-weight model and accurate results makes this technique is cost-efficient and sustainable.\",\"PeriodicalId\":129682,\"journal\":{\"name\":\"2021 Emerging Trends in Industry 4.0 (ETI 4.0)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-05-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 Emerging Trends in Industry 4.0 (ETI 4.0)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ETI4.051663.2021.9619427\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Emerging Trends in Industry 4.0 (ETI 4.0)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ETI4.051663.2021.9619427","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Some Aspects of Text Recognition from Video Document in Education 4.0
Optical character recognition in the video document or videos from is a challenging task because flipping in the unconstrained environment and at any speed can cause difficulties specially in selecting a frame that contains the required image (OBI) for readability. In this paper, we stress on the problem of recognizing the readable and whole frame by selecting the frames with optical flow. Fundamentally, optical flow is used here for segmentation of the required area from the frame. Due to the unavailability of such a dataset, we have also created a dataset of video document containing book-flipping videos. Experiments are executed on the sample video documents recorded using mobile camera. The use of optical flow on video document gives proper track of motion as it detects the boundary of the document. Here in this case a normal Indian publication textbook is used. The need to develop this technique is to help various educational reforms in Indian universities related to paper evaluation. A light-weight model and accurate results makes this technique is cost-efficient and sustainable.