{"title":"The Experimental Implementation of GrabCut for Hardcode Subtitle Extraction","authors":"Dong Wang, Aimoerfu","doi":"10.1109/ICIS.2018.8466484","DOIUrl":null,"url":null,"abstract":"In this paper, we describe a relatively convenient way to extract and recognize text from a background in various circumstances of complicity. One general purpose in many applications is to extract hardcode subtitles from videos. Popular hardcode subtitle-rip tools (applications) nowadays follow a similar procedure from text location settings, customized image post-editing to OCR process. The quality of results usually may not be satisfied, and the supporting video formats are limited. Thanks to \"GrabCut\" image segmentation algorithm and \"Tesseract-OCR\" technology, we have developed a more augmented version in python for hardcode subtitle extraction on a fixed location, based on many attempts of experimental practice and research. The method we proposed outperforms most of the competitive tools on the quality of results.","PeriodicalId":447019,"journal":{"name":"2018 IEEE/ACIS 17th International Conference on Computer and Information Science (ICIS)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE/ACIS 17th International Conference on Computer and Information Science (ICIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIS.2018.8466484","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
In this paper, we describe a relatively convenient way to extract and recognize text from a background in various circumstances of complicity. One general purpose in many applications is to extract hardcode subtitles from videos. Popular hardcode subtitle-rip tools (applications) nowadays follow a similar procedure from text location settings, customized image post-editing to OCR process. The quality of results usually may not be satisfied, and the supporting video formats are limited. Thanks to "GrabCut" image segmentation algorithm and "Tesseract-OCR" technology, we have developed a more augmented version in python for hardcode subtitle extraction on a fixed location, based on many attempts of experimental practice and research. The method we proposed outperforms most of the competitive tools on the quality of results.