{"title":"DDI-100: Dataset for Text Detection and Recognition","authors":"I. Zharikov, Filipp Nikitin, I. Vasiliev, V. Dokholyan","doi":"10.1145/3440084.3441192","DOIUrl":"https://doi.org/10.1145/3440084.3441192","url":null,"abstract":"With the increasing popularity of document analysis and recognition systems, text detection (TD) and optical character recognition (OCR) in document images have become challenging tasks. However, to the best of our knowledge, no publicly available datasets for these particular problems exist. In this paper, we introduce the Distorted Document Images dataset (DDI-100) and provide a detailed analysis of DDI-100 in its current state. To create the dataset, we collected 7000 unique document pages and extended the collection by applying different types of distortions and geometric transformations. In total, DDI-100 contains more than 100,000 document images together with binary text masks and text and character locations given as bounding boxes. We conduct experiments with several TD and OCR approaches trained on the introduced dataset. The results demonstrate the usefulness of the DDI-100 dataset for achieving high-quality results using a small amount of real data.","PeriodicalId":250100,"journal":{"name":"Proceedings of the 2020 4th International Symposium on Computer Science and Intelligent Control","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121260831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Edutainment Software for the Pepper Robot","authors":"Martin Matulík, M. Vavrecka, Lucie Vidovićová","doi":"10.1145/3440084.3441194","DOIUrl":"https://doi.org/10.1145/3440084.3441194","url":null,"abstract":"We present our software for the Pepper robot that is suitable for human-robot interaction in the area of edutainment. The combination of education and entertainment is achieved by modifying a state-of-the-art conversational artificial intelligence system, developing several interactive quiz applications for the robot, and deploying these on Pepper, a humanoid robot. In the paper, we describe the technical details of the chatbot implementation and the edutainment software.","PeriodicalId":250100,"journal":{"name":"Proceedings of the 2020 4th International Symposium on Computer Science and Intelligent Control","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125184228","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Proceedings of the 2020 4th International Symposium on Computer Science and Intelligent Control","authors":"","doi":"10.1145/3440084","DOIUrl":"https://doi.org/10.1145/3440084","url":null,"abstract":"","PeriodicalId":250100,"journal":{"name":"Proceedings of the 2020 4th International Symposium on Computer Science and Intelligent Control","volume":"110 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130313862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}