Dipon Talukder, Fatima Jahara, Suvadra Barua, M. Haque
{"title":"OkkhorNama: BdSL Image Dataset For Real Time Object Detection Algorithms","authors":"Dipon Talukder, Fatima Jahara, Suvadra Barua, M. Haque","doi":"10.1109/TENSYMP52854.2021.9550907","DOIUrl":null,"url":null,"abstract":"In recent years, lots of researches are being conducted to interpret Bangladeshi Sign Language (BdSL) to the means that general people can communicate with people having a hearing impairment and reduce the verbal gap between them. Computer Vision is playing a vital role in this regard by developing a sustainable system to understand the signs for machine translations. To obtain optimal performance, along with the state-of-the-art CNN model, the requirement of a high-quality sign language dataset cannot be foreseen. In this paper, we have introduced a new image dataset OkkhorNama for Fingerspelled Bangladeshi Sign Language including all 46 signs with images over 12K. In each of the images, bounding boxes are carefully annotated and labeled. OkkhorNama contains images of high resolution, good quality, and adequate variation making it ideal to train object detection and localization algorithms that would perform well on real-world applications. The OkkhorNama dataset is compared with other datasets where OkkhorNama significantly outperforms other datasets in number and trained model performance. The dataset is publicly available for future research and development.","PeriodicalId":137485,"journal":{"name":"2021 IEEE Region 10 Symposium (TENSYMP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Region 10 Symposium (TENSYMP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TENSYMP52854.2021.9550907","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
In recent years, lots of researches are being conducted to interpret Bangladeshi Sign Language (BdSL) to the means that general people can communicate with people having a hearing impairment and reduce the verbal gap between them. Computer Vision is playing a vital role in this regard by developing a sustainable system to understand the signs for machine translations. To obtain optimal performance, along with the state-of-the-art CNN model, the requirement of a high-quality sign language dataset cannot be foreseen. In this paper, we have introduced a new image dataset OkkhorNama for Fingerspelled Bangladeshi Sign Language including all 46 signs with images over 12K. In each of the images, bounding boxes are carefully annotated and labeled. OkkhorNama contains images of high resolution, good quality, and adequate variation making it ideal to train object detection and localization algorithms that would perform well on real-world applications. The OkkhorNama dataset is compared with other datasets where OkkhorNama significantly outperforms other datasets in number and trained model performance. The dataset is publicly available for future research and development.