Shana Sherin M., Shyna A., Jini Raju, Reena Mary George
{"title":"BrailleSegNet:一种新的盲文数据集生成和字符分割方法","authors":"Shana Sherin M., Shyna A., Jini Raju, Reena Mary George","doi":"10.1016/j.displa.2025.103145","DOIUrl":null,"url":null,"abstract":"<div><div>Recent research in the field of Braille learning has highlighted vital role of accurately segmenting Braille letters from Braille documents to improve accessibility and educational opportunities for visually impaired children. A novel methodology, BrailleSegNet, is proposed for Braille Dataset Generation and Braille character segmentation, structured into six distinct phases: Image Acquisition, Image Preprocessing, Fixed-Sized Square Conversion, Rows Extraction, Zonal Operations, and Braille Character Extraction. The initial phase involves acquiring images from the Braille-TextStory dataset, followed by preprocessing steps including grayscale conversion, binary conversion, Gaussian filtering for noise removal, and image inversion. Subsequently, the method standardizes the varying sizes and shapes of Braille dots into fixed-sized squares, extracts rows containing Braille characters, and performs zonal operations such as vertical dilation, zone identification, full zone conversion, and space zone addition to accurately segment and recognize Braille characters. The final phase extracts Braille characters from the designated full zones. Addressing the scarcity of datasets with appropriate ground truth reflecting real-world Braille document scenarios, a new dataset, Braille-TextStory, was created as part of this work. This dataset includes short stories in English, generated using the Braille-PageMap algorithm for evaluating Braille character segmentation techniques. The Braille-TextStory dataset maps English letters to their corresponding Braille images, accurately placing them on plain pages with proper management of parameters such as letter spacing, word spacing, and line spacing to preserve the integrity and readability of Braille documents. The proposed segmentation methodology was tested using this dataset, demonstrating a high level of effectiveness and accuracy compared to state-of-the-art methods.</div></div>","PeriodicalId":50570,"journal":{"name":"Displays","volume":"90 ","pages":"Article 103145"},"PeriodicalIF":3.4000,"publicationDate":"2025-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"BrailleSegNet: A novel methodology for Braille dataset generation and character segmentation\",\"authors\":\"Shana Sherin M., Shyna A., Jini Raju, Reena Mary George\",\"doi\":\"10.1016/j.displa.2025.103145\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Recent research in the field of Braille learning has highlighted vital role of accurately segmenting Braille letters from Braille documents to improve accessibility and educational opportunities for visually impaired children. A novel methodology, BrailleSegNet, is proposed for Braille Dataset Generation and Braille character segmentation, structured into six distinct phases: Image Acquisition, Image Preprocessing, Fixed-Sized Square Conversion, Rows Extraction, Zonal Operations, and Braille Character Extraction. The initial phase involves acquiring images from the Braille-TextStory dataset, followed by preprocessing steps including grayscale conversion, binary conversion, Gaussian filtering for noise removal, and image inversion. Subsequently, the method standardizes the varying sizes and shapes of Braille dots into fixed-sized squares, extracts rows containing Braille characters, and performs zonal operations such as vertical dilation, zone identification, full zone conversion, and space zone addition to accurately segment and recognize Braille characters. The final phase extracts Braille characters from the designated full zones. Addressing the scarcity of datasets with appropriate ground truth reflecting real-world Braille document scenarios, a new dataset, Braille-TextStory, was created as part of this work. This dataset includes short stories in English, generated using the Braille-PageMap algorithm for evaluating Braille character segmentation techniques. The Braille-TextStory dataset maps English letters to their corresponding Braille images, accurately placing them on plain pages with proper management of parameters such as letter spacing, word spacing, and line spacing to preserve the integrity and readability of Braille documents. The proposed segmentation methodology was tested using this dataset, demonstrating a high level of effectiveness and accuracy compared to state-of-the-art methods.</div></div>\",\"PeriodicalId\":50570,\"journal\":{\"name\":\"Displays\",\"volume\":\"90 \",\"pages\":\"Article 103145\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-07-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Displays\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0141938225001829\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Displays","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0141938225001829","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
BrailleSegNet: A novel methodology for Braille dataset generation and character segmentation
Recent research in the field of Braille learning has highlighted vital role of accurately segmenting Braille letters from Braille documents to improve accessibility and educational opportunities for visually impaired children. A novel methodology, BrailleSegNet, is proposed for Braille Dataset Generation and Braille character segmentation, structured into six distinct phases: Image Acquisition, Image Preprocessing, Fixed-Sized Square Conversion, Rows Extraction, Zonal Operations, and Braille Character Extraction. The initial phase involves acquiring images from the Braille-TextStory dataset, followed by preprocessing steps including grayscale conversion, binary conversion, Gaussian filtering for noise removal, and image inversion. Subsequently, the method standardizes the varying sizes and shapes of Braille dots into fixed-sized squares, extracts rows containing Braille characters, and performs zonal operations such as vertical dilation, zone identification, full zone conversion, and space zone addition to accurately segment and recognize Braille characters. The final phase extracts Braille characters from the designated full zones. Addressing the scarcity of datasets with appropriate ground truth reflecting real-world Braille document scenarios, a new dataset, Braille-TextStory, was created as part of this work. This dataset includes short stories in English, generated using the Braille-PageMap algorithm for evaluating Braille character segmentation techniques. The Braille-TextStory dataset maps English letters to their corresponding Braille images, accurately placing them on plain pages with proper management of parameters such as letter spacing, word spacing, and line spacing to preserve the integrity and readability of Braille documents. The proposed segmentation methodology was tested using this dataset, demonstrating a high level of effectiveness and accuracy compared to state-of-the-art methods.
期刊介绍:
Displays is the international journal covering the research and development of display technology, its effective presentation and perception of information, and applications and systems including display-human interface.
Technical papers on practical developments in Displays technology provide an effective channel to promote greater understanding and cross-fertilization across the diverse disciplines of the Displays community. Original research papers solving ergonomics issues at the display-human interface advance effective presentation of information. Tutorial papers covering fundamentals intended for display technologies and human factor engineers new to the field will also occasionally featured.