Bilingual Bangla OCR for Rural Empowerment: Detecting Handwritten Queries and Agricultural Assistance

Mahanur Alam;Md. Johirul Islam Tutul;Md. Anwar Hussen Wadud;Md. Jakir Hossen;M. F. Mridha
{"title":"Bilingual Bangla OCR for Rural Empowerment: Detecting Handwritten Queries and Agricultural Assistance","authors":"Mahanur Alam;Md. Johirul Islam Tutul;Md. Anwar Hussen Wadud;Md. Jakir Hossen;M. F. Mridha","doi":"10.1109/OJCS.2025.3573317","DOIUrl":null,"url":null,"abstract":"Farmers in rural areas often struggle to access crucial agricultural information due to language barriers, low literacy rates, and limited exposure to digital tools. While many can write in Bangla, most agricultural resources are available only in English or require navigating complex systems, making it difficult for them to find relevant information. Existing Optical Character Recognition (OCR) technologies, which could help bridge this gap, are primarily designed for printed text and often fail to recognize handwritten Bangla script accurately. Issues such as biased datasets, diverse handwriting styles, and background noise further reduce accuracy, making these systems unreliable for real-world use. To tackle these challenges, we have developed a lightweight and unbiased OCR model specifically for handwritten Bangla text. Our solution integrates a custom Convolutional Neural Network (CNN) with InceptionV3, enhancing recognition accuracy while ensuring efficiency for low-resource devices like smartphones. Additionally, we incorporate a two-way translation feature, enabling seamless Bangla-to-English and English-to-Bangla conversion. This allows farmers to write in Bangla, translate content when needed, and access critical information in a way that best suits them. Our solution empowers rural farmers by enabling them to interact with digital platforms in their native language, bridging the gap between handwritten communication and modern technology. Beyond agriculture, this technology has far-reaching applications in tourism, healthcare, education, and government services, fostering digital inclusion. By advancing OCR for Bangla, our research promotes equitable access to technology, equipping communities with essential tools to improve productivity and quality of life in the digital era.","PeriodicalId":13205,"journal":{"name":"IEEE Open Journal of the Computer Society","volume":"6 ","pages":"943-954"},"PeriodicalIF":0.0000,"publicationDate":"2025-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11014544","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Open Journal of the Computer Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11014544/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Farmers in rural areas often struggle to access crucial agricultural information due to language barriers, low literacy rates, and limited exposure to digital tools. While many can write in Bangla, most agricultural resources are available only in English or require navigating complex systems, making it difficult for them to find relevant information. Existing Optical Character Recognition (OCR) technologies, which could help bridge this gap, are primarily designed for printed text and often fail to recognize handwritten Bangla script accurately. Issues such as biased datasets, diverse handwriting styles, and background noise further reduce accuracy, making these systems unreliable for real-world use. To tackle these challenges, we have developed a lightweight and unbiased OCR model specifically for handwritten Bangla text. Our solution integrates a custom Convolutional Neural Network (CNN) with InceptionV3, enhancing recognition accuracy while ensuring efficiency for low-resource devices like smartphones. Additionally, we incorporate a two-way translation feature, enabling seamless Bangla-to-English and English-to-Bangla conversion. This allows farmers to write in Bangla, translate content when needed, and access critical information in a way that best suits them. Our solution empowers rural farmers by enabling them to interact with digital platforms in their native language, bridging the gap between handwritten communication and modern technology. Beyond agriculture, this technology has far-reaching applications in tourism, healthcare, education, and government services, fostering digital inclusion. By advancing OCR for Bangla, our research promotes equitable access to technology, equipping communities with essential tools to improve productivity and quality of life in the digital era.
双语孟加拉语OCR用于农村赋权:检测手写查询和农业援助
由于语言障碍、识字率低以及接触数字工具有限,农村地区的农民往往难以获取关键的农业信息。虽然许多人可以用孟加拉语写作,但大多数农业资源只能用英语提供,或者需要导航复杂的系统,这使得他们很难找到相关信息。现有的光学字符识别(OCR)技术可以帮助弥补这一差距,但它们主要是为印刷文本设计的,常常不能准确识别手写的孟加拉文。诸如有偏差的数据集、不同的手写风格和背景噪声等问题进一步降低了准确性,使这些系统在实际使用中不可靠。为了应对这些挑战,我们开发了一个轻量级的、无偏置的OCR模型,专门用于手写的孟加拉文本。我们的解决方案将自定义卷积神经网络(CNN)与InceptionV3集成在一起,提高了识别精度,同时确保了智能手机等低资源设备的效率。此外,我们还结合了双向翻译功能,实现了孟加拉语到英语和英语到孟加拉语的无缝转换。这使得农民可以用孟加拉语写作,在需要时翻译内容,并以最适合他们的方式获取关键信息。我们的解决方案使农村农民能够用母语与数字平台进行互动,弥合了手写通信与现代技术之间的差距。除农业外,该技术还在旅游、医疗保健、教育和政府服务方面有着深远的应用,促进了数字包容。通过推进孟加拉国的OCR,我们的研究促进了对技术的公平获取,为社区提供了必要的工具,以提高数字时代的生产力和生活质量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
12.60
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信