SurveyNet: A Unified Deep Learning Framework for OCR and OMR-Based Survey Digitization.

IF 2.7 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY

Journal of Imaging Pub Date : 2026-04-17 DOI:10.3390/jimaging12040175

Rubi Quiñones, Sreeja Cheekireddy, Eren Gultepe

{"title":"SurveyNet: A Unified Deep Learning Framework for OCR and OMR-Based Survey Digitization.","authors":"Rubi Quiñones, Sreeja Cheekireddy, Eren Gultepe","doi":"10.3390/jimaging12040175","DOIUrl":null,"url":null,"abstract":"<p><p>Manual survey data entry remains a bottleneck in large-scale research, marketing, and public policy, where survey sheets are still widely used due to accessibility and high response rates. Despite the progress in Optical Character Recognition (OCR) and Optical Mark Recognition (OMR), existing systems treat these tasks separately and are typically tailored to clean, standardized forms, making them unreliable for real-world survey sheets with diverse markings and handwritten inputs. These limitations hinder automation and introduce significant error rates in data transcription. To address this, we propose SurveyNet, a unified deep learning framework that combines OCR and OMR capabilities to automatically digitize complex survey responses within a single model. SurveyNet processes both handwritten digits and a wide variety of mark types including ticks, circles, and crosses across multiple question formats. We also introduce SurveySet, a novel dataset comprising 135 real-world survey forms annotated across four key response types. Experimental results demonstrate that SurveyNet achieves between 50% and 97% classification accuracy across tasks, with strong performance even on small and imbalanced datasets. This framework offers a scalable solution for streamlining survey digitization workflows, reducing manual errors, and enabling timely analysis in domains ranging from consumer research to public health and education.</p>","PeriodicalId":37035,"journal":{"name":"Journal of Imaging","volume":"12 4","pages":""},"PeriodicalIF":2.7000,"publicationDate":"2026-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC13117292/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Imaging","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/jimaging12040175","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Manual survey data entry remains a bottleneck in large-scale research, marketing, and public policy, where survey sheets are still widely used due to accessibility and high response rates. Despite the progress in Optical Character Recognition (OCR) and Optical Mark Recognition (OMR), existing systems treat these tasks separately and are typically tailored to clean, standardized forms, making them unreliable for real-world survey sheets with diverse markings and handwritten inputs. These limitations hinder automation and introduce significant error rates in data transcription. To address this, we propose SurveyNet, a unified deep learning framework that combines OCR and OMR capabilities to automatically digitize complex survey responses within a single model. SurveyNet processes both handwritten digits and a wide variety of mark types including ticks, circles, and crosses across multiple question formats. We also introduce SurveySet, a novel dataset comprising 135 real-world survey forms annotated across four key response types. Experimental results demonstrate that SurveyNet achieves between 50% and 97% classification accuracy across tasks, with strong performance even on small and imbalanced datasets. This framework offers a scalable solution for streamlining survey digitization workflows, reducing manual errors, and enabling timely analysis in domains ranging from consumer research to public health and education.

查看原文本刊更多论文

基于OCR和omr的调查数字化的统一深度学习框架。

人工调查数据输入仍然是大规模研究、市场营销和公共政策的瓶颈，在这些领域，由于易于获取和高回复率，调查表仍然被广泛使用。尽管光学字符识别（OCR）和光学标记识别（OMR）取得了进展，但现有系统将这些任务分开处理，并且通常针对干净、标准化的表格进行定制，这使得它们对于具有不同标记和手写输入的真实调查表不可靠。这些限制阻碍了自动化，并在数据转录中引入了显著的错误率。为了解决这个问题，我们提出了SurveyNet，这是一个统一的深度学习框架，结合了OCR和OMR功能，可以在单个模型中自动数字化复杂的调查响应。SurveyNet可以处理手写数字和各种各样的标记类型，包括勾号、圆圈和跨多种问题格式的十字。我们还介绍了SurveySet，这是一个新的数据集，包含135个真实世界的调查表格，标注了四种关键的回答类型。实验结果表明，SurveyNet跨任务的分类准确率在50%到97%之间，即使在小而不平衡的数据集上也有很强的性能。该框架提供了一个可扩展的解决方案，用于简化调查数字化工作流程，减少人工错误，并在从消费者研究到公共卫生和教育等领域实现及时分析。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊