HCCD：用于在不同退化条件下增强文档的手写相机捕获数据集

IF 1 Q3 MULTIDISCIPLINARY SCIENCES

Data in Brief Pub Date : 2025-07-02 DOI:10.1016/j.dib.2025.111849

K.S. Koushik , Bipin Nair B J , N. Shobha Rani

{"title":"HCCD：用于在不同退化条件下增强文档的手写相机捕获数据集","authors":"K.S. Koushik , Bipin Nair B J , N. Shobha Rani","doi":"10.1016/j.dib.2025.111849","DOIUrl":null,"url":null,"abstract":"<div><div>Enhancing degraded handwritten documents captured with smartphone cameras remains a significant challenge in document analysis. Although deep learning-based enhancement techniques have shown promise, the performance of deep learning models largely relies on the availability of meticulously labeled ground truth datasets. To address this gap, in this study, the Handwritten Camera-Captured Dataset (HCCD) is introduced to support document enhancement and recognition tasks specific to real-world scenarios. Unlike existing datasets, which are captured in controlled environments with scanners or smartphone cameras, HCCD features real-time, camera-captured handwritten documents exhibiting a range of natural degradations. The degradation issues encompass motion blur, shadow artifacts, and uneven lighting, which reflect challenges incurred in the real-life document digitization process.</div><div>In the proposed dataset, each handwritten document is paired with a high-quality enhanced image created through a combination of computer vision-based imaging techniques. The documents are in Roman script and were contributed by multiple individuals with varying handwriting styles. The dataset is valuable for machine learning/ deep learning-based training for image restoration, denoising, and OCR applications. Each sample is annotated with rich metadata for further targeted research, including degradation type, severity level, and writer-specific demographics.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"61 ","pages":"Article 111849"},"PeriodicalIF":1.0000,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"HCCD: A handwritten camera-captured dataset for document enhancement under varied degradation conditions\",\"authors\":\"K.S. Koushik , Bipin Nair B J , N. Shobha Rani\",\"doi\":\"10.1016/j.dib.2025.111849\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Enhancing degraded handwritten documents captured with smartphone cameras remains a significant challenge in document analysis. Although deep learning-based enhancement techniques have shown promise, the performance of deep learning models largely relies on the availability of meticulously labeled ground truth datasets. To address this gap, in this study, the Handwritten Camera-Captured Dataset (HCCD) is introduced to support document enhancement and recognition tasks specific to real-world scenarios. Unlike existing datasets, which are captured in controlled environments with scanners or smartphone cameras, HCCD features real-time, camera-captured handwritten documents exhibiting a range of natural degradations. The degradation issues encompass motion blur, shadow artifacts, and uneven lighting, which reflect challenges incurred in the real-life document digitization process.</div><div>In the proposed dataset, each handwritten document is paired with a high-quality enhanced image created through a combination of computer vision-based imaging techniques. The documents are in Roman script and were contributed by multiple individuals with varying handwriting styles. The dataset is valuable for machine learning/ deep learning-based training for image restoration, denoising, and OCR applications. Each sample is annotated with rich metadata for further targeted research, including degradation type, severity level, and writer-specific demographics.</div></div>\",\"PeriodicalId\":10973,\"journal\":{\"name\":\"Data in Brief\",\"volume\":\"61 \",\"pages\":\"Article 111849\"},\"PeriodicalIF\":1.0000,\"publicationDate\":\"2025-07-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Data in Brief\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2352340925005761\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data in Brief","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352340925005761","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}

引用次数: 0

摘要

增强智能手机相机捕获的退化手写文档仍然是文档分析中的重大挑战。尽管基于深度学习的增强技术已经显示出前景，但深度学习模型的性能在很大程度上依赖于精心标记的地面真实数据集的可用性。为了解决这一差距，在本研究中，引入了手写相机捕获数据集（HCCD）来支持特定于现实场景的文档增强和识别任务。现有的数据集是通过扫描仪或智能手机相机在受控环境中捕获的，与之不同，HCCD的特点是实时的，相机捕获的手写文档显示了一系列自然退化。退化问题包括运动模糊，阴影伪影和不均匀的照明，这反映了现实生活中文档数字化过程中产生的挑战。在建议的数据集中，每个手写文档都与通过基于计算机视觉的成像技术组合创建的高质量增强图像配对。这些文件是用罗马文字写的，是由不同笔迹风格的人共同完成的。该数据集对于基于机器学习/深度学习的图像恢复、去噪和OCR应用的训练很有价值。每个样本都有丰富的元数据注释，用于进一步有针对性的研究，包括退化类型、严重级别和特定于作者的人口统计数据。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

HCCD: A handwritten camera-captured dataset for document enhancement under varied degradation conditions

Enhancing degraded handwritten documents captured with smartphone cameras remains a significant challenge in document analysis. Although deep learning-based enhancement techniques have shown promise, the performance of deep learning models largely relies on the availability of meticulously labeled ground truth datasets. To address this gap, in this study, the Handwritten Camera-Captured Dataset (HCCD) is introduced to support document enhancement and recognition tasks specific to real-world scenarios. Unlike existing datasets, which are captured in controlled environments with scanners or smartphone cameras, HCCD features real-time, camera-captured handwritten documents exhibiting a range of natural degradations. The degradation issues encompass motion blur, shadow artifacts, and uneven lighting, which reflect challenges incurred in the real-life document digitization process.

In the proposed dataset, each handwritten document is paired with a high-quality enhanced image created through a combination of computer vision-based imaging techniques. The documents are in Roman script and were contributed by multiple individuals with varying handwriting styles. The dataset is valuable for machine learning/ deep learning-based training for image restoration, denoising, and OCR applications. Each sample is annotated with rich metadata for further targeted research, including degradation type, severity level, and writer-specific demographics.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Data in Brief MULTIDISCIPLINARY SCIENCES-

CiteScore

3.10

自引率

0.00%

发文量

996

审稿时长

70 days

期刊介绍： Data in Brief provides a way for researchers to easily share and reuse each other''s datasets by publishing data articles that: -Thoroughly describe your data, facilitating reproducibility. -Make your data, which is often buried in supplementary material, easier to find. -Increase traffic towards associated research articles and data, leading to more citations. -Open up doors for new collaborations. Because you never know what data will be useful to someone else, Data in Brief welcomes submissions that describe data from all research areas.