Processing the manuscripts of Atatürk

Talha Soysal, Hande Adiguzel, A. Oktem, Alican Haman, E. Can, Pinar Duygulu, M. Kalpakli
Published in: 2010 IEEE 18th Signal Processing and Communications Applications Conference, 2010-04-22
DOI: 10.1109/SIU.2010.5652708 (https://doi.org/10.1109/SIU.2010.5652708)
Citations: 0

Abstract

In this paper, as a first step toward easy and convenient access to the manuscripts of Atatürk through a word-based search engine, we study the preprocessing of the digitized documents and their segmentation into lines and words. Techniques designed for printed documents may not yield satisfactory results on these manuscripts. We therefore apply more advanced techniques: a method based on the Hough transform [1] for line segmentation, and a method that accounts for the skew of the lines for word segmentation. The results, obtained on 30 pages of documents provided by Afet İnan [2], are highly accurate and promising for future research.
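As a rough illustration of how a Hough transform can locate text lines, the sketch below lets a set of points vote for candidate lines in the normal form rho = x*cos(theta) + y*sin(theta). It is not the method used in the paper. The function name `hough_lines`, the use of one point per ink blob, the angle range around 90 degrees (near-horizontal lines), and the bin size are all assumptions made for this example.

```python
import math
from collections import Counter


def hough_lines(points, theta_degrees=range(80, 101), rho_step=5.0, min_votes=3):
    """Vote for lines rho = x*cos(theta) + y*sin(theta) through the points.

    points        : iterable of (x, y) pixel coordinates, e.g. blob centroids
                    (an assumption of this sketch, not taken from the paper)
    theta_degrees : candidate normal angles; 90 corresponds to a horizontal line
    rho_step      : width of one accumulator bin along rho, in pixels
    min_votes     : discard accumulator cells with fewer votes than this

    Returns a list of ((theta_deg, rho_bin), votes), most voted first.
    """
    votes = Counter()
    for x, y in points:
        for theta_deg in theta_degrees:
            theta = math.radians(theta_deg)
            rho = x * math.cos(theta) + y * math.sin(theta)
            # Quantize rho so that nearly collinear points share a cell.
            votes[(theta_deg, round(rho / rho_step))] += 1
    return [(cell, n) for cell, n in votes.most_common() if n >= min_votes]


# Hypothetical input: five blob centroids on each of two horizontal text lines.
points = [(x, 100) for x in range(0, 250, 50)] + [(x, 200) for x in range(0, 250, 50)]
for (theta_deg, rho_bin), n in hough_lines(points, min_votes=5):
    print(f"theta={theta_deg} deg, rho~{rho_bin * 5.0} px, votes={n}")
```

Each strong accumulator cell corresponds to one candidate text line. A slightly tilted line would peak at an angle other than 90 degrees, which is one reason a voting scheme of this kind can be more tolerant of skewed handwriting than a plain horizontal projection profile.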