OCR BASED SPEECH SYNTHESIS SYSTEM USING LABVIEW : Text to Speech Conversion System using OCR

J. J. Mullani, M. Sankar, P. Khade, Snehal H Sonalkar, Nikita L Patil
{"title":"OCR BASED SPEECH SYNTHESIS SYSTEM USING LABVIEW : Text to Speech Conversion System using OCR","authors":"J. J. Mullani, M. Sankar, P. Khade, Snehal H Sonalkar, Nikita L Patil","doi":"10.1109/ICCMC.2018.8487731","DOIUrl":null,"url":null,"abstract":"Machine replication of human capacities, such as perusing, is an antiquated dream. Be that as it may, in the course of the most recent five decades, machine perusing has developed from a fantasy to reality. Discourse is likely the most proficient medium for correspondence between people. Optical character acknowledgment has turned out to be a standout amongst the best utilizations of innovation in the field of example acknowledgment and manmade brainpower. In current society, there is an awesome request to rapidly include expansive measure of printed and manually written data into the PC, along these lines everybody depend vigorously on PCs to process tremendous volumes of information. The essential goal is to enable vocally debilitated individuals to utilize the PC or to peruse archives in a simpler way. The framework is separated into two sections initially is Optical Character Recognition (OCR) and second part is content to discourse. In the initial segment, Virtual Instrument is produced in which a hued picture that contains the characters is changed over into grayscale picture and characters are prepared and in the second part; transformation from content to discourse is created. The mean of normal review time, standard deviation, least examination time and most extreme assessment time in ms is estimated. There are a few varieties in time parameters due to factors like number of characters perceived, line profile, histogram, shine, difference and gamma revision esteems.","PeriodicalId":6604,"journal":{"name":"2018 Second International Conference on Computing Methodologies and Communication (ICCMC)","volume":"264 1","pages":"7-14"},"PeriodicalIF":0.0000,"publicationDate":"2018-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Second International Conference on Computing Methodologies and Communication (ICCMC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCMC.2018.8487731","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Machine replication of human capacities, such as perusing, is an antiquated dream. Be that as it may, in the course of the most recent five decades, machine perusing has developed from a fantasy to reality. Discourse is likely the most proficient medium for correspondence between people. Optical character acknowledgment has turned out to be a standout amongst the best utilizations of innovation in the field of example acknowledgment and manmade brainpower. In current society, there is an awesome request to rapidly include expansive measure of printed and manually written data into the PC, along these lines everybody depend vigorously on PCs to process tremendous volumes of information. The essential goal is to enable vocally debilitated individuals to utilize the PC or to peruse archives in a simpler way. The framework is separated into two sections initially is Optical Character Recognition (OCR) and second part is content to discourse. In the initial segment, Virtual Instrument is produced in which a hued picture that contains the characters is changed over into grayscale picture and characters are prepared and in the second part; transformation from content to discourse is created. The mean of normal review time, standard deviation, least examination time and most extreme assessment time in ms is estimated. There are a few varieties in time parameters due to factors like number of characters perceived, line profile, histogram, shine, difference and gamma revision esteems.
基于OCR的LABVIEW语音合成系统:基于OCR的文本到语音转换系统
机器复制人类的能力,比如阅读,是一个过时的梦想。尽管如此,在最近的50年里,机器阅读已经从幻想变成了现实。话语可能是人与人之间沟通最熟练的媒介。光学字符识别已成为实例识别和人工智能领域创新的最佳应用之一。在当今社会,有一个惊人的要求,迅速包括大量的印刷和手工写入数据到个人电脑,沿着这些路线,每个人都大力依赖个人电脑来处理大量的信息。其基本目标是使声音衰弱的人能够以更简单的方式使用PC或阅读档案。该框架首先分为光学字符识别(OCR)和内容语篇两部分。在初始部分,制作虚拟仪器,其中将包含字符的彩色图像转换为灰度图像并准备字符,第二部分;创造了从内容到话语的转换。估计了正常评审时间、标准偏差、最小评审时间和最极端评审时间的平均值(ms)。由于感知到的字符数量、线条轮廓、直方图、亮度、差异和伽玛修正值等因素,时间参数有一些变化。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信