Research on visual-tactile cross-modality based on generative adversarial network

Yaoyao Li, Huailin Zhao, Huaping Liu, Shan Lu, Y.R. Hou
{"title":"Research on visual-tactile cross-modality based on generative adversarial network","authors":"Yaoyao Li, Huailin Zhao, Huaping Liu, Shan Lu, Y.R. Hou","doi":"10.1049/CCS2.12008","DOIUrl":null,"url":null,"abstract":"Joint Fund of Science & Technology Department of Liaoning Province and State Key Laboratory of Robotics, Grant/Award Number: 2020‐KF‐22‐06; The National Natural Science Foundation Project, Grant/Award Number: 61673238 Abstract Aiming at the research of assisted blind technology, a generative adversarial network model was proposed to complete the transformation of the mode from vision to touch. Firstly, two key representations of visual to tactile sense are identified: the texture image of the object and the audio frequency that generates vibrotactile. It is essentially a matter of generating audio from images. The authors propose a cross‐modal network framework that generates corresponding vibrotactile signals based on texture images. More importantly, the network structure is an end‐to‐end, which eliminates the traditional intermediate form of converting texture image to spectrum image, and can directly carry out the transformation from visual to tactile. A quantitative evaluation system is proposed in this study, which can evaluate the performance of the network model. The experimental results show that the network can complete the conversion of visual information to tactile signals. The proposed method is proved to be superior to the existing method of indirectly generating vibrotactile signals, and the applicability of the model is verified.","PeriodicalId":187152,"journal":{"name":"Cogn. Comput. Syst.","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cogn. Comput. Syst.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1049/CCS2.12008","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Joint Fund of Science & Technology Department of Liaoning Province and State Key Laboratory of Robotics, Grant/Award Number: 2020‐KF‐22‐06; The National Natural Science Foundation Project, Grant/Award Number: 61673238 Abstract Aiming at the research of assisted blind technology, a generative adversarial network model was proposed to complete the transformation of the mode from vision to touch. Firstly, two key representations of visual to tactile sense are identified: the texture image of the object and the audio frequency that generates vibrotactile. It is essentially a matter of generating audio from images. The authors propose a cross‐modal network framework that generates corresponding vibrotactile signals based on texture images. More importantly, the network structure is an end‐to‐end, which eliminates the traditional intermediate form of converting texture image to spectrum image, and can directly carry out the transformation from visual to tactile. A quantitative evaluation system is proposed in this study, which can evaluate the performance of the network model. The experimental results show that the network can complete the conversion of visual information to tactile signals. The proposed method is proved to be superior to the existing method of indirectly generating vibrotactile signals, and the applicability of the model is verified.
基于生成对抗网络的视触觉交叉模态研究
辽宁省科技厅与机器人国家重点实验室联合基金,资助/奖励号:2020‐KF‐22‐06;摘要针对辅助盲技术的研究,提出了一种生成式对抗网络模型,以完成视觉模式到触觉模式的转换。首先,识别视觉到触觉的两个关键表征:物体的纹理图像和产生振动触觉的音频。它本质上是一个从图像生成音频的问题。作者提出了一种基于纹理图像生成相应触觉振动信号的跨模态网络框架。更重要的是,该网络结构是端到端的,消除了传统的纹理图像到光谱图像转换的中间形式,可以直接进行从视觉到触觉的转换。本文提出了一个定量评价系统,可以对网络模型的性能进行评价。实验结果表明,该网络可以完成视觉信息到触觉信号的转换。结果表明,该方法优于现有的间接产生振动触觉信号的方法,验证了模型的适用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信