Object recognition based on tactile information: A generalized recognition network combining wavelet transform and transformer model for small sample datasets
IF 6.8 1区 计算机科学0 COMPUTER SCIENCE, INFORMATION SYSTEMS
Liang Li , Shengjie Qiu , Baojiang Li , Bin Wang , Haiyan Wang , Zizhen Yi , Chunbo Zhao
{"title":"Object recognition based on tactile information: A generalized recognition network combining wavelet transform and transformer model for small sample datasets","authors":"Liang Li , Shengjie Qiu , Baojiang Li , Bin Wang , Haiyan Wang , Zizhen Yi , Chunbo Zhao","doi":"10.1016/j.ins.2025.122464","DOIUrl":null,"url":null,"abstract":"<div><div>Tactile recognition is a crucial pathway for robots in perception and cognitive processing. While deep learning-based methods have shown excellent performance, training deep neural networks demands a substantial number of manually labeled samples. Unfortunately, current tactile recognition datasets lack the samples needed for robust training. To address this, we introduce a generalized tactile recognition method under low-sample conditions, Wave-Tactile-Transformer. Initially, we expand the tactile data using proposed TacGAN, avoiding traditional processes like rotation and cropping that create redundancy, which yields up to 5.8% accuracy improvement over traditional augmentation techniques. We also propose a Transformer framework integrated with multi-scale wavelet transforms, which is applicable to various tactile data formats. The wavelet transform enhances the model’s ability to discern details in tactile images, while the Transformer network refines the comprehension of feature relationships. This dual approach not only significantly reduces computational costs but also boosts object recognition accuracy. Our approach introduces an innovative framework that harmonizes the processing of tactile data across diverse formats. Cross-format tactile dataset experiments achieved a peak recognition accuracy of 96.7%, outperforming conventional CNN-based methods by up to 2.6%, surpassing previous methods. This generalized tactile recognition network offers innovative solutions for robotic tactile perception and grasp control.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"719 ","pages":"Article 122464"},"PeriodicalIF":6.8000,"publicationDate":"2025-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Sciences","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0020025525005961","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Tactile recognition is a crucial pathway for robots in perception and cognitive processing. While deep learning-based methods have shown excellent performance, training deep neural networks demands a substantial number of manually labeled samples. Unfortunately, current tactile recognition datasets lack the samples needed for robust training. To address this, we introduce a generalized tactile recognition method under low-sample conditions, Wave-Tactile-Transformer. Initially, we expand the tactile data using proposed TacGAN, avoiding traditional processes like rotation and cropping that create redundancy, which yields up to 5.8% accuracy improvement over traditional augmentation techniques. We also propose a Transformer framework integrated with multi-scale wavelet transforms, which is applicable to various tactile data formats. The wavelet transform enhances the model’s ability to discern details in tactile images, while the Transformer network refines the comprehension of feature relationships. This dual approach not only significantly reduces computational costs but also boosts object recognition accuracy. Our approach introduces an innovative framework that harmonizes the processing of tactile data across diverse formats. Cross-format tactile dataset experiments achieved a peak recognition accuracy of 96.7%, outperforming conventional CNN-based methods by up to 2.6%, surpassing previous methods. This generalized tactile recognition network offers innovative solutions for robotic tactile perception and grasp control.
期刊介绍:
Informatics and Computer Science Intelligent Systems Applications is an esteemed international journal that focuses on publishing original and creative research findings in the field of information sciences. We also feature a limited number of timely tutorial and surveying contributions.
Our journal aims to cater to a diverse audience, including researchers, developers, managers, strategic planners, graduate students, and anyone interested in staying up-to-date with cutting-edge research in information science, knowledge engineering, and intelligent systems. While readers are expected to share a common interest in information science, they come from varying backgrounds such as engineering, mathematics, statistics, physics, computer science, cell biology, molecular biology, management science, cognitive science, neurobiology, behavioral sciences, and biochemistry.