Bin Wang, Baojiang Li, Liang Li, Zhekai Zhang, Shengjie Qiu, Haiyan Wang, Xichao Wang
{"title":"Object Recognition Using Shape and Texture Tactile Information: A Fusion Network Based on Data Augmentation and Attention Mechanism.","authors":"Bin Wang, Baojiang Li, Liang Li, Zhekai Zhang, Shengjie Qiu, Haiyan Wang, Xichao Wang","doi":"10.1109/TOH.2024.3502910","DOIUrl":null,"url":null,"abstract":"<p><p>Currently, most tactile-based object recognition algorithms focus on single shape or texture recognition. However, these single attribute-based recognition methods perform poorly when dealing with objects with similar shape or texture characteristics. Research on integrating shape and texture attributes is still limited, and existing feature fusion mechanisms tend to rely on simple connectivity while ignoring the interactions between different features. To address this issue, we propose a novel attention-based fusion network, TSMFormer, which classifies by integrating shape and texture information and harnesses the global learning capabilities of attention mechanisms to explore interactions between shape and texture in tactile images. Considering the advantages of Transformer networks in handling large datasets, we expanded the existing tactile image dataset through data augmentation. Extensive comparative experiments on this dataset show that the accuracy of the network combining texture and shape information is significantly improved to 99.3%. Comparisons with existing fusion methods further validate the effectiveness of our proposed attention fusion mechanism. The results demonstrate that TSMFormer is highly valuable for research, as it fuses texture and shape information in tactile images through an attention mechanism. Additionally, it shows great potential for practical applications such as robot grasping and automatic quality inspection in industrial environments.</p>","PeriodicalId":13215,"journal":{"name":"IEEE Transactions on Haptics","volume":"PP ","pages":""},"PeriodicalIF":2.4000,"publicationDate":"2024-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Haptics","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1109/TOH.2024.3502910","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, CYBERNETICS","Score":null,"Total":0}
引用次数: 0
Abstract
Currently, most tactile-based object recognition algorithms focus on single shape or texture recognition. However, these single attribute-based recognition methods perform poorly when dealing with objects with similar shape or texture characteristics. Research on integrating shape and texture attributes is still limited, and existing feature fusion mechanisms tend to rely on simple connectivity while ignoring the interactions between different features. To address this issue, we propose a novel attention-based fusion network, TSMFormer, which classifies by integrating shape and texture information and harnesses the global learning capabilities of attention mechanisms to explore interactions between shape and texture in tactile images. Considering the advantages of Transformer networks in handling large datasets, we expanded the existing tactile image dataset through data augmentation. Extensive comparative experiments on this dataset show that the accuracy of the network combining texture and shape information is significantly improved to 99.3%. Comparisons with existing fusion methods further validate the effectiveness of our proposed attention fusion mechanism. The results demonstrate that TSMFormer is highly valuable for research, as it fuses texture and shape information in tactile images through an attention mechanism. Additionally, it shows great potential for practical applications such as robot grasping and automatic quality inspection in industrial environments.
期刊介绍:
IEEE Transactions on Haptics (ToH) is a scholarly archival journal that addresses the science, technology, and applications associated with information acquisition and object manipulation through touch. Haptic interactions relevant to this journal include all aspects of manual exploration and manipulation of objects by humans, machines and interactions between the two, performed in real, virtual, teleoperated or networked environments. Research areas of relevance to this publication include, but are not limited to, the following topics: Human haptic and multi-sensory perception and action, Aspects of motor control that explicitly pertain to human haptics, Haptic interactions via passive or active tools and machines, Devices that sense, enable, or create haptic interactions locally or at a distance, Haptic rendering and its association with graphic and auditory rendering in virtual reality, Algorithms, controls, and dynamics of haptic devices, users, and interactions between the two, Human-machine performance and safety with haptic feedback, Haptics in the context of human-computer interactions, Systems and networks using haptic devices and interactions, including multi-modal feedback, Application of the above, for example in areas such as education, rehabilitation, medicine, computer-aided design, skills training, computer games, driver controls, simulation, and visualization.