{"title":"Off-policy correction algorithm for double Q network based on deep reinforcement learning","authors":"Qingbo Zhang, Manlu Liu, Heng Wang, Weimin Qian, Xinglang Zhang","doi":"10.1049/csy2.12102","DOIUrl":"https://doi.org/10.1049/csy2.12102","url":null,"abstract":"<p>A deep reinforcement learning (DRL) method based on the deep deterministic policy gradient (DDPG) algorithm is proposed to address three problems: the mismatch between the required and the actual training samples during agent training, the overestimation and underestimation of Q-values, and the insufficient dynamism of the agent's policy exploration. This method introduces the Actor-Critic Off-Policy Correction (AC-Off-POC) reinforcement learning framework and an improved double Q-value learning method, which enables the value function network in the target task to provide a more accurate evaluation of the policy network and converge to the optimal policy more quickly and stably to obtain higher value returns. The method is applied to multiple MuJoCo tasks on the OpenAI Gym simulation platform. The experimental results show that it is better than the DDPG algorithm based solely on the off-policy correction framework (AC-Off-POC) and the conventional DRL algorithm. 
The value of returns and stability of the double-Q-network off-policy correction algorithm for the deep deterministic policy gradient (DCAOP-DDPG) proposed by the authors are significantly higher than those of other DRL algorithms.</p>","PeriodicalId":34110,"journal":{"name":"IET Cybersystems and Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/csy2.12102","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139041971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Printed circuit board solder joint quality inspection based on lightweight classification network","authors":"Zhicong Zhang, Wenyu Zhang, Donglin Zhu, Yi Xu, Changjun Zhou","doi":"10.1049/csy2.12101","DOIUrl":"https://doi.org/10.1049/csy2.12101","url":null,"abstract":"<p>Solder joint quality inspection is a crucial step in the qualification inspection of printed circuit board (PCB) components, and efficient and accurate inspection methods will greatly improve its production efficiency. In this paper, we propose a PCB solder joint quality detection algorithm based on a lightweight classification network. First, the Select Joint segmentation method was used to obtain the solder joint information, and colour space conversion was used to locate the solder joint. The mask method, contour detection, and box line method were combined to complete the extraction of solder joint information. Then, by combining the respective characteristics of convolutional neural network and Transformer and introducing Cross-covariance attention to reduce the computational complexity and resource consumption of the model and evenly distribute the global view mutual information in the whole training process, a new lightweight network model MobileXT is proposed to complete defect classification. Using only 16.4% of the computing resources of the Vision Transformer, this model achieves an average accuracy improvement of 31%. Additionally, the network is trained and validated using a dataset of 1804 solder joint images constructed from 93 PCB images and two external datasets to evaluate MobileXT performance. 
The proposed method achieves more efficient localization of the solder joint information and more accurate classification of weld joint defects, and the lightweight model design is more appropriate for industrial edge device deployments.</p>","PeriodicalId":34110,"journal":{"name":"IET Cybersystems and Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/csy2.12101","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134806524","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A proposal on centralised and distributed optimisation via proportional–integral–derivative controllers (PID) control perspective","authors":"Jiaxu Liu, Song Chen, Shengze Cai, Chao Xu","doi":"10.1049/csy2.12100","DOIUrl":"10.1049/csy2.12100","url":null,"abstract":"<p>Motivated by the excellent performance of proportional–integral–derivative controllers (PIDs) in the field of control, the authors injected the philosophy of PID into optimisation and introduced two types of novel PID optimisers from a continuous-time view, which benefit from the idea that discrete-time optimisation algorithm can be modelled as a continuous dynamical system/controlled system. For centralised optimisation, the authors discuss the idea of the first-order PID optimiser and the second-order accelerated PID optimiser. Furthermore, this framework is extended into distributed optimisation settings, and a distributed PID optimiser is proposed. Finally, some numerical examples are given to verify our ideas.</p>","PeriodicalId":34110,"journal":{"name":"IET Cybersystems and Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/csy2.12100","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135868667","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Spherical robot: A novel robot for exploration in harsh unknown environments","authors":"Wei Ren, You Wang, Haoxiang Liu, Song Jin, Yixu Wang, Yifan Liu, Ziang Zhang, Tao Hu, Guang Li","doi":"10.1049/csy2.12099","DOIUrl":"https://doi.org/10.1049/csy2.12099","url":null,"abstract":"<p>The authors propose a complete software and hardware framework for a novel spherical robot to cope with exploration in harsh and unknown environments. The proposed robot is driven by a heavy pendulum covered by a fully enclosed spherical shell, which is strongly protected, amphibious, anti-overturn and has a long battery life. Algorithms for location and perception, planning and motion control are comprehensively designed. On the one hand, the authors fully consider the kinematic model of a spherical robot, propose a positioning algorithm that fuses data from inertial measurement units, motor encoder and Global Navigation Satellite System, improve the global path planning algorithm based on Hybrid A* and design an instruction planning controller based on model predictive control (MPC). On the other hand, the dynamic model is built, the linear MPC and robust servo linear quadratic regulator algorithms are improved, and a speed controller and a direction controller are designed. In addition, based on the pose and motion characteristics of a spherical robot, a visual obstacle perception algorithm and an electronic image stabilisation algorithm are designed. 
Finally, the authors build physical systems to verify the effectiveness of the above algorithms through experiments.</p>","PeriodicalId":34110,"journal":{"name":"IET Cybersystems and Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71984763","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A novel multifunctional intelligent bed integrated with multimodal human–robot interaction approach and safe nursing methods","authors":"Donghui Zhao, Yuhui Wu, Chenhao Yang, Junyou Yang, Houdei Liu, Shuoyu Wang, Yinlai Jiang, Yokoi Hiroshi","doi":"10.1049/csy2.12097","DOIUrl":"https://doi.org/10.1049/csy2.12097","url":null,"abstract":"<p>The authors propose a multifunctional intelligent bed (MIB) that integrates multiple modes of interaction to improve the welfare of mobility-impaired users and reduce the workload of medical personnel. The MIB features independent autonomous omnidirectional movement, position adjustment, multi-degree-of-freedom (DOF) movement regulation and posture memory functions to facilitate comfortable and convenient interaction for mobility-impaired users. In particular, an integrated “MIB-state perception-interaction interfaces” system is established, and a bed fall risk detection algorithm and an assisted get-up-transfer algorithm are proposed. By recognising and sharing human body state characteristics, nursing collaboration can be achieved with caregivers or other nursing robots. Comprehensive experiments demonstrate that the MIB is highly adaptable to the environment, convenient to interact with and safe. By integrating the proposed algorithms, daily safety monitoring, assisted get-up and defecation tasks can be effectively accomplished. 
This technology demonstrates excellent applicability and promising prospects for implementation in hospitals, nursing centres and homes catering to elderly and disabled individuals with mobility impairments.</p>","PeriodicalId":34110,"journal":{"name":"IET Cybersystems and Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/csy2.12097","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50139314","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Chinese personalised text-to-speech synthesis for robot human–machine interaction","authors":"Bao Pang, Jun Teng, Qingyang Xu, Yong Song, Xianfeng Yuan, Yibin Li","doi":"10.1049/csy2.12098","DOIUrl":"https://doi.org/10.1049/csy2.12098","url":null,"abstract":"<p>Speech interaction is an important means of robot interaction. With the rapid development of deep learning, end-to-end speech synthesis methods based on this technique have gradually become mainstream. Chinese deep learning-based speech synthesis techniques suffer from problems such as unstable synthesised speech, poor naturalness and poor personalised speech synthesis, which do not satisfy some practical application scenarios. Hence, an F-MelGAN model is adopted to improve the performance of Chinese speech synthesis. A post-processing network is used to refine the Mel-spectrum predicted by the decoder and alleviate the Mel-spectrum distortion phenomenon. A phoneme-level and sentence-level combined module is proposed to model the personalised style of speakers. A combination of an acoustic conditioning network, speaker encoder network GCNet and feedback-constrained training is proposed to solve the problem of poor personalised speech synthesis and achieve personalised speech customisation in Chinese. 
Experimental results show that the whole model can generate high-quality speech with high speaker similarity for both speakers that appear in the training process and speakers that never appear in the training process.</p>","PeriodicalId":34110,"journal":{"name":"IET Cybersystems and Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/csy2.12098","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50139313","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Deep feature fusion-based stacked denoising autoencoder for tag recommendation systems","authors":"Zhengshun Fei, Jinglong Wang, Kangling Liu, Eric Attahi, Bingqiang Huang","doi":"10.1049/csy2.12095","DOIUrl":"10.1049/csy2.12095","url":null,"abstract":"<p>With the rapid development of artificial intelligence technology, commercial robots have gradually entered our daily lives. In order to promote product dissemination, shopping guide robots are a new service option of commerce platforms that use tag recommendation systems to identify users' intentions. A large number of applications combine user historical tagging information with the multi-round dialogue ability of shopping guide robots to help users efficiently search for and retrieve products of interest. Recently, tensor decomposition methods have become a common approach for modelling entity interaction relationships in tag recommendation systems. However, due to the sparsity of data, these methods only consider low-order information of entities, making it difficult to capture the higher-order collaborative signals among entities. Autoencoder-based recommendation methods can effectively extract abstract feature representations, but they focus only on the two-dimensional relationship between users and items, ignoring the interaction relationship among users, items and tags in real complex recommendation scenarios. The authors focus on modelling the similarity relationship among entities and propose a method called deep feature fusion tag (DFFT) based on the deep feature fusion of stacked denoising autoencoders. This method can extract high-order information with different embedding dimensions and fuse them in a unified framework. 
To extract robust feature representations, the authors inject random noise (mask-out/drop-out noise) into the tag information corresponding to users and items to generate corrupted input data, and then utilise autoencoders to encode the interaction relationship among entities. To further obtain the interaction relationship with different dimensions, different encoding layers are stacked and combined to produce a better expanded model which can reinforce each other. Finally, a decoding component is used to reconstruct the original input data. According to the experimental results on two common datasets, the proposed DFFT method outperforms other baselines in terms of the F1@N, NDCG@N and Recall@N evaluation metrics.</p>","PeriodicalId":34110,"journal":{"name":"IET Cybersystems and Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/csy2.12095","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41925018","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Unmanned aerial vehicle orthogonal laser localization by Gaussian mixture model-based map representation","authors":"Zeyu Wan, Changjian Jiang, Yu Zhang","doi":"10.1049/csy2.12096","DOIUrl":"10.1049/csy2.12096","url":null,"abstract":"<p>Localization is a core problem in mobile robot navigation. Simultaneous localization and mapping (SLAM) is computationally expensive for an unmanned aerial vehicle (UAV). This research aims to design an orthogonal laser scan device for localization that saves computation costs. Based on disturbance analysis, residual influences on the sensor state are quantified, and they are related to uncertainty and sensitivity. This research applied the residual selection method to a UAV. The feature point detection utilises multi-scale and Gaussian model fitting techniques to guarantee true positives. The map is represented by Gaussian Mixture Models (GMM) with lower memory costs. The orthogonal laser scan device is composed and placed on a UAV for real-time three-dimensional localization, whose errors are at the centimeter level.</p>","PeriodicalId":34110,"journal":{"name":"IET Cybersystems and Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/csy2.12096","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47246688","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"BIO-inspired fuzzy inference system—For physiological signal analysis","authors":"Ravi Suppiah, Noori Kim, Khalid Abidi, Anurag Sharma","doi":"10.1049/csy2.12093","DOIUrl":"10.1049/csy2.12093","url":null,"abstract":"<p>When a person's neuromuscular system is affected by an injury or disease, Activities-for-Daily-Living (ADL), such as gripping, turning, and walking, are impaired. Electroencephalography (EEG) and Electromyography (EMG) are physiological signals generated by a body during neuromuscular activities embedding the intentions of the subject, and they are used in Brain–Computer Interface (BCI) or robotic rehabilitation systems. However, existing BCI or robotic rehabilitation systems use signal classification techniques with limitations such as (1) missing temporal correlation of the EEG and EMG signals in the entire window and (2) overlooking the interrelationship between different sensors in the system. Furthermore, typical existing systems are designed to operate based on the presence of dominant physiological signals associated with certain actions; (3) their effectiveness will be greatly reduced if subjects are unable to generate the dominant signals. A novel classification model, named BIOFIS, is proposed, which fuses signals from different sensors to generate inter-channel and intra-channel relationships. It explores the temporal correlation of the signals within a timeframe via a Long Short-Term Memory (LSTM) block. The proposed architecture is able to classify the various subsets of a full-range arm movement that performs actions such as forward, grip and raise, lower and release, and reverse. The system can achieve 98.6% accuracy for a 4-way action using EEG data and 97.18% accuracy using EMG data. Moreover, even without the dominant signal, the accuracy scores were 90.1% for the EEG data and 85.2% for the EMG data. 
The proposed mechanism shows promise in the design of EEG/EMG-based use in the medical device and rehabilitation industries.</p>","PeriodicalId":34110,"journal":{"name":"IET Cybersystems and Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/csy2.12093","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48412643","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Body trajectory optimisation of walking gait for a quadruped robot","authors":"Minghuan Zhang, Yaguang Zhu, Ao Cao, Qibin Wei, Qiong Liu","doi":"10.1049/csy2.12094","DOIUrl":"10.1049/csy2.12094","url":null,"abstract":"<p>To ensure that the robot can follow the planned trajectory, smooth switching between swinging legs and a smooth transition of the motion process are realised. The previous motion planning work is analysed, and a method for improving the optimisation objective function and constraint conditions is proposed to eliminate the sudden change of acceleration and reduce the peak value of acceleration change. This method eliminates the impact phenomenon in the motor drive process and reduces the motor drive energy consumption, thus ensuring the smooth and consistent movement of the robot. The results show that the improved optimisation method has a better motion effect than the previous approach in terms of centre of mass motion speed, trajectory fitting and body posture change, and realises more robust motion of quadruped robots in a senseless state.</p>","PeriodicalId":34110,"journal":{"name":"IET Cybersystems and Robotics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/csy2.12094","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46025496","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}