Deep Neural Network Gaussian Process Regression Method for End-to-end Driving Behavior Learning

2020 3rd International Conference on Mechatronics, Robotics and Automation (ICMRA) Pub Date : 2020-10-16 DOI:10.1109/ICMRA51221.2020.9398374

Hongjun Chen, Yujun Zeng, J. Huang, Yichuan Zhang

{"title":"Deep Neural Network Gaussian Process Regression Method for End-to-end Driving Behavior Learning","authors":"Hongjun Chen, Yujun Zeng, J. Huang, Yichuan Zhang","doi":"10.1109/ICMRA51221.2020.9398374","DOIUrl":null,"url":null,"abstract":"For dynamic complex driving tasks of autonomous vehicle, traditional driving behavior learning methods still need to be improved in terms of learning accuracy and generalization performance. A convolutional neural network gaussian process regression (CNN-GPR) method for driving behavior learning is proposed to tackle these problems that full connection layers of end-to-end convolutional neural networks (CNN) have limited generalization ability and easily converge to local optimization. At the same time, the PilotNet-GPR algorithm based on the CNN-GPR method is designed. The gaussian process regression (GPR) method with global mapping capability and better generalization performance is used to improve fully connected layers of the end-to-end CNN in order to complete the mapping from features extracted to driving actions more efficiently for the proposed CNN-GPR method. In addition, the long short-term memory network (LSTM) is added into CNN-GPR method, and a convolutional long short-term memory network gaussian process regression (CNN-LSTM-GPR) method of driving behavior learning with time-sequential images is proposed in order to further improve the accuracy of driving behavior learning through time-sequential information. This method utilises the gaussian process regression method to improve structures of fully connected layers in the cascaded deep neural network (CNN-LSTM) in order to make a more efficient learning approximation between features extracted by the CNN-LSTM and driving actions. Verification experiments on the Apollo end-to-end dataset for autonomous driving show that the proposed CNN-GPR method can further improve the imitation accuracy for end-to-end driving behavior learning and promote generalization performance of learned models compared with the PilotNet method. compared with related end-to-end driving behavior learning methods based on single image under the same condition, the proposed CNN-LSTM-GPR method can make full use of time-sequential information of images, which leads to smaller imitation errors. Additionally, it can further enhance the learning accuracy and demonstrates a more satisfying generalization performance compared with the CNN-LSTM method.","PeriodicalId":160127,"journal":{"name":"2020 3rd International Conference on Mechatronics, Robotics and Automation (ICMRA)","volume":"89 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 3rd International Conference on Mechatronics, Robotics and Automation (ICMRA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMRA51221.2020.9398374","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

For dynamic complex driving tasks of autonomous vehicle, traditional driving behavior learning methods still need to be improved in terms of learning accuracy and generalization performance. A convolutional neural network gaussian process regression (CNN-GPR) method for driving behavior learning is proposed to tackle these problems that full connection layers of end-to-end convolutional neural networks (CNN) have limited generalization ability and easily converge to local optimization. At the same time, the PilotNet-GPR algorithm based on the CNN-GPR method is designed. The gaussian process regression (GPR) method with global mapping capability and better generalization performance is used to improve fully connected layers of the end-to-end CNN in order to complete the mapping from features extracted to driving actions more efficiently for the proposed CNN-GPR method. In addition, the long short-term memory network (LSTM) is added into CNN-GPR method, and a convolutional long short-term memory network gaussian process regression (CNN-LSTM-GPR) method of driving behavior learning with time-sequential images is proposed in order to further improve the accuracy of driving behavior learning through time-sequential information. This method utilises the gaussian process regression method to improve structures of fully connected layers in the cascaded deep neural network (CNN-LSTM) in order to make a more efficient learning approximation between features extracted by the CNN-LSTM and driving actions. Verification experiments on the Apollo end-to-end dataset for autonomous driving show that the proposed CNN-GPR method can further improve the imitation accuracy for end-to-end driving behavior learning and promote generalization performance of learned models compared with the PilotNet method. compared with related end-to-end driving behavior learning methods based on single image under the same condition, the proposed CNN-LSTM-GPR method can make full use of time-sequential information of images, which leads to smaller imitation errors. Additionally, it can further enhance the learning accuracy and demonstrates a more satisfying generalization performance compared with the CNN-LSTM method.

查看原文本刊更多论文

端到端驾驶行为学习的深度神经网络高斯过程回归方法

对于自动驾驶汽车动态复杂的驾驶任务，传统的驾驶行为学习方法在学习精度和泛化性能方面还有待提高。针对端到端卷积神经网络(CNN)的全连接层泛化能力有限、容易收敛到局部最优的问题，提出了一种卷积神经网络高斯过程回归(CNN- gpr)驾驶行为学习方法。同时，设计了基于CNN-GPR方法的pilot - net - gpr算法。利用具有全局映射能力和更好泛化性能的高斯过程回归(GPR)方法对端到端CNN的全连通层进行改进，使所提出的CNN-GPR方法能够更高效地完成从特征提取到驱动动作的映射。此外，在CNN-GPR方法中加入了长短期记忆网络(LSTM)，提出了一种基于时间序列图像的卷积长短期记忆网络高斯过程回归(CNN-LSTM-GPR)驾驶行为学习方法，进一步提高了通过时间序列信息学习驾驶行为的准确性。该方法利用高斯过程回归方法对级联深度神经网络(CNN-LSTM)中全连接层的结构进行改进，使CNN-LSTM提取的特征与驱动动作之间的学习近似更加有效。在Apollo端到端自动驾驶数据集上的验证实验表明，与PilotNet方法相比，本文提出的CNN-GPR方法可以进一步提高端到端驾驶行为学习的模仿精度，提高学习模型的泛化性能。与相同条件下基于单幅图像的端到端驾驶行为学习方法相比，本文提出的CNN-LSTM-GPR方法可以充分利用图像的时间序列信息，模仿误差较小。此外，与CNN-LSTM方法相比，该方法可以进一步提高学习精度，并表现出更令人满意的泛化性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 3rd International Conference on Mechatronics, Robotics and Automation (ICMRA)

自引率

0.00%

发文量