Deep Neural Network Gaussian Process Regression Method for End-to-end Driving Behavior Learning

Hongjun Chen, Yujun Zeng, J. Huang, Yichuan Zhang
{"title":"Deep Neural Network Gaussian Process Regression Method for End-to-end Driving Behavior Learning","authors":"Hongjun Chen, Yujun Zeng, J. Huang, Yichuan Zhang","doi":"10.1109/ICMRA51221.2020.9398374","DOIUrl":null,"url":null,"abstract":"For dynamic complex driving tasks of autonomous vehicle, traditional driving behavior learning methods still need to be improved in terms of learning accuracy and generalization performance. A convolutional neural network gaussian process regression (CNN-GPR) method for driving behavior learning is proposed to tackle these problems that full connection layers of end-to-end convolutional neural networks (CNN) have limited generalization ability and easily converge to local optimization. At the same time, the PilotNet-GPR algorithm based on the CNN-GPR method is designed. The gaussian process regression (GPR) method with global mapping capability and better generalization performance is used to improve fully connected layers of the end-to-end CNN in order to complete the mapping from features extracted to driving actions more efficiently for the proposed CNN-GPR method. In addition, the long short-term memory network (LSTM) is added into CNN-GPR method, and a convolutional long short-term memory network gaussian process regression (CNN-LSTM-GPR) method of driving behavior learning with time-sequential images is proposed in order to further improve the accuracy of driving behavior learning through time-sequential information. This method utilises the gaussian process regression method to improve structures of fully connected layers in the cascaded deep neural network (CNN-LSTM) in order to make a more efficient learning approximation between features extracted by the CNN-LSTM and driving actions. Verification experiments on the Apollo end-to-end dataset for autonomous driving show that the proposed CNN-GPR method can further improve the imitation accuracy for end-to-end driving behavior learning and promote generalization performance of learned models compared with the PilotNet method. compared with related end-to-end driving behavior learning methods based on single image under the same condition, the proposed CNN-LSTM-GPR method can make full use of time-sequential information of images, which leads to smaller imitation errors. Additionally, it can further enhance the learning accuracy and demonstrates a more satisfying generalization performance compared with the CNN-LSTM method.","PeriodicalId":160127,"journal":{"name":"2020 3rd International Conference on Mechatronics, Robotics and Automation (ICMRA)","volume":"89 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 3rd International Conference on Mechatronics, Robotics and Automation (ICMRA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMRA51221.2020.9398374","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

For dynamic complex driving tasks of autonomous vehicle, traditional driving behavior learning methods still need to be improved in terms of learning accuracy and generalization performance. A convolutional neural network gaussian process regression (CNN-GPR) method for driving behavior learning is proposed to tackle these problems that full connection layers of end-to-end convolutional neural networks (CNN) have limited generalization ability and easily converge to local optimization. At the same time, the PilotNet-GPR algorithm based on the CNN-GPR method is designed. The gaussian process regression (GPR) method with global mapping capability and better generalization performance is used to improve fully connected layers of the end-to-end CNN in order to complete the mapping from features extracted to driving actions more efficiently for the proposed CNN-GPR method. In addition, the long short-term memory network (LSTM) is added into CNN-GPR method, and a convolutional long short-term memory network gaussian process regression (CNN-LSTM-GPR) method of driving behavior learning with time-sequential images is proposed in order to further improve the accuracy of driving behavior learning through time-sequential information. This method utilises the gaussian process regression method to improve structures of fully connected layers in the cascaded deep neural network (CNN-LSTM) in order to make a more efficient learning approximation between features extracted by the CNN-LSTM and driving actions. Verification experiments on the Apollo end-to-end dataset for autonomous driving show that the proposed CNN-GPR method can further improve the imitation accuracy for end-to-end driving behavior learning and promote generalization performance of learned models compared with the PilotNet method. compared with related end-to-end driving behavior learning methods based on single image under the same condition, the proposed CNN-LSTM-GPR method can make full use of time-sequential information of images, which leads to smaller imitation errors. Additionally, it can further enhance the learning accuracy and demonstrates a more satisfying generalization performance compared with the CNN-LSTM method.
端到端驾驶行为学习的深度神经网络高斯过程回归方法
对于自动驾驶汽车动态复杂的驾驶任务,传统的驾驶行为学习方法在学习精度和泛化性能方面还有待提高。针对端到端卷积神经网络(CNN)的全连接层泛化能力有限、容易收敛到局部最优的问题,提出了一种卷积神经网络高斯过程回归(CNN- gpr)驾驶行为学习方法。同时,设计了基于CNN-GPR方法的pilot - net - gpr算法。利用具有全局映射能力和更好泛化性能的高斯过程回归(GPR)方法对端到端CNN的全连通层进行改进,使所提出的CNN-GPR方法能够更高效地完成从特征提取到驱动动作的映射。此外,在CNN-GPR方法中加入了长短期记忆网络(LSTM),提出了一种基于时间序列图像的卷积长短期记忆网络高斯过程回归(CNN-LSTM-GPR)驾驶行为学习方法,进一步提高了通过时间序列信息学习驾驶行为的准确性。该方法利用高斯过程回归方法对级联深度神经网络(CNN-LSTM)中全连接层的结构进行改进,使CNN-LSTM提取的特征与驱动动作之间的学习近似更加有效。在Apollo端到端自动驾驶数据集上的验证实验表明,与PilotNet方法相比,本文提出的CNN-GPR方法可以进一步提高端到端驾驶行为学习的模仿精度,提高学习模型的泛化性能。与相同条件下基于单幅图像的端到端驾驶行为学习方法相比,本文提出的CNN-LSTM-GPR方法可以充分利用图像的时间序列信息,模仿误差较小。此外,与CNN-LSTM方法相比,该方法可以进一步提高学习精度,并表现出更令人满意的泛化性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信