自动驾驶赛车中的零射击策略转移:强化学习与模仿学习

2022 IEEE International Conference on Assured Autonomy (ICAA) Pub Date : 2022-03-01 DOI:10.1109/ICAA52185.2022.00011

Nathaniel P. Hamilton, Patrick Musau, Diego Manzanas Lopez, Taylor T. Johnson

{"title":"自动驾驶赛车中的零射击策略转移:强化学习与模仿学习","authors":"Nathaniel P. Hamilton, Patrick Musau, Diego Manzanas Lopez, Taylor T. Johnson","doi":"10.1109/ICAA52185.2022.00011","DOIUrl":null,"url":null,"abstract":"There are few technologies that hold as much promise in achieving safe, accessible, and convenient transportation as autonomous vehicles. However, as recent years have demonstrated, safety and reliability remain the most obstinate challenges, especially in complex domains. Autonomous racing has demonstrated unique benefits in that researchers can conduct research in controlled environments, allowing for experimentation with approaches that are too risky to evaluate on public roads. In this work, we compare two leading methods for training neural network controllers, Reinforcement Learning and Imitation Learning, for the autonomous racing task. We compare their viability by analyzing their performance and safety when deployed in novel scenarios outside their training via zero-shot policy transfer. Our evaluation is made up of a large number of experiments in simulation and on our real-world hardware platform that analyze whether these algorithms remain effective when transferred to the real-world. Our results show reinforcement learning outperforms imitation learning in most scenarios. However, the increased performance comes at the cost of reduced safety. Thus, both methods are effective under different criteria.","PeriodicalId":206047,"journal":{"name":"2022 IEEE International Conference on Assured Autonomy (ICAA)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Zero-Shot Policy Transfer in Autonomous Racing: Reinforcement Learning vs Imitation Learning\",\"authors\":\"Nathaniel P. Hamilton, Patrick Musau, Diego Manzanas Lopez, Taylor T. Johnson\",\"doi\":\"10.1109/ICAA52185.2022.00011\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"There are few technologies that hold as much promise in achieving safe, accessible, and convenient transportation as autonomous vehicles. However, as recent years have demonstrated, safety and reliability remain the most obstinate challenges, especially in complex domains. Autonomous racing has demonstrated unique benefits in that researchers can conduct research in controlled environments, allowing for experimentation with approaches that are too risky to evaluate on public roads. In this work, we compare two leading methods for training neural network controllers, Reinforcement Learning and Imitation Learning, for the autonomous racing task. We compare their viability by analyzing their performance and safety when deployed in novel scenarios outside their training via zero-shot policy transfer. Our evaluation is made up of a large number of experiments in simulation and on our real-world hardware platform that analyze whether these algorithms remain effective when transferred to the real-world. Our results show reinforcement learning outperforms imitation learning in most scenarios. However, the increased performance comes at the cost of reduced safety. Thus, both methods are effective under different criteria.\",\"PeriodicalId\":206047,\"journal\":{\"name\":\"2022 IEEE International Conference on Assured Autonomy (ICAA)\",\"volume\":\"29 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference on Assured Autonomy (ICAA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAA52185.2022.00011\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Assured Autonomy (ICAA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAA52185.2022.00011","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

摘要

在实现安全、便捷、便捷的交通方面，几乎没有什么技术能像自动驾驶汽车那样前途无量。然而，近年来的研究表明，安全性和可靠性仍然是最棘手的挑战，特别是在复杂的领域。自动驾驶赛车已经展示了独特的优势，因为研究人员可以在受控环境中进行研究，允许对风险太大而无法在公共道路上进行评估的方法进行实验。在这项工作中，我们比较了训练神经网络控制器的两种主要方法，强化学习和模仿学习，用于自主赛车任务。我们通过分析它们在训练之外的新场景中部署时的性能和安全性来比较它们的可行性。我们的评估是由大量的模拟实验和现实世界的硬件平台组成的，这些实验分析了这些算法在转移到现实世界时是否仍然有效。我们的结果表明，在大多数情况下，强化学习优于模仿学习。然而，性能的提高是以降低安全性为代价的。因此，两种方法在不同的标准下都是有效的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Zero-Shot Policy Transfer in Autonomous Racing: Reinforcement Learning vs Imitation Learning

There are few technologies that hold as much promise in achieving safe, accessible, and convenient transportation as autonomous vehicles. However, as recent years have demonstrated, safety and reliability remain the most obstinate challenges, especially in complex domains. Autonomous racing has demonstrated unique benefits in that researchers can conduct research in controlled environments, allowing for experimentation with approaches that are too risky to evaluate on public roads. In this work, we compare two leading methods for training neural network controllers, Reinforcement Learning and Imitation Learning, for the autonomous racing task. We compare their viability by analyzing their performance and safety when deployed in novel scenarios outside their training via zero-shot policy transfer. Our evaluation is made up of a large number of experiments in simulation and on our real-world hardware platform that analyze whether these algorithms remain effective when transferred to the real-world. Our results show reinforcement learning outperforms imitation learning in most scenarios. However, the increased performance comes at the cost of reduced safety. Thus, both methods are effective under different criteria.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 IEEE International Conference on Assured Autonomy (ICAA)

自引率

0.00%

发文量