摘要:自动驾驶汽车的安全保证偏好学习方法

Proceedings of the 26th ACM International Conference on Hybrid Systems: Computation and Control Pub Date : 2023-05-09 DOI:10.1145/3575870.3589549

Ruya Karagulle, N. Aréchiga, A. Best, Jonathan DeCastro, N. Ozay

{"title":"摘要:自动驾驶汽车的安全保证偏好学习方法","authors":"Ruya Karagulle, N. Aréchiga, A. Best, Jonathan DeCastro, N. Ozay","doi":"10.1145/3575870.3589549","DOIUrl":null,"url":null,"abstract":"In this work, we propose a safety-guaranteed personalization for autonomous vehicles by incorporating Signal Temporal Logic (STL) into preference learning problem. We propose a new variant of STL called Parametric Weighted Signal Temporal Logic with a new quantitative semantics, namely weighted robustness. Given a set of pairwise preferences, and by using gradient-based optimization methods, we learn a set of valuations for weights that reflect preferences such that preferred ones have greater weighted robustness value than their non-preferred matches. Traditional STL formulas fail to incorporate preferences due its complex nature. Our initial results with data from a human-subject on an intersection with stop sign driving scenario, in which the participant is asked their preferred driving behavior from pairs of vehicle trajectories, indicate that we can learn a new weighted STL formula that captures preferences while also encoding correctness.","PeriodicalId":426801,"journal":{"name":"Proceedings of the 26th ACM International Conference on Hybrid Systems: Computation and Control","volume":"115 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Poster Abstract: Safety Guaranteed Preference Learning Approach for Autonomous Vehicles\",\"authors\":\"Ruya Karagulle, N. Aréchiga, A. Best, Jonathan DeCastro, N. Ozay\",\"doi\":\"10.1145/3575870.3589549\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this work, we propose a safety-guaranteed personalization for autonomous vehicles by incorporating Signal Temporal Logic (STL) into preference learning problem. We propose a new variant of STL called Parametric Weighted Signal Temporal Logic with a new quantitative semantics, namely weighted robustness. Given a set of pairwise preferences, and by using gradient-based optimization methods, we learn a set of valuations for weights that reflect preferences such that preferred ones have greater weighted robustness value than their non-preferred matches. Traditional STL formulas fail to incorporate preferences due its complex nature. Our initial results with data from a human-subject on an intersection with stop sign driving scenario, in which the participant is asked their preferred driving behavior from pairs of vehicle trajectories, indicate that we can learn a new weighted STL formula that captures preferences while also encoding correctness.\",\"PeriodicalId\":426801,\"journal\":{\"name\":\"Proceedings of the 26th ACM International Conference on Hybrid Systems: Computation and Control\",\"volume\":\"115 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 26th ACM International Conference on Hybrid Systems: Computation and Control\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3575870.3589549\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 26th ACM International Conference on Hybrid Systems: Computation and Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3575870.3589549","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

在这项工作中，我们通过将信号时间逻辑(STL)纳入偏好学习问题，提出了一种安全保证的自动驾驶汽车个性化方案。我们提出了STL的一种新的变体，称为参数加权信号时间逻辑，它具有新的量化语义，即加权鲁棒性。给定一组成对偏好，通过使用基于梯度的优化方法，我们学习了一组反映偏好的权重估值，使得首选的权重鲁棒性值比非首选匹配值更大。传统的STL公式由于其复杂性而无法包含首选项。我们的初步结果来自一个人类受试者在一个有停车标志的十字路口驾驶场景的数据，在这个场景中，参与者被问及他们对车辆轨迹的偏好驾驶行为，表明我们可以学习一个新的加权STL公式，在捕获偏好的同时编码正确性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Poster Abstract: Safety Guaranteed Preference Learning Approach for Autonomous Vehicles

In this work, we propose a safety-guaranteed personalization for autonomous vehicles by incorporating Signal Temporal Logic (STL) into preference learning problem. We propose a new variant of STL called Parametric Weighted Signal Temporal Logic with a new quantitative semantics, namely weighted robustness. Given a set of pairwise preferences, and by using gradient-based optimization methods, we learn a set of valuations for weights that reflect preferences such that preferred ones have greater weighted robustness value than their non-preferred matches. Traditional STL formulas fail to incorporate preferences due its complex nature. Our initial results with data from a human-subject on an intersection with stop sign driving scenario, in which the participant is asked their preferred driving behavior from pairs of vehicle trajectories, indicate that we can learn a new weighted STL formula that captures preferences while also encoding correctness.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 26th ACM International Conference on Hybrid Systems: Computation and Control

自引率

0.00%

发文量