{"title":"利用自适应感知引导变压器进行多代理轨迹预测","authors":"Ngan Linh Nguyen, Myungsik Yoo","doi":"10.1049/itr2.12502","DOIUrl":null,"url":null,"abstract":"<p>The ability to predict the trajectory of an autonomous vehicle accurately is crucial for safe and efficient navigation. However, predicting diverse and multimodal futures can be challenging. Recent approaches such as attention and graph neural networks have achieved state-of-the-art performance by considering agent interactions and map contexts. This study focused on multi-agent prediction using an agent-centric approach with transformers. This enables parallel computation and a comprehensive understanding of the environment. Two main features are introduced: an adaptive receptive field (ARF) that captures the relevant surroundings for each agent, and perception encoding, which serves as spatial context embeddings. The ARF adapts to the agent's velocity and rotation, focusing attention ahead at high speeds or to the sides when it is slower. Perception encoding divides agents or lanes into levels and encodes the information of each level. This approach enables the efficient encoding of complex spatial relationships. The proposed method combines these advances with transformer modelling for multi-agent trajectory prediction while ensuring real-time prediction capabilities. The approach is evaluated on the Argoverse benchmark and better performance than the state-of-the-art baseline is achieved. By addressing challenges such as multimodal outputs and robustness, the study enhances the safety and efficiency of autonomous driving systems by more accurately predicting trajectories.</p>","PeriodicalId":50381,"journal":{"name":"IET Intelligent Transport Systems","volume":"18 7","pages":"1196-1209"},"PeriodicalIF":2.3000,"publicationDate":"2024-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/itr2.12502","citationCount":"0","resultStr":"{\"title\":\"Multi-agent trajectory prediction with adaptive perception-guided transformers\",\"authors\":\"Ngan Linh Nguyen, Myungsik Yoo\",\"doi\":\"10.1049/itr2.12502\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>The ability to predict the trajectory of an autonomous vehicle accurately is crucial for safe and efficient navigation. However, predicting diverse and multimodal futures can be challenging. Recent approaches such as attention and graph neural networks have achieved state-of-the-art performance by considering agent interactions and map contexts. This study focused on multi-agent prediction using an agent-centric approach with transformers. This enables parallel computation and a comprehensive understanding of the environment. Two main features are introduced: an adaptive receptive field (ARF) that captures the relevant surroundings for each agent, and perception encoding, which serves as spatial context embeddings. The ARF adapts to the agent's velocity and rotation, focusing attention ahead at high speeds or to the sides when it is slower. Perception encoding divides agents or lanes into levels and encodes the information of each level. This approach enables the efficient encoding of complex spatial relationships. The proposed method combines these advances with transformer modelling for multi-agent trajectory prediction while ensuring real-time prediction capabilities. The approach is evaluated on the Argoverse benchmark and better performance than the state-of-the-art baseline is achieved. By addressing challenges such as multimodal outputs and robustness, the study enhances the safety and efficiency of autonomous driving systems by more accurately predicting trajectories.</p>\",\"PeriodicalId\":50381,\"journal\":{\"name\":\"IET Intelligent Transport Systems\",\"volume\":\"18 7\",\"pages\":\"1196-1209\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2024-03-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1049/itr2.12502\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IET Intelligent Transport Systems\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1049/itr2.12502\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Intelligent Transport Systems","FirstCategoryId":"5","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/itr2.12502","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Multi-agent trajectory prediction with adaptive perception-guided transformers
The ability to predict the trajectory of an autonomous vehicle accurately is crucial for safe and efficient navigation. However, predicting diverse and multimodal futures can be challenging. Recent approaches such as attention and graph neural networks have achieved state-of-the-art performance by considering agent interactions and map contexts. This study focused on multi-agent prediction using an agent-centric approach with transformers. This enables parallel computation and a comprehensive understanding of the environment. Two main features are introduced: an adaptive receptive field (ARF) that captures the relevant surroundings for each agent, and perception encoding, which serves as spatial context embeddings. The ARF adapts to the agent's velocity and rotation, focusing attention ahead at high speeds or to the sides when it is slower. Perception encoding divides agents or lanes into levels and encodes the information of each level. This approach enables the efficient encoding of complex spatial relationships. The proposed method combines these advances with transformer modelling for multi-agent trajectory prediction while ensuring real-time prediction capabilities. The approach is evaluated on the Argoverse benchmark and better performance than the state-of-the-art baseline is achieved. By addressing challenges such as multimodal outputs and robustness, the study enhances the safety and efficiency of autonomous driving systems by more accurately predicting trajectories.
期刊介绍:
IET Intelligent Transport Systems is an interdisciplinary journal devoted to research into the practical applications of ITS and infrastructures. The scope of the journal includes the following:
Sustainable traffic solutions
Deployments with enabling technologies
Pervasive monitoring
Applications; demonstrations and evaluation
Economic and behavioural analyses of ITS services and scenario
Data Integration and analytics
Information collection and processing; image processing applications in ITS
ITS aspects of electric vehicles
Autonomous vehicles; connected vehicle systems;
In-vehicle ITS, safety and vulnerable road user aspects
Mobility as a service systems
Traffic management and control
Public transport systems technologies
Fleet and public transport logistics
Emergency and incident management
Demand management and electronic payment systems
Traffic related air pollution management
Policy and institutional issues
Interoperability, standards and architectures
Funding scenarios
Enforcement
Human machine interaction
Education, training and outreach
Current Special Issue Call for papers:
Intelligent Transportation Systems in Smart Cities for Sustainable Environment - https://digital-library.theiet.org/files/IET_ITS_CFP_ITSSCSE.pdf
Sustainably Intelligent Mobility (SIM) - https://digital-library.theiet.org/files/IET_ITS_CFP_SIM.pdf
Traffic Theory and Modelling in the Era of Artificial Intelligence and Big Data (in collaboration with World Congress for Transport Research, WCTR 2019) - https://digital-library.theiet.org/files/IET_ITS_CFP_WCTR.pdf