{"title":"Intention-guided imitation learning methods under limited expert demonstration data","authors":"Yilin Liu, Xiangfeng Luo, Shaorong Xie","doi":"10.1016/j.knosys.2025.114455","DOIUrl":null,"url":null,"abstract":"<div><div>Imitation Learning has achieved significant results in various fields, such as robot control, autonomous driving, and unmanned vessel decision-making. This technology aims to mimic human behavior in specific tasks by learning the mapping between states and actions, enabling agents to execute tasks based on demonstrations. However, these methods rely on the acquisition of high-quality demonstration data, facing challenges such as difficulties in collecting expert samples, high costs, and low efficiency in policy learning. Particularly under limited sample conditions, imitation learning can easily get trapped in local optima, leading to lower success rates and accuracy in decision-making. Researchers have used data augmentation and transfer learning to tackle limited data. However, in complex scenarios, these methods are less effective due to a lack of domain-specific knowledge, which affects the interpretability of the model. To address these challenges, we propose an Intention-guided Imitation Learning method under limited expert demonstration data (ITIL), which extracts deep intent features from a small number of samples to enhance the agent’s understanding of the scene and improve the accuracy of the mapping from states to actions during Imitation Learning. Specifically, the core method consists of three modules: (1) Semantic Enhancement Module, which extracts spatiotemporal feature maps from a small number of raw trajectories to enrich the semantic information of expert data; (2) Intention Expression Module, which constructs an intention tree network to establish connections between different levels, effectively expressing and capturing expert intent; (3) Strategy Generation Module, which integrates the outputs of the first two modules as input to form efficient decision-making, creating a closed-loop architecture of cognitive understanding-knowledge expression-decision optimization. Experimental results show that our model outperforms baseline methods in navigation, capture, and formation tasks, with an average success rate improvement of approximately +6 % compared to the baseline method (ValueDICE).</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"330 ","pages":"Article 114455"},"PeriodicalIF":7.6000,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125014947","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Imitation Learning has achieved significant results in various fields, such as robot control, autonomous driving, and unmanned vessel decision-making. This technology aims to mimic human behavior in specific tasks by learning the mapping between states and actions, enabling agents to execute tasks based on demonstrations. However, these methods rely on the acquisition of high-quality demonstration data, facing challenges such as difficulties in collecting expert samples, high costs, and low efficiency in policy learning. Particularly under limited sample conditions, imitation learning can easily get trapped in local optima, leading to lower success rates and accuracy in decision-making. Researchers have used data augmentation and transfer learning to tackle limited data. However, in complex scenarios, these methods are less effective due to a lack of domain-specific knowledge, which affects the interpretability of the model. To address these challenges, we propose an Intention-guided Imitation Learning method under limited expert demonstration data (ITIL), which extracts deep intent features from a small number of samples to enhance the agent’s understanding of the scene and improve the accuracy of the mapping from states to actions during Imitation Learning. Specifically, the core method consists of three modules: (1) Semantic Enhancement Module, which extracts spatiotemporal feature maps from a small number of raw trajectories to enrich the semantic information of expert data; (2) Intention Expression Module, which constructs an intention tree network to establish connections between different levels, effectively expressing and capturing expert intent; (3) Strategy Generation Module, which integrates the outputs of the first two modules as input to form efficient decision-making, creating a closed-loop architecture of cognitive understanding-knowledge expression-decision optimization. Experimental results show that our model outperforms baseline methods in navigation, capture, and formation tasks, with an average success rate improvement of approximately +6 % compared to the baseline method (ValueDICE).
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.