Le Liu , Tao Pei , Zidong Fang , Xiaorui Yan , Chenglong Zheng , Xi Wang , Ci Song , Wenfei Luan , Jie Chen
{"title":"Extracting individual trajectories from text by fusing large language models with diverse knowledge","authors":"Le Liu , Tao Pei , Zidong Fang , Xiaorui Yan , Chenglong Zheng , Xi Wang , Ci Song , Wenfei Luan , Jie Chen","doi":"10.1016/j.jag.2025.104654","DOIUrl":null,"url":null,"abstract":"<div><div>Individual trajectories offer insights into human mobility, with data either passively recorded, such as GPS, or actively recorded, such as natural language text. While the former provides detailed movement data, it lacks important context such as personal experiences, which can be obtained from the latter. Extracting trajectories from text can enhance travel experience optimization, historical analysis, and pandemic management. However, existing trajectory extraction methods rely on rule-based frameworks that fail to capture contextual semantics, resulting in limited generalizability and loss of trajectory semantics. While general-purpose large language models (LLMs) demonstrate potential for contextual reasoning capabilities, their deficient domain-specific knowledge pertinent to trajectory patterns hinders efficient and precise trajectory extraction. To address these limitations, we propose T2TrajLLM, a novel framework that fuses LLMs with domain knowledge through three components: (1) a lightweight trajectory model for structured guidance, (2) a text-to-trajectory transformation model enabling multi-step reasoning, and (3) labelled text-trajectory samples for learning domain-adaptive constraint rules. Central to T2TrajLLM is a prompt method that dynamically fuses these components with LLMs while avoids rigid rule dependency. Evaluated across three heterogeneous datasets, T2TrajLLM achieves ∼8 % higher accuracy than existing methods, demonstrating strong transferability across datasets and extensibility to diverse application requirements. Overall, T2TrajLLM effectively extracts trajectories from diverse textual sources, providing robust support for the analysis and understanding of individual mobility.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"141 ","pages":"Article 104654"},"PeriodicalIF":8.6000,"publicationDate":"2025-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of applied earth observation and geoinformation : ITC journal","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1569843225003012","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"REMOTE SENSING","Score":null,"Total":0}
引用次数: 0
Abstract
Individual trajectories offer insights into human mobility, with data either passively recorded, such as GPS, or actively recorded, such as natural language text. While the former provides detailed movement data, it lacks important context such as personal experiences, which can be obtained from the latter. Extracting trajectories from text can enhance travel experience optimization, historical analysis, and pandemic management. However, existing trajectory extraction methods rely on rule-based frameworks that fail to capture contextual semantics, resulting in limited generalizability and loss of trajectory semantics. While general-purpose large language models (LLMs) demonstrate potential for contextual reasoning capabilities, their deficient domain-specific knowledge pertinent to trajectory patterns hinders efficient and precise trajectory extraction. To address these limitations, we propose T2TrajLLM, a novel framework that fuses LLMs with domain knowledge through three components: (1) a lightweight trajectory model for structured guidance, (2) a text-to-trajectory transformation model enabling multi-step reasoning, and (3) labelled text-trajectory samples for learning domain-adaptive constraint rules. Central to T2TrajLLM is a prompt method that dynamically fuses these components with LLMs while avoids rigid rule dependency. Evaluated across three heterogeneous datasets, T2TrajLLM achieves ∼8 % higher accuracy than existing methods, demonstrating strong transferability across datasets and extensibility to diverse application requirements. Overall, T2TrajLLM effectively extracts trajectories from diverse textual sources, providing robust support for the analysis and understanding of individual mobility.
期刊介绍:
The International Journal of Applied Earth Observation and Geoinformation publishes original papers that utilize earth observation data for natural resource and environmental inventory and management. These data primarily originate from remote sensing platforms, including satellites and aircraft, supplemented by surface and subsurface measurements. Addressing natural resources such as forests, agricultural land, soils, and water, as well as environmental concerns like biodiversity, land degradation, and hazards, the journal explores conceptual and data-driven approaches. It covers geoinformation themes like capturing, databasing, visualization, interpretation, data quality, and spatial uncertainty.