{"title":"Optimizing prediction models by considering different time granularity of features and target: Problem and solution","authors":"Ran Yan , Shuo Jiang , Kai Wang , Shuaian Wang","doi":"10.1016/j.trc.2025.105002","DOIUrl":null,"url":null,"abstract":"<div><div>In many prediction tasks, a common characteristic of training datasets is that features are more frequently updated than target, or in other words, the time granularity, or granularity for short, of features is smaller than that of target. One typical example is predicting ship fuel consumption in maritime transport. <em>Current practice</em> usually ignores such characteristic when developing the prediction models, which may jeopardize prediction accuracy and reliability. However, this issue is neither systematically discussed nor addressed in existing literature. To bridge this gap, this study aims to formally discuss the differences in the granularity of features and target as an ubiquitous issue in prediction problems. Then, an innovative <em>two-stage tree-based approach</em> that considers such differences by maximizing the usage of more frequently updated features is developed. We then go a step further to extend the proposed <em>two-stage tree-based approach</em> to predict accumulative target considering monotonicity and data generation process. Extended numerical experiments using simulated and real datasets in maritime and urban transportation are conducted to verify the superiority of the <em>two-stage tree-based approach</em> and its extension.</div></div>","PeriodicalId":54417,"journal":{"name":"Transportation Research Part C-Emerging Technologies","volume":"172 ","pages":"Article 105002"},"PeriodicalIF":7.6000,"publicationDate":"2025-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Research Part C-Emerging Technologies","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0968090X25000063","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TRANSPORTATION SCIENCE & TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
In many prediction tasks, a common characteristic of training datasets is that features are more frequently updated than target, or in other words, the time granularity, or granularity for short, of features is smaller than that of target. One typical example is predicting ship fuel consumption in maritime transport. Current practice usually ignores such characteristic when developing the prediction models, which may jeopardize prediction accuracy and reliability. However, this issue is neither systematically discussed nor addressed in existing literature. To bridge this gap, this study aims to formally discuss the differences in the granularity of features and target as an ubiquitous issue in prediction problems. Then, an innovative two-stage tree-based approach that considers such differences by maximizing the usage of more frequently updated features is developed. We then go a step further to extend the proposed two-stage tree-based approach to predict accumulative target considering monotonicity and data generation process. Extended numerical experiments using simulated and real datasets in maritime and urban transportation are conducted to verify the superiority of the two-stage tree-based approach and its extension.
期刊介绍:
Transportation Research: Part C (TR_C) is dedicated to showcasing high-quality, scholarly research that delves into the development, applications, and implications of transportation systems and emerging technologies. Our focus lies not solely on individual technologies, but rather on their broader implications for the planning, design, operation, control, maintenance, and rehabilitation of transportation systems, services, and components. In essence, the intellectual core of the journal revolves around the transportation aspect rather than the technology itself. We actively encourage the integration of quantitative methods from diverse fields such as operations research, control systems, complex networks, computer science, and artificial intelligence. Join us in exploring the intersection of transportation systems and emerging technologies to drive innovation and progress in the field.