Wenyang Wang , Yuping Luo , Yuqiang Xu , Danzhu Liu , Jibin Zhou , Peng Shao
{"title":"SPPformer:一个基于变压器的模型,具有稀疏关注机制,用于全面和可解释的船舶价格分析","authors":"Wenyang Wang , Yuping Luo , Yuqiang Xu , Danzhu Liu , Jibin Zhou , Peng Shao","doi":"10.1016/j.tre.2025.104136","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate prediction of vessel prices across multiple ship types is crucial for providing scientific decision-making support to shipping enterprises, investment institutions, and policymakers. However, traditional statistical methods and machine learning models face significant limitations in accuracy, generalization, applicability, interpretability, and data coverage, rendering them inadequate for high-precision forecasting in the complex shipping market. Inspired by the successful application of Transformer-based models like ChatGPT across various domains, this study proposes a novel ship price prediction model based on the Transformer architecture—SPPformer. The model integrates architectural optimization, pre-training, and fine-tuning techniques to enable comprehensive price forecasting for multiple ship types. On the data front, this paper consolidated over one million data variables from 12 maritime-related domains to pre-train the model, forming a Basic Model with foundational time-series processing capabilities. Subsequently, domain-specific maritime data were incorporated through fine-tuning to develop the Sectional Model, enhancing its specialization in the shipping sector. For interpretability, the SHAP method was embedded into the SPPformer prediction framework to visualize the impact of feature variables on target variables during the forecasting process. In terms of efficiency, a sparse attention mechanism was introduced by combining Atrous Self-Attention and Local Self-Attention, replacing the global attention mechanism of traditional Transformer, thereby significantly improving training efficiency and solving overfitting issues. The empirical study focused on predicting newbuilding and secondhand vessel prices for dry bulk, container, tanker, and the overall shipping market. The results demonstrate that the SPPformer model outperforms traditional ones in accuracy and interpretability. At the same time, the introduction of sparse attention reduces training time and memory usage by 22.91 % and 26.12 %, respectively, compared to global attention mechanisms. This research provides essential references for shipping enterprises to enhance economic efficiency and for financial institutions to manage risks. It also offers data-driven support and analytical frameworks for governments to regulate market order and promote the stable development of the shipping industry.</div></div>","PeriodicalId":49418,"journal":{"name":"Transportation Research Part E-Logistics and Transportation Review","volume":"199 ","pages":"Article 104136"},"PeriodicalIF":8.3000,"publicationDate":"2025-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SPPformer: A transformer-based model with a sparse attention mechanism for comprehensive and interpretable ship price analysis\",\"authors\":\"Wenyang Wang , Yuping Luo , Yuqiang Xu , Danzhu Liu , Jibin Zhou , Peng Shao\",\"doi\":\"10.1016/j.tre.2025.104136\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Accurate prediction of vessel prices across multiple ship types is crucial for providing scientific decision-making support to shipping enterprises, investment institutions, and policymakers. However, traditional statistical methods and machine learning models face significant limitations in accuracy, generalization, applicability, interpretability, and data coverage, rendering them inadequate for high-precision forecasting in the complex shipping market. Inspired by the successful application of Transformer-based models like ChatGPT across various domains, this study proposes a novel ship price prediction model based on the Transformer architecture—SPPformer. The model integrates architectural optimization, pre-training, and fine-tuning techniques to enable comprehensive price forecasting for multiple ship types. On the data front, this paper consolidated over one million data variables from 12 maritime-related domains to pre-train the model, forming a Basic Model with foundational time-series processing capabilities. Subsequently, domain-specific maritime data were incorporated through fine-tuning to develop the Sectional Model, enhancing its specialization in the shipping sector. For interpretability, the SHAP method was embedded into the SPPformer prediction framework to visualize the impact of feature variables on target variables during the forecasting process. In terms of efficiency, a sparse attention mechanism was introduced by combining Atrous Self-Attention and Local Self-Attention, replacing the global attention mechanism of traditional Transformer, thereby significantly improving training efficiency and solving overfitting issues. The empirical study focused on predicting newbuilding and secondhand vessel prices for dry bulk, container, tanker, and the overall shipping market. The results demonstrate that the SPPformer model outperforms traditional ones in accuracy and interpretability. At the same time, the introduction of sparse attention reduces training time and memory usage by 22.91 % and 26.12 %, respectively, compared to global attention mechanisms. This research provides essential references for shipping enterprises to enhance economic efficiency and for financial institutions to manage risks. It also offers data-driven support and analytical frameworks for governments to regulate market order and promote the stable development of the shipping industry.</div></div>\",\"PeriodicalId\":49418,\"journal\":{\"name\":\"Transportation Research Part E-Logistics and Transportation Review\",\"volume\":\"199 \",\"pages\":\"Article 104136\"},\"PeriodicalIF\":8.3000,\"publicationDate\":\"2025-04-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Transportation Research Part E-Logistics and Transportation Review\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1366554525001772\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ECONOMICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Research Part E-Logistics and Transportation Review","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1366554525001772","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}
SPPformer: A transformer-based model with a sparse attention mechanism for comprehensive and interpretable ship price analysis
Accurate prediction of vessel prices across multiple ship types is crucial for providing scientific decision-making support to shipping enterprises, investment institutions, and policymakers. However, traditional statistical methods and machine learning models face significant limitations in accuracy, generalization, applicability, interpretability, and data coverage, rendering them inadequate for high-precision forecasting in the complex shipping market. Inspired by the successful application of Transformer-based models like ChatGPT across various domains, this study proposes a novel ship price prediction model based on the Transformer architecture—SPPformer. The model integrates architectural optimization, pre-training, and fine-tuning techniques to enable comprehensive price forecasting for multiple ship types. On the data front, this paper consolidated over one million data variables from 12 maritime-related domains to pre-train the model, forming a Basic Model with foundational time-series processing capabilities. Subsequently, domain-specific maritime data were incorporated through fine-tuning to develop the Sectional Model, enhancing its specialization in the shipping sector. For interpretability, the SHAP method was embedded into the SPPformer prediction framework to visualize the impact of feature variables on target variables during the forecasting process. In terms of efficiency, a sparse attention mechanism was introduced by combining Atrous Self-Attention and Local Self-Attention, replacing the global attention mechanism of traditional Transformer, thereby significantly improving training efficiency and solving overfitting issues. The empirical study focused on predicting newbuilding and secondhand vessel prices for dry bulk, container, tanker, and the overall shipping market. The results demonstrate that the SPPformer model outperforms traditional ones in accuracy and interpretability. At the same time, the introduction of sparse attention reduces training time and memory usage by 22.91 % and 26.12 %, respectively, compared to global attention mechanisms. This research provides essential references for shipping enterprises to enhance economic efficiency and for financial institutions to manage risks. It also offers data-driven support and analytical frameworks for governments to regulate market order and promote the stable development of the shipping industry.
期刊介绍:
Transportation Research Part E: Logistics and Transportation Review is a reputable journal that publishes high-quality articles covering a wide range of topics in the field of logistics and transportation research. The journal welcomes submissions on various subjects, including transport economics, transport infrastructure and investment appraisal, evaluation of public policies related to transportation, empirical and analytical studies of logistics management practices and performance, logistics and operations models, and logistics and supply chain management.
Part E aims to provide informative and well-researched articles that contribute to the understanding and advancement of the field. The content of the journal is complementary to other prestigious journals in transportation research, such as Transportation Research Part A: Policy and Practice, Part B: Methodological, Part C: Emerging Technologies, Part D: Transport and Environment, and Part F: Traffic Psychology and Behaviour. Together, these journals form a comprehensive and cohesive reference for current research in transportation science.