船舶燃油效率建模的数据融合和机器学习:第三部分-传感器数据和气象数据

IF 14.5 Q1 TRANSPORTATION

Communications in Transportation Research Pub Date : 2022-12-01 DOI:10.1016/j.commtr.2022.100072

Yuquan Du , Yanyu Chen , Xiaohe Li , Alessandro Schönborn , Zhuo Sun

{"title":"船舶燃油效率建模的数据融合和机器学习:第三部分-传感器数据和气象数据","authors":"Yuquan Du , Yanyu Chen , Xiaohe Li , Alessandro Schönborn , Zhuo Sun","doi":"10.1016/j.commtr.2022.100072","DOIUrl":null,"url":null,"abstract":"<div>Sensors installed on a ship return high quality data that can be used for ship bunker fuel efficiency analysis. However, important information about weather and sea conditions the ship sails through, such as waves, sea currents, and sea water temperature, is often absent from sensor data. This study addresses this issue by fusing sensor data and publicly accessible meteorological data, constructing nine datasets accordingly, and experimenting with widely adopted machine learning (ML) models to quantify the relationship between a ship's fuel consumption rate (ton/day, or ton/h) and its voyage-based factors (sailing speed, draft, trim, weather conditions, and sea conditions). The best dataset found reveals the benefits of fusing sensor data and meteorological data for ship fuel consumption rate quantification. The best ML models found are consistent with our previous studies, including Extremely randomized trees (ET), Gradient Tree Boosting (GB) and XGBoost (XG). Given the best dataset from data fusion, their R2 values over the training set are 0.999 or 1.000, and their R2 values over the test set are all above 0.966. Their fit errors with RMSE values are below 0.75 ton/day, and with MAT below 0.52 ton/day. These promising results are well beyond the requirements of most industry applications for ship fuel efficiency analysis. The applicability of the selected datasets and ML models is also verified in a rolling horizon approach, resulting in a conjecture that a rolling horizon strategy of “5-month training + 1-month test/applicatoin” could work well in practice and sensor data of less than five months could be insufficient to train ML models.</div>","PeriodicalId":100292,"journal":{"name":"Communications in Transportation Research","volume":"2 ","pages":"Article 100072"},"PeriodicalIF":14.5000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772424722000221/pdfft?md5=c316fbf94739020173b1e22edeeb9aee&pid=1-s2.0-S2772424722000221-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Data fusion and machine learning for ship fuel efficiency modeling: Part III – Sensor data and meteorological data\",\"authors\":\"Yuquan Du , Yanyu Chen , Xiaohe Li , Alessandro Schönborn , Zhuo Sun\",\"doi\":\"10.1016/j.commtr.2022.100072\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>Sensors installed on a ship return high quality data that can be used for ship bunker fuel efficiency analysis. However, important information about weather and sea conditions the ship sails through, such as waves, sea currents, and sea water temperature, is often absent from sensor data. This study addresses this issue by fusing sensor data and publicly accessible meteorological data, constructing nine datasets accordingly, and experimenting with widely adopted machine learning (ML) models to quantify the relationship between a ship's fuel consumption rate (ton/day, or ton/h) and its voyage-based factors (sailing speed, draft, trim, weather conditions, and sea conditions). The best dataset found reveals the benefits of fusing sensor data and meteorological data for ship fuel consumption rate quantification. The best ML models found are consistent with our previous studies, including Extremely randomized trees (ET), Gradient Tree Boosting (GB) and XGBoost (XG). Given the best dataset from data fusion, their R2 values over the training set are 0.999 or 1.000, and their R2 values over the test set are all above 0.966. Their fit errors with RMSE values are below 0.75 ton/day, and with MAT below 0.52 ton/day. These promising results are well beyond the requirements of most industry applications for ship fuel efficiency analysis. The applicability of the selected datasets and ML models is also verified in a rolling horizon approach, resulting in a conjecture that a rolling horizon strategy of “5-month training + 1-month test/applicatoin” could work well in practice and sensor data of less than five months could be insufficient to train ML models.</div>\",\"PeriodicalId\":100292,\"journal\":{\"name\":\"Communications in Transportation Research\",\"volume\":\"2 \",\"pages\":\"Article 100072\"},\"PeriodicalIF\":14.5000,\"publicationDate\":\"2022-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2772424722000221/pdfft?md5=c316fbf94739020173b1e22edeeb9aee&pid=1-s2.0-S2772424722000221-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Communications in Transportation Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2772424722000221\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"TRANSPORTATION\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications in Transportation Research","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772424722000221","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TRANSPORTATION","Score":null,"Total":0}

引用次数: 0

摘要

安装在船上的传感器返回高质量的数据，可用于船舶燃油效率分析。然而，关于天气和海况的重要信息，如波浪、海流和海水温度，通常不在传感器数据中。本研究通过融合传感器数据和可公开访问的气象数据来解决这个问题，相应地构建了9个数据集，并使用广泛采用的机器学习(ML)模型进行实验，以量化船舶的燃油消耗率(吨/天或吨/小时)与其基于航行的因素(航行速度、吃水、纵倾、天气条件和海况)之间的关系。发现的最佳数据集揭示了融合传感器数据和气象数据对船舶燃油消耗率量化的好处。发现的最佳ML模型与我们之前的研究一致，包括极端随机树(ET)，梯度树增强(GB)和XGBoost (XG)。对于数据融合得到的最佳数据集，它们在训练集上的R2值分别为0.999或1.000，在测试集上的R2值都在0.966以上。它们与RMSE值的拟合误差小于0.75吨/天，与MAT的拟合误差小于0.52吨/天。这些有希望的结果远远超出了大多数工业应用对船舶燃油效率分析的要求。所选数据集和ML模型的适用性也在滚动水平方法中得到验证，由此推测“5个月训练+ 1个月测试/应用”的滚动水平策略在实践中可以很好地工作，少于5个月的传感器数据可能不足以训练ML模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Data fusion and machine learning for ship fuel efficiency modeling: Part III – Sensor data and meteorological data

Sensors installed on a ship return high quality data that can be used for ship bunker fuel efficiency analysis. However, important information about weather and sea conditions the ship sails through, such as waves, sea currents, and sea water temperature, is often absent from sensor data. This study addresses this issue by fusing sensor data and publicly accessible meteorological data, constructing nine datasets accordingly, and experimenting with widely adopted machine learning (ML) models to quantify the relationship between a ship's fuel consumption rate (ton/day, or ton/h) and its voyage-based factors (sailing speed, draft, trim, weather conditions, and sea conditions). The best dataset found reveals the benefits of fusing sensor data and meteorological data for ship fuel consumption rate quantification. The best ML models found are consistent with our previous studies, including Extremely randomized trees (ET), Gradient Tree Boosting (GB) and XGBoost (XG). Given the best dataset from data fusion, their R² values over the training set are 0.999 or 1.000, and their R² values over the test set are all above 0.966. Their fit errors with RMSE values are below 0.75 ton/day, and with MAT below 0.52 ton/day. These promising results are well beyond the requirements of most industry applications for ship fuel efficiency analysis. The applicability of the selected datasets and ML models is also verified in a rolling horizon approach, resulting in a conjecture that a rolling horizon strategy of “5-month training + 1-month test/applicatoin” could work well in practice and sensor data of less than five months could be insufficient to train ML models.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Communications in Transportation Research

CiteScore

15.20

自引率

0.00%

发文量