船舶燃油效率建模的数据融合和机器学习:第三部分-传感器数据和气象数据

IF 12.5 Q1 TRANSPORTATION
Yuquan Du , Yanyu Chen , Xiaohe Li , Alessandro Schönborn , Zhuo Sun
{"title":"船舶燃油效率建模的数据融合和机器学习:第三部分-传感器数据和气象数据","authors":"Yuquan Du ,&nbsp;Yanyu Chen ,&nbsp;Xiaohe Li ,&nbsp;Alessandro Schönborn ,&nbsp;Zhuo Sun","doi":"10.1016/j.commtr.2022.100072","DOIUrl":null,"url":null,"abstract":"<div><p>Sensors installed on a ship return high quality data that can be used for ship bunker fuel efficiency analysis. However, important information about weather and sea conditions the ship sails through, such as waves, sea currents, and sea water temperature, is often absent from sensor data. This study addresses this issue by fusing sensor data and publicly accessible meteorological data, constructing nine datasets accordingly, and experimenting with widely adopted machine learning (ML) models to quantify the relationship between a ship's fuel consumption rate (ton/day, or ton/h) and its voyage-based factors (sailing speed, draft, trim, weather conditions, and sea conditions). The best dataset found reveals the benefits of fusing sensor data and meteorological data for ship fuel consumption rate quantification. The best ML models found are consistent with our previous studies, including Extremely randomized trees (ET), Gradient Tree Boosting (GB) and XGBoost (XG). Given the best dataset from data fusion, their <em>R</em><sup>2</sup> values over the training set are 0.999 or 1.000, and their <em>R</em><sup>2</sup> values over the test set are all above 0.966. Their fit errors with RMSE values are below 0.75 ton/day, and with MAT below 0.52 ton/day. These promising results are well beyond the requirements of most industry applications for ship fuel efficiency analysis. The applicability of the selected datasets and ML models is also verified in a rolling horizon approach, resulting in a conjecture that a rolling horizon strategy of “5-month training + 1-month test/applicatoin” could work well in practice and sensor data of less than five months could be insufficient to train ML models.</p></div>","PeriodicalId":100292,"journal":{"name":"Communications in Transportation Research","volume":"2 ","pages":"Article 100072"},"PeriodicalIF":12.5000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772424722000221/pdfft?md5=c316fbf94739020173b1e22edeeb9aee&pid=1-s2.0-S2772424722000221-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Data fusion and machine learning for ship fuel efficiency modeling: Part III – Sensor data and meteorological data\",\"authors\":\"Yuquan Du ,&nbsp;Yanyu Chen ,&nbsp;Xiaohe Li ,&nbsp;Alessandro Schönborn ,&nbsp;Zhuo Sun\",\"doi\":\"10.1016/j.commtr.2022.100072\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Sensors installed on a ship return high quality data that can be used for ship bunker fuel efficiency analysis. However, important information about weather and sea conditions the ship sails through, such as waves, sea currents, and sea water temperature, is often absent from sensor data. This study addresses this issue by fusing sensor data and publicly accessible meteorological data, constructing nine datasets accordingly, and experimenting with widely adopted machine learning (ML) models to quantify the relationship between a ship's fuel consumption rate (ton/day, or ton/h) and its voyage-based factors (sailing speed, draft, trim, weather conditions, and sea conditions). The best dataset found reveals the benefits of fusing sensor data and meteorological data for ship fuel consumption rate quantification. The best ML models found are consistent with our previous studies, including Extremely randomized trees (ET), Gradient Tree Boosting (GB) and XGBoost (XG). Given the best dataset from data fusion, their <em>R</em><sup>2</sup> values over the training set are 0.999 or 1.000, and their <em>R</em><sup>2</sup> values over the test set are all above 0.966. Their fit errors with RMSE values are below 0.75 ton/day, and with MAT below 0.52 ton/day. These promising results are well beyond the requirements of most industry applications for ship fuel efficiency analysis. The applicability of the selected datasets and ML models is also verified in a rolling horizon approach, resulting in a conjecture that a rolling horizon strategy of “5-month training + 1-month test/applicatoin” could work well in practice and sensor data of less than five months could be insufficient to train ML models.</p></div>\",\"PeriodicalId\":100292,\"journal\":{\"name\":\"Communications in Transportation Research\",\"volume\":\"2 \",\"pages\":\"Article 100072\"},\"PeriodicalIF\":12.5000,\"publicationDate\":\"2022-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2772424722000221/pdfft?md5=c316fbf94739020173b1e22edeeb9aee&pid=1-s2.0-S2772424722000221-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Communications in Transportation Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2772424722000221\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"TRANSPORTATION\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications in Transportation Research","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772424722000221","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TRANSPORTATION","Score":null,"Total":0}
引用次数: 0

摘要

安装在船上的传感器返回高质量的数据,可用于船舶燃油效率分析。然而,关于天气和海况的重要信息,如波浪、海流和海水温度,通常不在传感器数据中。本研究通过融合传感器数据和可公开访问的气象数据来解决这个问题,相应地构建了9个数据集,并使用广泛采用的机器学习(ML)模型进行实验,以量化船舶的燃油消耗率(吨/天或吨/小时)与其基于航行的因素(航行速度、吃水、纵倾、天气条件和海况)之间的关系。发现的最佳数据集揭示了融合传感器数据和气象数据对船舶燃油消耗率量化的好处。发现的最佳ML模型与我们之前的研究一致,包括极端随机树(ET),梯度树增强(GB)和XGBoost (XG)。对于数据融合得到的最佳数据集,它们在训练集上的R2值分别为0.999或1.000,在测试集上的R2值都在0.966以上。它们与RMSE值的拟合误差小于0.75吨/天,与MAT的拟合误差小于0.52吨/天。这些有希望的结果远远超出了大多数工业应用对船舶燃油效率分析的要求。所选数据集和ML模型的适用性也在滚动水平方法中得到验证,由此推测“5个月训练+ 1个月测试/应用”的滚动水平策略在实践中可以很好地工作,少于5个月的传感器数据可能不足以训练ML模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Data fusion and machine learning for ship fuel efficiency modeling: Part III – Sensor data and meteorological data

Sensors installed on a ship return high quality data that can be used for ship bunker fuel efficiency analysis. However, important information about weather and sea conditions the ship sails through, such as waves, sea currents, and sea water temperature, is often absent from sensor data. This study addresses this issue by fusing sensor data and publicly accessible meteorological data, constructing nine datasets accordingly, and experimenting with widely adopted machine learning (ML) models to quantify the relationship between a ship's fuel consumption rate (ton/day, or ton/h) and its voyage-based factors (sailing speed, draft, trim, weather conditions, and sea conditions). The best dataset found reveals the benefits of fusing sensor data and meteorological data for ship fuel consumption rate quantification. The best ML models found are consistent with our previous studies, including Extremely randomized trees (ET), Gradient Tree Boosting (GB) and XGBoost (XG). Given the best dataset from data fusion, their R2 values over the training set are 0.999 or 1.000, and their R2 values over the test set are all above 0.966. Their fit errors with RMSE values are below 0.75 ton/day, and with MAT below 0.52 ton/day. These promising results are well beyond the requirements of most industry applications for ship fuel efficiency analysis. The applicability of the selected datasets and ML models is also verified in a rolling horizon approach, resulting in a conjecture that a rolling horizon strategy of “5-month training + 1-month test/applicatoin” could work well in practice and sensor data of less than five months could be insufficient to train ML models.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
15.20
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信