{"title":"用于预测低速车辆碰撞损坏部件的机器学习","authors":"M. Koch, Hao Wang, Thomas Bäck","doi":"10.1109/ICDIM.2018.8846974","DOIUrl":null,"url":null,"abstract":"Using time series of on-board car data, this research focuses on predicting the damaged parts of a vehicle in a low speed crash by machine learning techniques. Based on a relatively small and class-imbalanced dataset, we present our automatic and for small datasets optimized method to use time series for machine learning. Based on 3982 extracted features, we are using feature selection algorithms to find the most significant ones for each component. We train random forest models per part with its most relevant set of features and optimize the hyper-parameters by different techniques. This so-called part-wise approach provides good insights into the model performance for each part and offers opportunities for optimizing the models. The final F1 prediction scores (reaching up to 94%) show the large potential of predicting damaged parts with on-board data only. Furthermore, for the worse performing parts of this small and imbalanced dataset, it indicates the potential for reaching good prediction scores when adding more training data. The utilization of such method offers great possibilities, e.g., in vehicle insurance processing for automatized settling of low speed crash damages.","PeriodicalId":120884,"journal":{"name":"2018 Thirteenth International Conference on Digital Information Management (ICDIM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Machine Learning for Predicting the Damaged Parts of a Low Speed Vehicle Crash\",\"authors\":\"M. Koch, Hao Wang, Thomas Bäck\",\"doi\":\"10.1109/ICDIM.2018.8846974\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Using time series of on-board car data, this research focuses on predicting the damaged parts of a vehicle in a low speed crash by machine learning techniques. Based on a relatively small and class-imbalanced dataset, we present our automatic and for small datasets optimized method to use time series for machine learning. Based on 3982 extracted features, we are using feature selection algorithms to find the most significant ones for each component. We train random forest models per part with its most relevant set of features and optimize the hyper-parameters by different techniques. This so-called part-wise approach provides good insights into the model performance for each part and offers opportunities for optimizing the models. The final F1 prediction scores (reaching up to 94%) show the large potential of predicting damaged parts with on-board data only. Furthermore, for the worse performing parts of this small and imbalanced dataset, it indicates the potential for reaching good prediction scores when adding more training data. The utilization of such method offers great possibilities, e.g., in vehicle insurance processing for automatized settling of low speed crash damages.\",\"PeriodicalId\":120884,\"journal\":{\"name\":\"2018 Thirteenth International Conference on Digital Information Management (ICDIM)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 Thirteenth International Conference on Digital Information Management (ICDIM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDIM.2018.8846974\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Thirteenth International Conference on Digital Information Management (ICDIM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDIM.2018.8846974","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Machine Learning for Predicting the Damaged Parts of a Low Speed Vehicle Crash
Using time series of on-board car data, this research focuses on predicting the damaged parts of a vehicle in a low speed crash by machine learning techniques. Based on a relatively small and class-imbalanced dataset, we present our automatic and for small datasets optimized method to use time series for machine learning. Based on 3982 extracted features, we are using feature selection algorithms to find the most significant ones for each component. We train random forest models per part with its most relevant set of features and optimize the hyper-parameters by different techniques. This so-called part-wise approach provides good insights into the model performance for each part and offers opportunities for optimizing the models. The final F1 prediction scores (reaching up to 94%) show the large potential of predicting damaged parts with on-board data only. Furthermore, for the worse performing parts of this small and imbalanced dataset, it indicates the potential for reaching good prediction scores when adding more training data. The utilization of such method offers great possibilities, e.g., in vehicle insurance processing for automatized settling of low speed crash damages.