A Data Annotation and Recognition Method Based on Zero Statistical Hypothesis Test and Multi Variable Binary Classification Theory

Zhan Ying, Chen Chao, Wang Lei, Shuai Zhao, Xianglei Zhu
{"title":"A Data Annotation and Recognition Method Based on Zero Statistical Hypothesis Test and Multi Variable Binary Classification Theory","authors":"Zhan Ying, Chen Chao, Wang Lei, Shuai Zhao, Xianglei Zhu","doi":"10.1109/RCAR54675.2022.9872218","DOIUrl":null,"url":null,"abstract":"Based on typical Chinese natural driving data, from natural driving scenario data collection to scenario automatic labeling and classification, this paper proposed a specific scenario automatic labeling and classification method by using statistical tools and machine learning methods. The front vehicle cut-in data of more than 4000 typical road scenarios in China are collected and extracted, and the parametric statistics and analysis are carried out for the relevant 6 variables. Considering the statistical uncertainty of the variables, the statistical exclusion curve of “normal front vehicle cut-in scenario” is calculated by using the hypothesis test method based on the principle of mathematical statistics, by comparing the distribution curve of any event with the statistical exclusion curve, the annotation of the front vehicle entry scenario data is realized. At the same time, using the positive and negative sample classification method of machine learning based on bagging decision tree classifier, the integrated learning classification method based on boosting decision tree, and the depth learning method based on improved resnet-18 convolution Network + LSTM recurrent neural network, the multi-variable binary classifiers are trained respectively to realize the classification task of the front vehicle cut-in scenario. Furthermore, comparing the three classification methods, the test results on the verification show that the BDT classifier has the best result, effectively realizes the classification tasks of “dangerous front vehicle cut-in scenario” and “normal front vehicle cut-in scenario”, and this technical tool chain can be reused in the fine-grained classification of other driving scenes in the future","PeriodicalId":304963,"journal":{"name":"2022 IEEE International Conference on Real-time Computing and Robotics (RCAR)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Real-time Computing and Robotics (RCAR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RCAR54675.2022.9872218","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Based on typical Chinese natural driving data, from natural driving scenario data collection to scenario automatic labeling and classification, this paper proposed a specific scenario automatic labeling and classification method by using statistical tools and machine learning methods. The front vehicle cut-in data of more than 4000 typical road scenarios in China are collected and extracted, and the parametric statistics and analysis are carried out for the relevant 6 variables. Considering the statistical uncertainty of the variables, the statistical exclusion curve of “normal front vehicle cut-in scenario” is calculated by using the hypothesis test method based on the principle of mathematical statistics, by comparing the distribution curve of any event with the statistical exclusion curve, the annotation of the front vehicle entry scenario data is realized. At the same time, using the positive and negative sample classification method of machine learning based on bagging decision tree classifier, the integrated learning classification method based on boosting decision tree, and the depth learning method based on improved resnet-18 convolution Network + LSTM recurrent neural network, the multi-variable binary classifiers are trained respectively to realize the classification task of the front vehicle cut-in scenario. Furthermore, comparing the three classification methods, the test results on the verification show that the BDT classifier has the best result, effectively realizes the classification tasks of “dangerous front vehicle cut-in scenario” and “normal front vehicle cut-in scenario”, and this technical tool chain can be reused in the fine-grained classification of other driving scenes in the future
基于零统计假设检验和多变量二分类理论的数据标注与识别方法
本文基于中国典型的自然驾驶数据,从自然驾驶场景数据采集到场景自动标注与分类,利用统计工具和机器学习方法,提出了一种具体的场景自动标注与分类方法。收集并提取了中国4000多个典型道路场景的前车切入数据,并对相关的6个变量进行了参数统计分析。考虑到变量的统计不确定性,采用基于数理统计原理的假设检验方法计算了“正常前车插车场景”的统计排除曲线,通过将任意事件的分布曲线与统计排除曲线进行比较,实现了前车插车场景数据的标注。同时,采用基于bagging决策树分类器的机器学习正负样本分类方法、基于boosting决策树的综合学习分类方法、基于改进型resnet-18卷积网络+ LSTM递归神经网络的深度学习方法,分别训练多变量二值分类器,实现前车切入场景的分类任务。此外,对比三种分类方法,验证上的测试结果表明,BDT分类器效果最好,有效实现了“危险前方车辆切入场景”和“正常前方车辆切入场景”的分类任务,该技术工具链可在未来其他驾驶场景的细粒度分类中重用
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信