An Optimal BDCNN ML Architecture for Car Make Model Prediction

人工智能技术学报(英文) Pub Date : 2023-08-28 DOI:10.37965/jait.2023.0268

Kriti Kashyap, Rohit Miri

{"title":"An Optimal BDCNN ML Architecture for Car Make Model Prediction","authors":"Kriti Kashyap, Rohit Miri","doi":"10.37965/jait.2023.0268","DOIUrl":null,"url":null,"abstract":"We take on the challenge of classifying car photos, from the most general car type to the precise make, model, and year of the vehicle for a given input. Analyzing pre-existing datasets, we find that the CompCars-SV are a great place to begin our classification project. We demonstrate that convolutional neural networks can obtain a classification accuracy of more than 90% on the most difficult task. Due to a skewed mix between training and testing, this impressive result isn't really typical of how people do in the actual world. Using an ML system for car detection, we automatically generate a vehicle-tight bounding box for each picture, which we disseminate to the full dataset together with the existing (but limited) type-level annotation. We have designed and implemented car classification algorithms to analyze this car dataset, two of which take advantage of the hierarchical nature of car annotations. According to our research, a more precise classification of car type at a finer resolution now achieves an accuracy of 99.25%. It serves as a baseline benchmark for future research. Focusing on \"vehicle\" tasks, this work intends to bring attention to the vision community's lack of attention to these tasks compared to other objects. The important reason getting higher accuracy is extraction of binary descriptor (BD) feature using edge detection before training the CNN. This step reduced the size of the car dataset; hence network took less time to get trained. From the result outcomes shown it is clear that the presented network architecture having 31 layers of 2d convolutional layer, batch normalization, maxpool, ReLU, fully connected layer and Softmax classifier layer, has given higher accuracy. Numerous relevant car-related issues and solutions have yet to be carefully examined and researched, according to our findings. Car model categorization, model verification, and attribute prognosis are just a few examples of how the dataset might be put to use.","PeriodicalId":70996,"journal":{"name":"人工智能技术学报(英文)","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"人工智能技术学报(英文)","FirstCategoryId":"1093","ListUrlMain":"https://doi.org/10.37965/jait.2023.0268","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

We take on the challenge of classifying car photos, from the most general car type to the precise make, model, and year of the vehicle for a given input. Analyzing pre-existing datasets, we find that the CompCars-SV are a great place to begin our classification project. We demonstrate that convolutional neural networks can obtain a classification accuracy of more than 90% on the most difficult task. Due to a skewed mix between training and testing, this impressive result isn't really typical of how people do in the actual world. Using an ML system for car detection, we automatically generate a vehicle-tight bounding box for each picture, which we disseminate to the full dataset together with the existing (but limited) type-level annotation. We have designed and implemented car classification algorithms to analyze this car dataset, two of which take advantage of the hierarchical nature of car annotations. According to our research, a more precise classification of car type at a finer resolution now achieves an accuracy of 99.25%. It serves as a baseline benchmark for future research. Focusing on "vehicle" tasks, this work intends to bring attention to the vision community's lack of attention to these tasks compared to other objects. The important reason getting higher accuracy is extraction of binary descriptor (BD) feature using edge detection before training the CNN. This step reduced the size of the car dataset; hence network took less time to get trained. From the result outcomes shown it is clear that the presented network architecture having 31 layers of 2d convolutional layer, batch normalization, maxpool, ReLU, fully connected layer and Softmax classifier layer, has given higher accuracy. Numerous relevant car-related issues and solutions have yet to be carefully examined and researched, according to our findings. Car model categorization, model verification, and attribute prognosis are just a few examples of how the dataset might be put to use.

查看原文本刊更多论文

一种用于汽车制造模型预测的最佳BDCNN ML体系结构

我们面临着对汽车照片进行分类的挑战，从最普通的汽车类型到给定输入的汽车的精确品牌、型号和年份。通过分析已有的数据集，我们发现CompCars SV是开始我们分类项目的好地方。我们证明了卷积神经网络在最困难的任务上可以获得90%以上的分类准确率。由于训练和测试之间的不平衡，这个令人印象深刻的结果并不是人们在现实世界中的典型表现。使用用于汽车检测的ML系统，我们自动为每张图片生成一个车辆紧密边界框，并将其与现有（但有限）的类型级别注释一起传播到整个数据集。我们设计并实现了汽车分类算法来分析这个汽车数据集，其中两个算法利用了汽车注释的层次性。根据我们的研究，以更精细的分辨率对车型进行更精确的分类，现在的准确率达到了99.25%。这是未来研究的基准。这项工作专注于“车辆”任务，旨在引起视觉界对这些任务与其他对象相比缺乏关注的关注。获得较高精度的重要原因是在训练CNN之前使用边缘检测来提取二进制描述符（BD）特征。这一步骤减少了汽车数据集的大小；因此，网络训练所花费的时间更少。从结果可以清楚地看出，所提出的具有31层2d卷积层、批量归一化、maxpool、ReLU、全连接层和Softmax分类器层的网络架构具有更高的精度。根据我们的研究结果，许多与汽车相关的问题和解决方案尚待仔细检查和研究。汽车模型分类、模型验证和属性预测只是如何使用数据集的几个例子。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

人工智能技术学报(英文)

CiteScore

8.70

自引率

0.00%

发文量