{"title":"Intelligent Mineral Identification and Classification based on Vision Transformer","authors":"Xiaobo Cui, Cheng Peng, Hao Yang","doi":"10.1109/DSA56465.2022.00095","DOIUrl":null,"url":null,"abstract":"Image recognition technology in deep learning plays a vital role in many research fields. The intelligent recognition of mineral images based on deep learning technology brings new ideas for the development of traditional fields. It improves the efficiency of identification and brings some economic benefits. This paper proposes an intelligent mineral recognition classification model based on Vision Transformer. Firstly, More than 2000 images of twelve minerals, such as biotite, bornite, phoenix, and quartz, were collected. The data set was expanded by the data enhancement method, which was used to train and test the model, and the model's generalization ability was enhanced. Secondly, a self-attentive mechanism is introduced for feature extraction, and a new activation function is used to optimize the convergence speed of the model further. In the end, the accuracy of this model on the test set Top-1 reached 96.08 %, and the F1 score was 95.40 %. Compared with the network models such as ResNet50, VGG16, and DenseNet, the proposed model's recognition accuracy is higher, and the recognition stability is also better. According to the analysis of the experimental results, the pre-processing of the data also has a particular influence on the accuracy of the model, which provides an essential reference for the subsequent intelligent recognition and classification of minerals.","PeriodicalId":208148,"journal":{"name":"2022 9th International Conference on Dependable Systems and Their Applications (DSA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 9th International Conference on Dependable Systems and Their Applications (DSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DSA56465.2022.00095","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Image recognition technology in deep learning plays a vital role in many research fields. The intelligent recognition of mineral images based on deep learning technology brings new ideas for the development of traditional fields. It improves the efficiency of identification and brings some economic benefits. This paper proposes an intelligent mineral recognition classification model based on Vision Transformer. Firstly, More than 2000 images of twelve minerals, such as biotite, bornite, phoenix, and quartz, were collected. The data set was expanded by the data enhancement method, which was used to train and test the model, and the model's generalization ability was enhanced. Secondly, a self-attentive mechanism is introduced for feature extraction, and a new activation function is used to optimize the convergence speed of the model further. In the end, the accuracy of this model on the test set Top-1 reached 96.08 %, and the F1 score was 95.40 %. Compared with the network models such as ResNet50, VGG16, and DenseNet, the proposed model's recognition accuracy is higher, and the recognition stability is also better. According to the analysis of the experimental results, the pre-processing of the data also has a particular influence on the accuracy of the model, which provides an essential reference for the subsequent intelligent recognition and classification of minerals.