{"title":"基于注意机制的验证码识别","authors":"Yu Zheng","doi":"10.1145/3522749.3523077","DOIUrl":null,"url":null,"abstract":"Captcha recognition is a worthful work to study, since it does help Internet security and also promotes the field of pattern recognition. In this work, we concentrate on the attention mechanism to this classification task for ameliorating the function of our baseline network. In our experiment, we arbitrarily combine the two modules in CBAM and the coordinate attention module that is considered to be efficient and novel. Then we add this combined attention to our baseline network. From the test results, we see that the better attention for this task is CBAM (spatial first, then channel attention), which improves the recognition accuracy to about 0.8% based on the mini-dataset generated by ourselves.","PeriodicalId":361473,"journal":{"name":"Proceedings of the 6th International Conference on Control Engineering and Artificial Intelligence","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Captcha Recognition Based on Attention Mechanism\",\"authors\":\"Yu Zheng\",\"doi\":\"10.1145/3522749.3523077\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Captcha recognition is a worthful work to study, since it does help Internet security and also promotes the field of pattern recognition. In this work, we concentrate on the attention mechanism to this classification task for ameliorating the function of our baseline network. In our experiment, we arbitrarily combine the two modules in CBAM and the coordinate attention module that is considered to be efficient and novel. Then we add this combined attention to our baseline network. From the test results, we see that the better attention for this task is CBAM (spatial first, then channel attention), which improves the recognition accuracy to about 0.8% based on the mini-dataset generated by ourselves.\",\"PeriodicalId\":361473,\"journal\":{\"name\":\"Proceedings of the 6th International Conference on Control Engineering and Artificial Intelligence\",\"volume\":\"14 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-03-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 6th International Conference on Control Engineering and Artificial Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3522749.3523077\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 6th International Conference on Control Engineering and Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3522749.3523077","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
验证码识别是一项值得研究的工作,因为它有助于互联网安全,也促进了模式识别领域的发展。在这项工作中,我们专注于对该分类任务的注意机制,以改善我们的基线网络的功能。在我们的实验中,我们将CBAM中的两个模块和被认为是高效和新颖的坐标注意模块任意地结合在一起。然后我们将这种综合注意力添加到我们的基线网络中。从测试结果中可以看出,在我们自己生成的小数据集上,CBAM (spatial first, then channel attention)的识别准确率提高到了0.8%左右。
Captcha recognition is a worthful work to study, since it does help Internet security and also promotes the field of pattern recognition. In this work, we concentrate on the attention mechanism to this classification task for ameliorating the function of our baseline network. In our experiment, we arbitrarily combine the two modules in CBAM and the coordinate attention module that is considered to be efficient and novel. Then we add this combined attention to our baseline network. From the test results, we see that the better attention for this task is CBAM (spatial first, then channel attention), which improves the recognition accuracy to about 0.8% based on the mini-dataset generated by ourselves.