Text-based CAPTCHA Vulnerability Assessment using a Deep Learning-based Solver

2021 IEEE Fifth Ecuador Technical Chapters Meeting (ETCM), Pub Date: 2021-10-12, DOI: 10.1109/ETCM53643.2021.9590750

Daniel Aguilar, Daniel Riofrío, D. Benítez, Noel Pérez, Ricardo Flores Moyano

Citations: 0

Abstract

This work tests the security offered by text-based CAPTCHAs. We present different types of CAPTCHAs, along with a preprocessing and segmentation process that removes noise from CAPTCHA images and crops each digit or character into its own image. We present a convolutional neural network architecture trained under several hyperparameter configurations, comparing multiple models with different batch sizes, numbers of epochs, and optimizers. We confirm that text-based CAPTCHAs are no longer a secure protection mechanism, because they can be broken with simple computer vision techniques and current machine learning algorithms. Our model achieved 90.49% accuracy when trained on a mix of four datasets and up to 97.10% when trained on a single dataset, which is enough to consider these schemes insecure in practice.
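
To make the preprocessing and segmentation step more concrete, here is a minimal sketch in plain Python. The abstract does not say which techniques the authors used, so everything in it is an assumption made for illustration: the fixed threshold, the vertical projection profile used to find character boundaries, and all function and parameter names (`binarize`, `column_profile`, `find_segments`, `crop_characters`, `min_ink`, `min_width`).

```python
from typing import List, Tuple

# A grayscale image as rows of pixel values, 0 (black) to 255 (white).
Image = List[List[int]]


def binarize(image: Image, threshold: int = 128) -> Image:
    """Map dark pixels (ink) to 1 and light pixels (background) to 0.

    The fixed threshold is an assumption; real CAPTCHAs may need an
    adaptive threshold instead.
    """
    return [[1 if px < threshold else 0 for px in row] for row in image]


def column_profile(binary: Image) -> List[int]:
    """Count the ink pixels in each column (vertical projection)."""
    return [sum(col) for col in zip(*binary)]


def find_segments(
    profile: List[int], min_ink: int = 1, min_width: int = 1
) -> List[Tuple[int, int]]:
    """Return (start, end) column ranges, end exclusive, that contain ink.

    Columns with fewer than `min_ink` ink pixels count as gaps, and runs
    narrower than `min_width` are dropped as noise specks.
    """
    segments: List[Tuple[int, int]] = []
    start = None
    for x, count in enumerate(profile):
        if count >= min_ink and start is None:
            start = x  # a new run of ink columns begins
        elif count < min_ink and start is not None:
            if x - start >= min_width:
                segments.append((start, x))
            start = None  # the run ended at a gap column
    # Close a run that reaches the right edge of the image.
    if start is not None and len(profile) - start >= min_width:
        segments.append((start, len(profile)))
    return segments


def crop_characters(
    image: Image, threshold: int = 128, min_ink: int = 1, min_width: int = 1
) -> List[Image]:
    """Binarize the image and return one binary crop per detected segment."""
    binary = binarize(image, threshold)
    segments = find_segments(column_profile(binary), min_ink, min_width)
    return [[row[s:e] for row in binary] for s, e in segments]
```

Each crop returned by `crop_characters` would then be resized and passed to a classifier such as the convolutional network described above. Projection-based splitting only works when characters are separated by background columns; overlapping or heavily distorted characters would need a different segmentation strategy.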


