P. Umamaheswari, S. Ezhilarasi, P. Harish, B. Gowrishankar, S. Sanjiv
{"title":"利用深度学习技术设计基于文本的验证码破解和求解器","authors":"P. Umamaheswari, S. Ezhilarasi, P. Harish, B. Gowrishankar, S. Sanjiv","doi":"10.1109/ICADEE51157.2020.9368949","DOIUrl":null,"url":null,"abstract":"Text-based CAPTCHAs are most commonly used by various websites to distinguish between humans and computers. It is used as a security measure. This work consists of a dynamic approach that is proposed to predict Text-based CAPTCHAs that challenges the supposition that they cannot be solved by computers. Three types of CAPTCHAs namely Rotated, Noisy Arc, Complicated Background have been taken. The CAPTCHAs are pre-processed based on their type. Different pre-processing techniques like Erosion, Dilation, Binarization are used to remove the noise from the CAPTCHA. The pre-processed CAPTCHAs are then fed to the Convolutional neural network (CNN) which generates a feature vector. This feature vector is then passed to the long short term memory (LSTM) which generates a sequence of characters. This sequence is displayed as outcome to the user. The dataset for Rotated, Noisy Arc, Complicated Background CAPTCHAs consisted of 9,955, 1,070 and 1,000 images respectively. The model was also tested for CAPTCHAs involving a combination of different resistance mechanisms. The model was able to predict Rotated CAPTCHAs with an accuracy of 85.97%, Noisy Arc CAPTCHAs with an accuracy of 84.52% and Complicated BackgroundCAPTCHAswithanaccuracyof82.91 %.","PeriodicalId":202026,"journal":{"name":"2020 IEEE International Conference on Advances and Developments in Electrical and Electronics Engineering (ICADEE)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Designing a Text-based CAPTCHA Breaker and Solver by using Deep Learning Techniques\",\"authors\":\"P. Umamaheswari, S. Ezhilarasi, P. Harish, B. Gowrishankar, S. Sanjiv\",\"doi\":\"10.1109/ICADEE51157.2020.9368949\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Text-based CAPTCHAs are most commonly used by various websites to distinguish between humans and computers. It is used as a security measure. This work consists of a dynamic approach that is proposed to predict Text-based CAPTCHAs that challenges the supposition that they cannot be solved by computers. Three types of CAPTCHAs namely Rotated, Noisy Arc, Complicated Background have been taken. The CAPTCHAs are pre-processed based on their type. Different pre-processing techniques like Erosion, Dilation, Binarization are used to remove the noise from the CAPTCHA. The pre-processed CAPTCHAs are then fed to the Convolutional neural network (CNN) which generates a feature vector. This feature vector is then passed to the long short term memory (LSTM) which generates a sequence of characters. This sequence is displayed as outcome to the user. The dataset for Rotated, Noisy Arc, Complicated Background CAPTCHAs consisted of 9,955, 1,070 and 1,000 images respectively. The model was also tested for CAPTCHAs involving a combination of different resistance mechanisms. The model was able to predict Rotated CAPTCHAs with an accuracy of 85.97%, Noisy Arc CAPTCHAs with an accuracy of 84.52% and Complicated BackgroundCAPTCHAswithanaccuracyof82.91 %.\",\"PeriodicalId\":202026,\"journal\":{\"name\":\"2020 IEEE International Conference on Advances and Developments in Electrical and Electronics Engineering (ICADEE)\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE International Conference on Advances and Developments in Electrical and Electronics Engineering (ICADEE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICADEE51157.2020.9368949\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE International Conference on Advances and Developments in Electrical and Electronics Engineering (ICADEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICADEE51157.2020.9368949","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Designing a Text-based CAPTCHA Breaker and Solver by using Deep Learning Techniques
Text-based CAPTCHAs are most commonly used by various websites to distinguish between humans and computers. It is used as a security measure. This work consists of a dynamic approach that is proposed to predict Text-based CAPTCHAs that challenges the supposition that they cannot be solved by computers. Three types of CAPTCHAs namely Rotated, Noisy Arc, Complicated Background have been taken. The CAPTCHAs are pre-processed based on their type. Different pre-processing techniques like Erosion, Dilation, Binarization are used to remove the noise from the CAPTCHA. The pre-processed CAPTCHAs are then fed to the Convolutional neural network (CNN) which generates a feature vector. This feature vector is then passed to the long short term memory (LSTM) which generates a sequence of characters. This sequence is displayed as outcome to the user. The dataset for Rotated, Noisy Arc, Complicated Background CAPTCHAs consisted of 9,955, 1,070 and 1,000 images respectively. The model was also tested for CAPTCHAs involving a combination of different resistance mechanisms. The model was able to predict Rotated CAPTCHAs with an accuracy of 85.97%, Noisy Arc CAPTCHAs with an accuracy of 84.52% and Complicated BackgroundCAPTCHAswithanaccuracyof82.91 %.