拟声词验证码的评价与发展

2018 16th Annual Conference on Privacy, Security and Trust (PST) Pub Date : 2018-08-01 DOI:10.1109/PST.2018.8514155

Michihiro Yamada, Riko Shigeno, Hiroaki Kikuchi, Maki Sakamoto

{"title":"拟声词验证码的评价与发展","authors":"Michihiro Yamada, Riko Shigeno, Hiroaki Kikuchi, Maki Sakamoto","doi":"10.1109/PST.2018.8514155","DOIUrl":null,"url":null,"abstract":"The Completely Automated Public Turing Test to Tell Computers and Humans Apart (CAPTCHA) is a type of challenge–response test used for avoiding malicious software during automated registration. It plays an important role in security because fraud with computer agents is serious. The requirements of CAPTCHAs are: 1) a human can easily solve a CAPTCHA, 2) a computer cannot solve a CAPTCHA, and 3) CAPTCHAs can be generated automatically. As deep learning technology has been developed, many of the existing CAPTCHAs were compromised and fail to satisfy condition (2). In this study, we propose a new “onomatopoeia CAPTCHA” that applies onomatopoeia; i.e., words containing sounds similar to the noises they describe. Humans usually understand onomatopoeia unconsciously and use it in daily conversation; thus, it is clearly easy for humans to solve. However, it is difficult for computers because the mechanisms to recognize onomatopoeia are not very clear even now [1]. One of the difficulties of CAPTCHA schemes is the lack of reliable accuracy metrics. Some of the existing works deal with successful rate defined as a fraction of correctly answered tests. However, if we modify schemes as more complicated so that condition 2) is satisfied, then it may be hard to be solved by human resulting failure of condition 1) . So, we need to balance the tradeoff of two conditions. To address the above issues of CAPTCHA scheme, we introduce two evaluation metrics, Human Acceptance Rate (HAR) and Machine Acceptance Rate (MAR), measured through comprehensive experiments. To balance both acceptance rates, we try to improve HAR with the five proposed schemes looking for the best scheme that allows humans solve CAPTCHA easily. Similarly, we attempt to reduce MAR as smaller as possible, that is, to make CAPTCHA unbreakable against attackers. Our experiment is evaluated by 63 Japanese and 63 foreigners participating from 16 countries. One of the proposed style of CAPTCHA is based on the Manga comics with onomatopoeia that may be recognized wide range of subject without suffering form language barrier and hence it helps to extend the coverage of users. Our contribution of this work is as follows. • A new CAPTCHA scheme using Onomatopoeia that is able to be synthesized from system. • A comprehensive evaluation of accuracy metrics with respects to both human and machine (HAR and MAR), with five styles of queries and five smart attackers. • An evaluation made be by a broad domain of subjects with distinct background knowledge in the world including 63 Japanese and 63 non-Japanese.","PeriodicalId":265506,"journal":{"name":"2018 16th Annual Conference on Privacy, Security and Trust (PST)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Evaluation and Development of Onomatopoeia CAPTCHAs\",\"authors\":\"Michihiro Yamada, Riko Shigeno, Hiroaki Kikuchi, Maki Sakamoto\",\"doi\":\"10.1109/PST.2018.8514155\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The Completely Automated Public Turing Test to Tell Computers and Humans Apart (CAPTCHA) is a type of challenge–response test used for avoiding malicious software during automated registration. It plays an important role in security because fraud with computer agents is serious. The requirements of CAPTCHAs are: 1) a human can easily solve a CAPTCHA, 2) a computer cannot solve a CAPTCHA, and 3) CAPTCHAs can be generated automatically. As deep learning technology has been developed, many of the existing CAPTCHAs were compromised and fail to satisfy condition (2). In this study, we propose a new “onomatopoeia CAPTCHA” that applies onomatopoeia; i.e., words containing sounds similar to the noises they describe. Humans usually understand onomatopoeia unconsciously and use it in daily conversation; thus, it is clearly easy for humans to solve. However, it is difficult for computers because the mechanisms to recognize onomatopoeia are not very clear even now [1]. One of the difficulties of CAPTCHA schemes is the lack of reliable accuracy metrics. Some of the existing works deal with successful rate defined as a fraction of correctly answered tests. However, if we modify schemes as more complicated so that condition 2) is satisfied, then it may be hard to be solved by human resulting failure of condition 1) . So, we need to balance the tradeoff of two conditions. To address the above issues of CAPTCHA scheme, we introduce two evaluation metrics, Human Acceptance Rate (HAR) and Machine Acceptance Rate (MAR), measured through comprehensive experiments. To balance both acceptance rates, we try to improve HAR with the five proposed schemes looking for the best scheme that allows humans solve CAPTCHA easily. Similarly, we attempt to reduce MAR as smaller as possible, that is, to make CAPTCHA unbreakable against attackers. Our experiment is evaluated by 63 Japanese and 63 foreigners participating from 16 countries. One of the proposed style of CAPTCHA is based on the Manga comics with onomatopoeia that may be recognized wide range of subject without suffering form language barrier and hence it helps to extend the coverage of users. Our contribution of this work is as follows. • A new CAPTCHA scheme using Onomatopoeia that is able to be synthesized from system. • A comprehensive evaluation of accuracy metrics with respects to both human and machine (HAR and MAR), with five styles of queries and five smart attackers. • An evaluation made be by a broad domain of subjects with distinct background knowledge in the world including 63 Japanese and 63 non-Japanese.\",\"PeriodicalId\":265506,\"journal\":{\"name\":\"2018 16th Annual Conference on Privacy, Security and Trust (PST)\",\"volume\":\"5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 16th Annual Conference on Privacy, Security and Trust (PST)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/PST.2018.8514155\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 16th Annual Conference on Privacy, Security and Trust (PST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PST.2018.8514155","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

完全自动化公共图灵测试来区分计算机和人类(CAPTCHA)是一种挑战响应测试，用于在自动注册期间避免恶意软件。它在安全方面起着重要的作用，因为计算机代理的欺诈是严重的。验证码的要求是:1)人类可以很容易地解决一个验证码，2)计算机不能解决一个验证码，3)验证码可以自动生成。随着深度学习技术的发展，许多现有的CAPTCHA被破坏，无法满足条件(2)。在本研究中，我们提出了一种新的“拟声CAPTCHA”，该CAPTCHA应用拟声词;例如，包含与它们所描述的噪音相似的声音的单词。人类通常在不知不觉中理解拟声词，并在日常会话中使用;因此，人类显然很容易解决这个问题。然而，对于计算机来说，识别拟声词是很困难的，因为即使到现在，识别拟声词的机制也不是很清楚[1]。CAPTCHA方案的难点之一是缺乏可靠的准确性度量。现有的一些工作处理的成功率定义为正确回答的测试的一部分。然而，如果我们将方案修改得更复杂，以满足条件2)，那么可能很难通过人为导致条件1)的失败来解决。所以，我们需要平衡两种情况的权衡。为了解决CAPTCHA方案的上述问题，我们引入了两个评估指标，人类接受率(HAR)和机器接受率(MAR)，通过综合实验测量。为了平衡两种接受率，我们尝试用五个提议的方案来改进HAR，寻找允许人类轻松解决CAPTCHA的最佳方案。类似地，我们尝试尽可能地减少MAR，也就是说，使CAPTCHA对攻击者坚不可摧。我们的实验由来自16个国家的63名日本人和63名外国人参与评估。建议的CAPTCHA风格之一是基于带有拟声词的漫画，可以识别广泛的主题，而不会遇到语言障碍，因此有助于扩大用户的覆盖范围。我们对这项工作的贡献如下。•一个新的CAPTCHA方案使用拟声词，能够从系统合成。•对人类和机器(HAR和MAR)的准确性指标进行全面评估，包括五种查询风格和五种智能攻击者。•由具有不同背景知识的广泛学科进行评估，其中包括63名日本人和63名非日本人。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Evaluation and Development of Onomatopoeia CAPTCHAs

The Completely Automated Public Turing Test to Tell Computers and Humans Apart (CAPTCHA) is a type of challenge–response test used for avoiding malicious software during automated registration. It plays an important role in security because fraud with computer agents is serious. The requirements of CAPTCHAs are: 1) a human can easily solve a CAPTCHA, 2) a computer cannot solve a CAPTCHA, and 3) CAPTCHAs can be generated automatically. As deep learning technology has been developed, many of the existing CAPTCHAs were compromised and fail to satisfy condition (2). In this study, we propose a new “onomatopoeia CAPTCHA” that applies onomatopoeia; i.e., words containing sounds similar to the noises they describe. Humans usually understand onomatopoeia unconsciously and use it in daily conversation; thus, it is clearly easy for humans to solve. However, it is difficult for computers because the mechanisms to recognize onomatopoeia are not very clear even now [1]. One of the difficulties of CAPTCHA schemes is the lack of reliable accuracy metrics. Some of the existing works deal with successful rate defined as a fraction of correctly answered tests. However, if we modify schemes as more complicated so that condition 2) is satisfied, then it may be hard to be solved by human resulting failure of condition 1) . So, we need to balance the tradeoff of two conditions. To address the above issues of CAPTCHA scheme, we introduce two evaluation metrics, Human Acceptance Rate (HAR) and Machine Acceptance Rate (MAR), measured through comprehensive experiments. To balance both acceptance rates, we try to improve HAR with the five proposed schemes looking for the best scheme that allows humans solve CAPTCHA easily. Similarly, we attempt to reduce MAR as smaller as possible, that is, to make CAPTCHA unbreakable against attackers. Our experiment is evaluated by 63 Japanese and 63 foreigners participating from 16 countries. One of the proposed style of CAPTCHA is based on the Manga comics with onomatopoeia that may be recognized wide range of subject without suffering form language barrier and hence it helps to extend the coverage of users. Our contribution of this work is as follows. • A new CAPTCHA scheme using Onomatopoeia that is able to be synthesized from system. • A comprehensive evaluation of accuracy metrics with respects to both human and machine (HAR and MAR), with five styles of queries and five smart attackers. • An evaluation made be by a broad domain of subjects with distinct background knowledge in the world including 63 Japanese and 63 non-Japanese.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2018 16th Annual Conference on Privacy, Security and Trust (PST)

自引率

0.00%

发文量