Yet Another Text Captcha Solver: A Generative Adversarial Network Based Approach

Ye, G, Tang, Z, Fang, D et al. (5 more authors) (2018) Yet Another Text Captcha Solver: A Generative Adversarial Network Based Approach. In: Proceedings of the 2018 ACM SIGSAC Conference on Computer and Communications Security. CCS 2018: 25th ACM SIGSAC Conference on Computer and Communications Security, 15-19 Oct 2018, Toronto, Canada. ACM , pp. 332-348. ISBN 978-1-4503-5693-0

Abstract

Despite several attacks have been proposed, text-based CAPTCHAs are still being widely used as a security mechanism. One of the reasons for the pervasive use of text captchas is that many of the prior attacks are scheme-specific and require a labor-intensive and time-consuming process to construct. This means that a change in the captcha security features like a noisier background can simply invalid an earlier attack. This paper presents a generic, yet effective text captcha solver based on the generative adversarial network. Unlike prior machine-learning-based approaches that need a large volume of manually-labeled real captchas to learn an effective solver, our approach requires significantly fewer real captchas but yields much better performance. This is achieved by first learning a captcha synthesizer to automatically generate synthetic captchas to learn a base solver, and then fine-tuning the base solver on a small set of real captchas using transfer learning. We evaluate our approach by applying it to 33 captcha schemes, including 11 schemes that are currently being used by 32 of the top-50 popular websites including Microsoft, Wikipedia, eBay and Google. Our approach is the most capable attack on text captchas seen to date. It outperforms four state-of-the-art text-captcha solvers by not only delivering a significant higher accuracy on all testing schemes, but also successfully attacking schemes where others have zero chance. We show that our approach is highly efficient as it can solve a captcha within 0.05 second using a desktop GPU. We demonstrate that our attack is generally applicable because it can bypass the advanced security features employed by most modern text captcha schemes. We hope the results of our work can encourage the community to revisit the design and practical use of text captchas.

Metadata

Item Type:	Proceedings Paper
Authors/Creators:	Ye, G Tang, Z Fang, D Zhu, Z Feng, Y Xu, P Chen, X Wang, Z https://orcid.org/0000-0001-6157-0662
Copyright, Publisher and Additional Information:	© 2018 Association for Computing Machinery. This is an author produced version of a paper published in Proceedings of the 2018 ACM SIGSAC Conference on Computer and Communications Security. Uploaded in accordance with the publisher's self-archiving policy.
Dates:	Published: 8 October 2018 Published (online): 8 October 2018 Accepted: 24 July 2018
Institution:	The University of Leeds
Academic Units:	The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds)
Depositing User:	Symplectic Publications
Date Deposited:	01 Oct 2019 14:36
Last Modified:	11 Feb 2020 16:51
Status:	Published
Publisher:	ACM
Identification Number:	10.1145/3243734.3243754
Open Archives Initiative ID (OAI ID):	oai:eprints.whiterose.ac.uk:151526

CORE (COnnecting REpositories)

Yet Another Text Captcha Solver: A Generative Adversarial Network Based Approach

Abstract

Metadata

Download

Accepted Version

Export

Statistics