Solving Avatar Captchas Automatically

Mohammed Korayem, Abdallah A. Mohamed, David Crandall and Roman V. Yampolskiy

Captchas are challenge-response tests used in many online systems to prevent attacks by automated bots. Avatar Captchas are a recently-proposed variant in which users are asked to classify between human faces and computer generated avatar faces, and have been shown to be secure if bots employ random guessing. We test a variety of modern object recognition and machine learning approaches on the problem of avatar versus human face classification. Our results show that using these techniques, a bot can successfully solve Avatar Captchas as often as humans can. These experiments suggest that this high performance is caused more by biases in the facial datasets used by Avatar Captchas and not by a fundamental flaw in the concept itself, but nevertheless our results highlight the difficulty in creating Captcha tasks that are immune to automatic solution.

ROC curves for the human versus avatar classification task. Top left: Naive Bayes classifiers, Top right: feature selection and Naive Bayes, Bottom row: LibLinear classifiers.

Papers and presentations

BibTeX entries:

@inproceedings{facecaptcha2012amlta,
    author = {Mohammed Korayem and Abdallah A. Mohamed and David Crandall and Roman V. Yampolskiy},
    title = {Solving {A}vatar {C}aptchas automatically},
    booktitle = {International Conference on Advanced Machine Learning Technologies and Applications},
    year = {2012}
}

@inproceedings{avatarcaptcha2012icmla,
    author = {Mohammed Korayem and Abdallah A. Mohamed and David Crandall and Roman V. Yampolskiy},
    title = {Learning visual features for the {A}vatar {C}aptcha {R}ecognition {C}hallenge},
    booktitle = {International Conference on Machine Learning Applications},
    year = {2012}
}

The IU Computer Vision Lab's projects and activities have been funded, in part, by grants and contracts from the Air Force Office of Scientific Research (AFOSR), the Defense Threat Reduction Agency (DTRA), Dzyne Technologies, EgoVid, Inc., ETRI, Facebook, Google, Grant Thornton LLP, IARPA, the Indiana Innovation Institute (IN3), the IU Data to Insight Center, the IU Office of the Vice Provost for Research through an Emerging Areas of Research grant, the IU Social Sciences Research Commons, the Lilly Endowment, NASA, National Science Foundation (IIS-1253549, CNS-1834899, CNS-1408730, BCS-1842817, CNS-1744748, IIS-1257141, IIS-1852294), NVidia, ObjectVideo, Office of Naval Research (ONR), Pixm, Inc., and the U.S. Navy. The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of the U.S. Government, or any sponsor.