Novel CAPTCHA Design based on Cognitive Factors

Proc. of Int. Conf. on Advances in Communication, Network, and Computing, CNC
Novel CAPTCHA Design based on Cognitive Factors
Krishna Chaurasia1, Mohit Singhaniya2, Sowmya Jain3 and B Sivaselvan4
IIITD&M Kancheepuram, Melakottaiyur, Chennai-127.
1
Email: [email protected]
2
Email: [email protected]
3
Email: [email protected]
4
Email: [email protected]
Abstract— Secure web applications such as email, e-commerce sites, etc. employ
CAPTCHAs so as to distinguish a human attempt to gain access to the service as opposed to
an automated robot (bot). Literature supports different types of CAPTCHAs such as
Gimpy, Question, Collage, etc.. Usability Engineering, a key aspect of Human Computer
Interaction (HCI) focuses on creating solutions with usable interfaces. This paper addresses
usability issues in the domain of CAPTCHAs. A survey of existing types of CAPTCHAs is
presented and novel CAPTCHA designs from a usability perspective are proposed.
Proposed designs factor in cognitive issues and also attempts to explore a marketing /
awareness creation exercise, whilst enhancing the difficulty level for the bot by
accommodating cognitive factors in the design and at the same time make it usable for the
end user.
Index Terms— CAPTCHA, Cognitive, Human Computer Interaction, Usability.
I. INTRODUCTION
CAPTCHA which stands for Completely Automated Public Turing Test To Tell Computers and Humans
Apart, evolved from the primary objective of having to differentiate human from automated robot attempts to
login / gain access to an authenticated system [1],[11]. Most successful and popular commercial websites
these days have some form of CAPTCHA in their process of authentication and hence reduce chances of bot
attempts. There are different types of CAPTCHAs such as Gimpy, Question, Aural, etc. that have evolved
over the years. All the CAPTCHAs that have evolved have primarily focused on making it extremely
difficult for the bot to crack the CAPTCHA, without complicating things for the user (human). With this
contradicting optimization goal of maximising difficulty level for the bot, while minimizing the same for the
human, most CAPTCHA's have exploited human beings capability of recognizing data which are not
complete such as distorted / slanted text, blurred images, etc. Events of an online opinion poll on the best
graduate school, where efficient programmers were able to come up with programs that voted (multiple
times) institutes of their choice, only highlight the need to have an element of human touch to the interaction
process of authentication between the human and the computer (possibly the website). HCI solutions
usability are enhanced by characteristics of Learnability, Flexibility and Robustness. Not much work has
gone into analysing / enhancing the usability of CAPTCHAs along these characteristics [2],[3],[13]. Most
image based CAPTCHAs exploit the capability of the human to do quick and effective segmentation among a
© Elsevier, 2014
clutter as opposed to the bot. Usability Engineering is that field in HCI which focuses on creating useful,
usable and used products / solutions. This paper focuses on the design & development of a few novel
CAPTCHAs in line with the optimization goal described earlier. Section 2 details on the various existing
CAPTCHAs such as Gimpy, ReCAPTCHA, Question, etc [4], [5]. The proposed CAPTCHAs that enhance
the usability experience of the user and the difficulty level for the bot, is presented in Section 3. Section 4
discusses the need for a framework for CAPTCHA evaluation and ideas for the same.
Figure.1. GIMPY CAPTCHA
II. CAPTCHA TYPES - LITERATURE
A. GIMPY
These CAPTCHAs exploit the human capability of efficiently reading / interpreting distorted text / text
displayed in different orientations. The user is displayed an image involving 7 words from the dictionary
(displayed in a distorted and duplicated fashion) and is expected to enter any three unique distorted words
displayed in the CAPTCHA image. An example of such a CAPTCHA is given in Figure 1.
B. ReCAPTCHA
These types of CAPTCHAs are based on the principle of effectively using the human effort that goes into
recognizing distorted and hard to read (by the machine) characters [6],[10]. For instance Google Books
effectively uses this as a mechanism to digitize scanned words that are otherwise non readable.
Authentication systems employing ReCAPTCHA display two words, referred as the Control and the
Unknown word. Entry to the application is based on the control word (dictionary based); whilst the other
unknown word is the one that has to be digitized (yesteryear document's words may have got distorted /
erased over a period of time). The unknown word is digitized used as a future control word based on a
threshold number of users successfully identifying the unknown word. An instance of such a ReCAPTCHA
is shown in Figure 2.
C. Drawing CAPTCHAs
This is based on the principle of prompting the user to connect specific dots that are displayed to him on a
noisy background / grid. The human eye would be capable of easily recognizing and connecting the dots as
desired by the prompt as opposed to the bot. This also exploits the exclusive human capacity of moving in a
grid in a random order, and being a mouse oriented CAPTCHA, it should have increased usability, as
illustrated in Figure 3.
D. Collage CAPTCHAs
These function on the principle of promoting the user to select (click via a mouse) a specific object from an
image composed of several objects [7]. The various objects in the image are displayed in a distorted fashion
to enhance the difficult y level for the bot, whilst utilizing the human capability to parse images in different
orientations. An example of such a CAPTCHA is given in Figure 4.
E. Question CAPTCHAs
Questions from varied domains are posed at the user as part of this CAPTCHA. The difficulty level of the
questions is maintained straightforward for quick response by the human, whilst the domain and nature of the
question bring in the human touch to increase the difficulty level for the bot [8]. A simple but powerful
question CAPTCHA is given in Figure 5. An instance of an extremely usable design (achieving the objective
of CAPTCHA) could be “There are 3 pencils, 4 books and 1 mouse on the table. The question could be “How
790
many stationery are there?' or 'How many fruits are there?” varying the difficulty level for the user / bot
respectively. The second question increased the cognitive aspect wherein any attempts by the bot to predict
the count based on ”how many” would be misled as a result of the domain knowledge requirement.
Figure.2. ReCAPTCHA
Figure.3. Drawing CAPTCHA
Figure.4. Collage CAPTCHA
791
Figure.5. Question CAPTCHA
F. Miscellaneous
Literature also identifies the use of Video CAPTCHA's wherein users are presented with a video, from which
they are expected to enter three words. User's words can be entered even as the video is being played,
exploiting the domain knowledge of the user to identify objects in the video being rendered [9]. Aural
CAPTCHA's have distorted sound clips played back to the user with the user prompted to enter the uttered
word(s). A variant or hybrid version of this is the aural question CAPTCHA's wherein the question
CAPTCHA's are played back in aural form. This would present the bot the challenge of recognizing spoken
language, interpreting the question and solving the same, whilst the human counterpart would do with
relative ease.
III. PROPOSED USABLE CAPTCHAS
Despite the existence of several and varied types of CAPTCHAs and their success rates, there have also been
in-stances of the process becoming difficult for the human being to pass the test as a result of badly designed
(over distorted beyond human eye recognition) CAPTCHAs. This section deals with the different types of
CAPTCHAs that we propose factoring in the usability, learnability and predictability aspects in HCI
Design[12].
A. Hybrid Question CAPTCHAs
Proposed as a build on to the existing question (hybrid) CAPTCHA, it is proposed to have images displayed
to the user, who is prompted to respond with the object name in the image. It is intended to use the space
occupied by the CAPTCHAs for profitable marketing, wherein the images displayed relate to sponsors. This
would enhance the visibility of the product, with the user being advertised about the product in the process of
his solving the CAPTHCHA. These may be referred to as Advertisement CAPTCHAs. An example of such a
CAPTCHA is given in Figure 6, in which case the question can be “Enter the Brand Name”, which also
serves as an advertisement both visually and as part of the CAPTCHA solving process.
B. Awareness CAPTCHAs
This class of CAPTCHAs can again be thought of as extensions to the question type. The question is framed
from an image that displays information about government policies / emergency contacts / other information
that would be of benefit to the society. This type of CAPTCHA aims at using the users’ effort in recognition
of images to spread awareness about some critical issues such as environmental degradation, world peace,
etc. An instance of such a CAPTCHA is given in Figure 7.
792
Figure.6. Hybrid Question CAPTCHA
Figure.7. Awareness CAPTCHA
C. Domain CAPTCHAs
It is intended to have images that are relevant for the audience. For instance a website that requires
CAPTCHA and projects out computer science relevant information could have images / CAPTCHAs framed
out from computer science dictionary. For instance in Figure 8 the user is prompted to enter the name of a
programming language, wherein the displayed image would include one programming language word and
other non domain words. The domain knowledge of the user in computer science and programming in
specific would quickly relate him to the word that constitutes the CAPTCHA.
Figure.8. Domain CAPTCHA
Figure.9. Context CAPTCHA
D. Context CAPTCHAs
A variant of the question type CAPTCHAs the user is prompted with a question that requires reasoning from
the user based on facts in real life. For instance the CAPTCHA in Figure 9 requires the user to relate the
actors name to the relevant portion of the image.
E. Matching / Puzzle CAPTCHAs
The user will be displayed a sequence of words which he is expected to group / pair using context / real life
facts. An example of such a CAPTCHA is displayed in Figure 10 and the user would be mandated to identify
n out of m pairs correctly. Another instance of a Puzzle CAPTCHA is shown in Figure 11, wherein the user
would be expected to enter the solution to the puzzle in place, which is the number of objects in the order
24ifh73 (ifh is the text CAPTCHA, while the numbers are the counts of objects in a left to right direction)s.
Given the other possible predictions from the user, there has to be a learning phase associated with the use of
such CAPTCHAs, which could be in the form of demonstrations (on directions / clues) as to how the puzzle
can be solved, till it finds acceptance. Given that human beings can identify characters / images even in low
793
contrast as opposed to increased difficulty level for the bot, image CAPTCHA can also exploit such human
centric features and achieve the overall objective.
Figure. 10 Matching CAPTCHA
Figure. 11 Puzzle CAPTCHA
IV. METRICS FOR CAPTCHA EVALUATION
Most existing systems that make use of CAPTCHAs to sense human login support the option for refreshing
(fresh CAPTCHAs) in case of increased difficulty in solving the CAPTCHA currently displayed, while a
feasible solution would be to provide either enhanced / reduced level complexity of CAPTCHAs based on
some predefined number of failed / successful attempts. Apart from this, evolving a framework for
CAPTCHA evaluation for their usability, readability, predictability (by the user), and other factors could
enhance the experience level of the user. Google’s similar effort in the domain of search engines resulted in
the PULSE and HEART frameworks, focusing on statistical / threshold and HCI / User Psychology measures
for evaluating search engines. Usability engineering related factors such as Learnability, Flexibility and
Robustness and other cognitive measures could be explored in the domain of CAPTCHA design and
evaluation. A few metrics along the lines of PULSE could be time required by the user to crack the
CAPTCHA, number of refreshes, time and difficulty level of the CAPTCHA, etc. Cognitive measures would
only add value to the purpose of CAPTCHAs. Time required solving domain v/s non domain CAPTCHAs,
comfort levels of user for specific type of CAPTCHAs, etc. could be helpful HCI metrics for CAPTCHA
design. The facial expressions of the user in response to a CAPTCHA solving session(s) could also key give
inputs on the usability of the CAPTCHA. An established cognitive / statistical metric will also contribute to
the design and development of Usable CAPTCHAs. Design of CAPTHCA’s has primarily focused on
increasing difficulty level for the bot and ease of user access. However an established set of metrics could
result in an optimized framework for the bot and the human user.
V. CONCLUSION
Usability Engineering is an increasingly sought after trend in application development and more so in HCI.
The paper proposed variants to existing CAPTCHA types that accommodate more cognitive abilities of the
user such as inference, deduction, interpretation, etc. A more generic categorization on HCI measures and
specifics such as Learnability, Flexibility, Robustness, Synthesizability, etc. is being explored and shall be
reported in the future. As mentioned attempts are in the direction of a HCI / Mathematical framework to
evaluate the success of CAPTCHA with respect to the objective of separating Humans and Computers Apart,
which could serve as templates for the testable design of Cognitive CAPTCHAs.
REFERENCES
[1] K. A. Kluever and R. Zanibbi, “Balancing usability and security in a video CAPTCHA,” in Proceedings of the 5th
Symposium on Usable Privacy and Security, ser. SOUPS ’09. ACM, 2009, pp. 14:1–14:11.
[2] M. Shirali-Shahreza and S. Shirali-Shahreza, “CAPTCHA for blind people,” in Signal Processing and Information
Technology, 2007 IEEE International Symposium on, dec. 2007, pp. 995 –998.
[3] Collage CAPTCHA, 2007. [Online]. Available: http://dx.doi.org/10. 1109/ISSPA.2007.4555329
[4] K. Chellapilla, K. Larson, P. Y. Simard, and M. Czerwinski, “Computers beat humans at single character
recognition in reading based human interaction proofs (hips),” in CEAS, 2005.
[5] A. Rusu and V. Govindaraju, “Handwritten CAPTCHA: Using the difference in the abilities of humans and
machines in reading handwritten words,” in Proceedings of the Ninth International Workshop on Frontiers in
Handwriting Recognition, ser. IWFHR ’04. IEEE Computer Society, 2004, pp. 226–231.
794
[6] M. Shirali-Shahreza and S. Shirali-Shahreza, “Question-based CAPTCHA,” in Proceedings of the International
Conference on Computational Intelligence and Multimedia Applications (ICCIMA 2007) - Volume 04, ser.
ICCIMA ’07. IEEE Computer Society, 2007, pp. 54–58.
[7] A. S. El Ahmad, J. Yan, and L. Marshall, “The robustness of a new CAPTCHA,” in Proceedings of the Third
European Workshop on System Security, ser. EUROSEC ’10. New York, NY, USA: ACM, 2010, pp. 36–41.
[8] J. Yan and A. S. El Ahmad, “Usability of CAPTCHAs or usability issues in CAPTCHA design,” in Proceedings of
the 4th symposium on Usable privacy and security, ser. SOUPS ’08. New York, NY, USA: ACM, 2008, pp. 44–52.
[9] R. Gossweiler, M. Kamvar, and S. Baluja, “What’s up CAPTCHA?: a CAPTCHA based on image orientation,” in
Proceedings of the 18th international conference on World wide web, ser. WWW ’09. New York, NY, USA: ACM,
2009, pp. 841–850.
[10] L. von Ahn, B. Maurer, C. McMillen, D. Abraham, and M. Blum, “reCAPTCHA: Human-based character
recognition via web security measures,” Science, vol. 321, no. 5895, pp. 1465–1468, 2008.
[11] L. von Ahn, M. Blum, and J. Langford, “Telling humans and computers apart automatically,” Commun. ACM, vol.
47, no. 2, pp. 56–60, Feb. 2004.
[12] J. Nielsen, “Usability 101: Introduction to usability,” Jakob Nielsens Alertbox, 2003.
[13] www.w3.org/TRturingtest, “Inaccessibility of CAPTCHA alternatives to visual turing tests on the web,” W3C
Working Group note.
795