Usability of CAPTCHAs Or “usability issues in CAPTCHA design”
July 24, 2008 by Richard Conlanhttp://cups.cs.cmu.edu/soups/2008/proceedings/p44Yan.pdf
CAPTCHAs were originally invented at CMU. The goal of a CAPTCHA is to allow humans through but block automated scripts. They are now widely deployed as a method of preventing spam.
Text-based schemes typically require the use to complete a text recognition tasks. Some sites offer a sound-based scheme, typically for accessibility reasons. There have also been some image-based schemes, such as Microsoft’sw Assira.
This research is on designing usable and robust CAPTCHAs. Many deployed CAPTCHAs are not very usable. Many of them are also not very robust - low-cost attacks on schemes by Microsoft, Yahoo! and Google will be presented at CCS’08.
A framework for CAPTCHA usability:
- distortion
- content
- presentation
It is well known that under distortion some characters as as 1 and l, O and 0, and 5 and S have a high potential for confusion. In the name of security, Google and Yahoo! CAPTCHAs have actually created new confusing characters, such as vv and w, cl and d, and rn and m. It would found that ~6% of the CAPTCHAs received from Google included such confusing characters such that a normal user would be unable to distinguish the intended letters. In the latest Yahoo! scheme ~10% of the challenges include such confusing combinations.
An example of a content concern is whether the length of the CAPTCHA string is fixed or variable length. A constant length is more usable, but is also easier for an attacker to segment the image since they know the proper number of segments.
A place presentation comes up is in the use of colors in CAPTCHA design. The researchers found that color can often be an impediment to usability, and sometimes even reduces security. For example, many CAPTCHAs have a complexly colored background, but machines can usually distinguish the text from the colored noise and extract out the background, so it is has hurt usability without helping security. In some cases color is overly relied upon because some CAPTCHAs don’t even distort the character, just relying on the confusion caused by colorful noise, but again, the noise can be stripped out relatively easily.