[For this post I'm presuming that you are familiar with CAPTCHA, if not please read this Introduction to CAPTCHA]
Today we are going to see how CAPTCHA (Completely Automated Public Turing test to tell Computers and Humans Apart) works and how it minimizes automatic sign-up of forms. We will also be creating a simple CAPTCHA script in PHP to illustrate this.
Basically CAPTCHA works in the following manner:
-
Create Random Value: Some random string is generated, random values are often hard to guess and predict.
-
Generate an Image: Images are used as these are generally a lot harder to read for computers while being nice and readable to humans. This is also the most important step as simple text in images can be read (and CAPTCHA cracked) quite easily. To make it difficult for them, developers employ different techniques so that the text in the image becomes hard to read for computers. Some create zig-zag lines for background while others twist-and-turn individual characters in the image. Possibilities are many and new techniques are being developed all the time as crackers are always into finding ways to break them.
-
Store it: The random string generated (which is also in the image) is stored for matching the user input. The easiest way to do so is to use the Session variables.
-
Matching: After the above step, the CAPTCHA image is generated and shown on some form which we want to protect from being abused. The users fills in the form along with the CAPTCHA text and submits it. Now we have the following:
-
All submitted form data.
-
CAPTCHA string (from form), input by user.
-
CAPTCHA string (real one, generated by us), from session variable. Session variable is generally used as it can keep stored values across page requests. Here, we needed to preserve stored values from one page (form page) to another (action page-that receives form data).
-
-
If both match, it's okay otherwise not, in that case we can give the user a message that the CAPTCHA they had entered was wrong and their form could not be submitted. You could also ask them to verify it again.
The following image might illustrates this better:
How CAPTCHA is Generated and Matched
From the above image it's quite clear that when someone requests the form page, the CAPTCHA text is generated and sent back to requesting user, but only in the form of an image. If the requester is a human he'd not have much difficulty reading the image and inputting the text when asked but if it's a bot it might face difficulties guessing whats in the image. In the next step when we match the string generated and the one the user had input, we can restrict automated form submissions.
The following is the code that does this, it'll just output the CAPTCHA image to the browser when the script is requested:
<?php
/********************************************************
* File: captcha.php *
* Author: Arvind Gupta (www.arvindgupta.co.in) *
* Date: 12-Mar-2009 *
* Description: This file can be embedded as image *
* to show CAPTCHA/ *
********************************************************/
// The number of characters you
// want your CAPTCHA text to have
define('CAPTCHA_STRENGTH', 5);
/****************************
* INITIALISE *
****************************/
// Tell PHP we're going to use
// Session vars
session_start();
// Md5 to generate the random string
$random_str = md5(microtime());
// Trim required number of characters
$captcha_str = substr($random_str, 0, CAPTCHA_STRENGTH);
// Allocate new image
$width = (CAPTCHA_STRENGTH * 10)+10;
$height = 20;
$captcha_img =ImageCreate($width, $height);
// ALLOCATE COLORS
// Background color-black
$back_color = ImageColorAllocate($captcha_img, 0, 0, 0);
// Text color-white
$text_color = ImageColorAllocate($captcha_img, 255, 255, 255);
// Line color-red
$line_color = ImageColorAllocate($captcha_img, 255, 0, 0);
/****************************
* DRAW BACKGROUND & *
* LINES *
****************************/
// Fill background color
ImageFill($captcha_img, 0, 0, $back_color);
// Draw lines accross the x-axis
for($i = 0; $i <$width
; $i += 5)
ImageLine($captcha_img, $i, 0, $i, 20, $line_color);
// Draw lines accross the y-axis
for($i = 0; $i < 20; $i += 5)
ImageLine($captcha_img, 0, $i,$width
, $i , $line_color);
/****************************
* DRAW AND OUTPUT *
* IMAGE *
****************************/
// Draw the random string
ImageString($captcha_img, 5, 5, 2, $captcha_str, $text_color);
// Carry the data (KEY) through session
$_SESSION['key'] = $captcha_str;
// Send data type
header("Content-type: image/jpeg");
// Output image to browser
ImageJPEG($captcha_img);
// Free-Up resources
ImageDestroy($captcha_img);
?>
Okay, this it for this, in the next one we'll integrate this CAPTCHA script into one form and see how it works. Till then goodbye!