Basic "Custom Tags" Parsing Script

Today we are going to create a basic Custom Tags parsing script that parses special symbols (tags) in text for formatting purposes. Just as a web browser parses <b>BOLD</b> and displays “BOLD” in bold letters, our script will parse tags that we define ourselves. One very popular example of custom tag parsing for formatting is BBCode, which most bulletin boards use to let users format their posts.

This is a basic example, so we will parse only two tags: one makes the enclosed text bold and the other makes it italic. Once you understand the basic idea, you can easily add more tags according to your needs and use the script wherever necessary. One good use would be in the Shout Boxes that we designed a few months back.

Though many would prefer Regular Expressions for parsing, we will not be using them here. For the sake of simplicity, we will use only the basic string manipulation functions available in PHP.

If you look at the code below, you can see a 2D array holding our custom tags. It holds four pieces of information for each tag: the start tag and end tag (both defined by us), and the HTML start tag and HTML end tag. To make this clearer, let’s suppose we want to parse the text “[b]Text[/b]” so that it’s displayed as “Text” in bold. Our custom start tag will be [b], the end tag will be [/b], the HTML start tag will be <strong> and the HTML end tag will be </strong>.

As we will be parsing two different custom tags, the array has eight elements. To add more tags, add four elements for each new tag in the same way as the existing ones; nothing else needs to change.
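
For instance, a third tag could be added as sketched below. The [u] (underline) tag and its HTML pair are assumptions made up for this illustration; they are not part of the original script.

```php
<?php
// Tag table: for each tag, index 0 = custom start tag, 1 = custom end tag,
// 2 = HTML start tag, 3 = HTML end tag.
$tag[0][0] = '[b]';
$tag[0][1] = '[/b]';
$tag[0][2] = '<strong>';
$tag[0][3] = '</strong>';

$tag[1][0] = '[i]';
$tag[1][1] = '[/i]';
$tag[1][2] = '<i>';
$tag[1][3] = '</i>';

// Hypothetical third tag (an assumption for this example): underline.
$tag[2][0] = '[u]';
$tag[2][1] = '[/u]';
$tag[2][2] = '<u>';
$tag[2][3] = '</u>';

// The parsing loop only depends on count($tag), so it picks up the new tag.
$total_tags = count($tag);
echo $total_tags . PHP_EOL; // prints 3
```

Because the loop runs from 0 to count($tag) - 1, the new row is processed without any other change, as long as its index follows on from the previous one.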

The code:

<form name="form1" method="get" action="">
    <!-- textarea should display previously written text -->
    <textarea name="content" cols="35" rows="12" id="content"><?php
if (isset($_GET['content'])) echo htmlspecialchars($_GET['content']); ?></textarea>
    <input name="parse" type="submit" id="parse" value="Parse">
</form>
<?php
//run only after the form has been submitted
if (isset($_GET['content'])) {
    //escape raw HTML so that only our custom tags create markup
    $content = htmlspecialchars($_GET['content']);
    //convert newlines in the text to HTML "<br />"
    //required to keep formatting (newlines)
    $content = nl2br($content);

    //For Tag 1
    $tag[0][0] = '[b]';
    $tag[0][1] = '[/b]';
    $tag[0][2] = '<strong>';
    $tag[0][3] = '</strong>';

    //For Tag 2
    $tag[1][0] = '[i]';
    $tag[1][1] = '[/i]';
    $tag[1][2] = '<i>';
    $tag[1][3] = '</i>';

    //count total no. of tags to parse
    $total_tags = count($tag); //2 for now

    //parse our custom tags adding HTML tags instead
    //which a browser can understand
    for ($i = 0; $i < $total_tags; $i++) {
        //replace the custom start tag, then the custom end tag
        $content = str_replace($tag[$i][0], $tag[$i][2], $content);
        $content = str_replace($tag[$i][1], $tag[$i][3], $content);
    }

    //now the variable $content contains HTML formatted text
    //display it
    echo '<hr />';
    echo $content;
}
?>

The code is pretty straightforward, isn’t it?
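
If you want to reuse the idea outside the form, the same replacement loop can be wrapped in a function. The following is only a minimal sketch: the function name parse_custom_tags() and the decision to escape the input with htmlspecialchars() before parsing are assumptions of this example, not something the script above requires.

```php
<?php
// Hypothetical helper (name chosen for this example) that applies the
// same str_replace() loop to any piece of text.
function parse_custom_tags(string $text, array $tags): string
{
    // Assumption of this sketch: escape raw HTML first, so that only
    // our own custom tags end up as markup.
    $html = htmlspecialchars($text, ENT_QUOTES, 'UTF-8');

    // Convert newlines to <br /> to keep the line breaks visible.
    $html = nl2br($html);

    // Each row: [custom start, custom end, HTML start, HTML end].
    foreach ($tags as $t) {
        $html = str_replace($t[0], $t[2], $html); // start tag
        $html = str_replace($t[1], $t[3], $html); // end tag
    }

    return $html;
}

$tags = [
    ['[b]', '[/b]', '<strong>', '</strong>'],
    ['[i]', '[/i]', '<i>', '</i>'],
];

echo parse_custom_tags('[b]Bold[/b] and [i]italic[/i]', $tags);
// prints: <strong>Bold</strong> and <i>italic</i>
```

Keep in mind that plain string replacement does not check whether every start tag has a matching end tag, so an unclosed [b] leaves an unclosed <strong> in the output.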
