Thursday, November 20, 2008

Custom Tags Parsing Using Regular Expressions

In the last post, we created a simple custom tag parsing script using PHP string functions. In this post, we continue the discussion on custom tag parsing, this time using regular expressions. We will see how regular expressions can be used to parse strings, and also where to use them and where not to. Before continuing, I expect that you have a working knowledge of regular expressions; if not, please read up on them first.

Let us first recreate the previous custom tag parsing script using regular expressions:

<form name="form1" method="get" action="">
  <p>
    <!-- textarea should display previously written text (escaped so it cannot break the form) -->
    <textarea name="content" cols="35" rows="12" id="content"><?php if (isset($_GET['content'])) echo htmlspecialchars($_GET['content']); ?></textarea>
  </p>
  <p>
    <input name="parse" type="submit" id="parse" value="Parse">
  </p>
</form>
<?php

if (isset($_GET['parse']))
{
    $content = $_GET['content'];

    //convert newlines in the text to HTML "<br />"
    //required to keep formatting (newlines)
    $content = nl2br($content);

    //PHP function 'eregi_replace' replaces all occurrences of the expression with the replacement given
    //'\\1' is the string matched by the part in parentheses '()' in the regular expression
    //this back-reference syntax belongs to 'eregi_replace', not to PHP itself
    //note: the ereg functions were removed in PHP 7, so this runs only on older versions
    $content = eregi_replace('\.b\.(.+)\./b\.', '<strong>\\1</strong>', $content);
    $content = eregi_replace('\.i\.(.+)\./i\.', '<i>\\1</i>', $content);

    //now the variable $content contains HTML formatted text
    //display it
    echo '<hr />';
    echo $content;
}
?>

But should we use regular expressions here? The answer is no: regular expressions generally run slower than plain string functions, and they add a fair bit of complexity where the same thing could have been done easily with string functions alone.
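As a rough sketch of that string-function alternative, the block below swaps each tag for its HTML counterpart with `str_ireplace`. The helper name `parse_tags_plain` is hypothetical, and the tag syntax (`.b.` ... `./b.`) is assumed from the patterns in the script above. Unlike the regular-expression version, it also replaces a tag that has no matching partner.

```php
<?php
// Hypothetical helper (not from the original post): replaces the custom
// tags with plain string functions instead of regular expressions.
function parse_tags_plain(string $text): string
{
    // str_ireplace is case-insensitive, like eregi_replace
    return str_ireplace(
        array('.b.', './b.', '.i.', './i.'),
        array('<strong>', '</strong>', '<i>', '</i>'),
        $text
    );
}

echo parse_tags_plain('Hello .b.World./b.'), "\n";
```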

The reason I am starting this post with something that contradicts its theme is that people tend to avoid regular expressions, thinking that the same thing can be done some other way (I just gave them one more reason!). That may be the case sometimes, but in many other cases, where complex string manipulation has to be done efficiently, there is only one choice: regular expressions. The next example illustrates this.

For this example we will parse ‘*’ (asterisk) and ‘_’ (underscore) for bolding and italicizing text (as in Google Talk / IM applications). The following text:

Hello *World*. Hello _World_.

Will be parsed and displayed as:

Hello World. Hello World. (the first "World" in bold, the second in italics)

Note that for each of these tags the opening and closing markers are identical. Now let us see how this can be implemented (using regular expressions).

<form name="form1" method="get" action="">
  <p>
    <!-- textarea should display previously written text (escaped so it cannot break the form) -->
    <textarea name="content" cols="35" rows="12" id="content"><?php if (isset($_GET['content'])) echo htmlspecialchars($_GET['content']); ?></textarea>
  </p>
  <p>
    <input name="parse" type="submit" id="parse" value="Parse">
  </p>
</form>
<?php

if (isset($_GET['parse']))
{
    $content = $_GET['content'];

    //convert newlines in the text to HTML "<br />"
    //required to keep formatting (newlines)
    $content = nl2br($content);

    //match anything between the tags but not the tag itself
    //otherwise '*hello* world *hello*'
    //will print 'hello* world *hello' in bold
    //and not 'hello(in bold) world hello(again in bold)'
    //'[^*]+' means one or more characters that are not an asterisk
    $content = eregi_replace('\*([^*]+)\*', '<strong>\\1</strong>', $content);
    $content = eregi_replace('_([^_]+)_', '<i>\\1</i>', $content);

    //now the variable $content contains HTML formatted text
    //display it
    echo '<hr />';
    echo $content;
}
?>
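On current PHP versions the same idea can be written with the PCRE function `preg_replace`. The block below is a minimal sketch of such a port, not the script from this post: the helper name `parse_im_markup` is hypothetical, the pattern is wrapped in `/` delimiters as PCRE requires, and `$1` plays the role of `\\1`.

```php
<?php
// Hypothetical helper (not from the original post): PCRE version of the
// asterisk/underscore parser.
function parse_im_markup(string $text): string
{
    // [^*]+ matches one or more characters that are not an asterisk,
    // so each pair of asterisks is handled separately.
    $text = preg_replace('/\*([^*]+)\*/', '<strong>$1</strong>', $text);
    // Same idea for underscores and italics.
    $text = preg_replace('/_([^_]+)_/', '<i>$1</i>', $text);
    return $text;
}

echo parse_im_markup('Hello *World*. Hello _World_.'), "\n";
```

Because the character class excludes the marker itself, '*hello* world *hello*' yields two separate bold words rather than one long bold run.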

Implementing this with string functions alone would take quite a few more lines of code, but I leave that to you as an exercise.
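For comparison, here is one possible string-function version, offered only as a sketch. The helper name `wrap_pairs` and its parameters are hypothetical, and it assumes a single-character marker. It scans for pairs of markers with `strpos` and rebuilds the string piece by piece, which is the bookkeeping the regular expression handles in one line.

```php
<?php
// Hypothetical helper (not from the original post): wraps the text between
// each pair of $marker characters in $open/$close using only string functions.
// Assumes $marker is exactly one character long.
function wrap_pairs(string $text, string $marker, string $open, string $close): string
{
    $result = '';
    $offset = 0;
    while (($start = strpos($text, $marker, $offset)) !== false) {
        $end = strpos($text, $marker, $start + 1);
        if ($end === false) {
            break; // unpaired marker: leave the rest untouched
        }
        if ($end === $start + 1) {
            // empty pair such as "**": keep the first marker literally
            $result .= substr($text, $offset, $start - $offset + 1);
            $offset = $start + 1;
            continue;
        }
        // copy the text before the pair, then the wrapped content
        $result .= substr($text, $offset, $start - $offset)
            . $open . substr($text, $start + 1, $end - $start - 1) . $close;
        $offset = $end + 1;
    }
    // append whatever follows the last complete pair
    return $result . substr($text, $offset);
}

$html = wrap_pairs('Hello *World*. Hello _World_.', '*', '<strong>', '</strong>');
$html = wrap_pairs($html, '_', '<i>', '</i>');
echo $html, "\n";
```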

15 comments:

  1. I was looking everywhere to find out how I can parse the input for my website. This PHP script is a great solution and fits in well with my design. Great post!

  2. @ programmer23

    Thank you.

    Enjoy!

  3. You may want to change your blog design a bit. The fonts are a bit hard to read, they're little. I'm using 1280x1024 resolution at the moment.

  4. The black, and grey background of the "code font" makes it much harder to see. ima

  5. This is not related to ur post, but its regarding a doubt that I have. Considering the fact that I have a university paper tomorrow, I'd appreciate an early answer. Thank you!

    Can a funtion call be passed as a parameter to another function?

    Methinks, it's possible only when the function returns some value.

    Please help. Much appreciated!

  6. @ Aparna

    Maybe this is what you're looking for:
    http://www.daniweb.com/forums/thread37812.html

    Hope that helps!

  7. thanks for the detailed script...

  8. @videoconference,

    You're welcome.

    Enjoy

  9. thanks for the script. its great. do keep submitting article related to blog.
