Filtering HTML Content Before Loading to Page using PHP Tutorial

In this tutorial, you will learn how to Filter HTML Content before loading or displaying it on the page using PHP Language. The tutorial aims to provide the IT/CS students and new PHP programs with a reference to learn to enhance their knowledge and programming capabilities using the PHP Language. Here, I will provide some snippets and a sample source code that demonstrate the goal of this tutorial.

What is Content Filtering?

Content Filtering is also known as Web Content Filtering which is a set of solutions for monitoring, altering, or analyzing page content. This kind of solution or feature is mostly used in some Content Management System platforms such as WordPress which allows the site users to filter and alter the page or article content before displaying it.

How can we Achieve Content Filtering using PHP?

Using PHP Language, we can simply read the HTML content even displaying it on the page yet using the ob_start() and ob_get_clean() so we could do something with the content before loading it on the page. See the example below for a better idea.

  1.     <?php
  2.     ob_start();
  3.     ?>
  4.     <!-- HTML Content to Filter before loading to Page -->
  5.     <div>
  6.         <h1>Hello World!!!</h1>
  7.     </div>
  8.     <!-- End of HTML Content to Filter before loading to Page -->
  9.  
  10.     <?php
  11.     // HTML content that can be alter
  12.     $content = ob_get_clean();
  13.  
  14.     $altered_content = "{$content}\n<p>This is a sample additional content added to html content</p>";
  15.  
  16.  
  17.     //Load the Filtered HTML Content to Page
  18.     echo $altered_content;
  19.     ?>

The snippet above demonstrates a simple way to filter the HTML content and alter the content before showing it.

Sample Application Source Code

Here are the scripts of a simple web application that demonstrate also Content Filtering using PHP. Follow the instructions so you can run it also on your end. Although, I also provided some snapshots of the results of the application.

First, create a new directory named pages inside your source code root folder. Save the following HTML Files.

about.html

  1. <p>Lorem ipsum dolor sit amet, consectetur adipiscing elit. Donec sed faucibus lacus. Quisque sagittis imperdiet viverra. Fusce ac nulla pretium velit ultricies ultricies. In hac habitasse platea dictumst. Cras risus lacus, iaculis eu metus non, <b>rutrum</b> mollis nulla. Vestibulum semper vitae eros pellentesque dapibus. Nulla blandit fermentum faucibus. <b><span style="color:red">Donec nec lobortis</span> diam</b>. Etiam ultrices lectus vulputate velit molestie, eu hendrerit quam auctor. Nullam ex lorem, tincidunt eget euismod ac, <em>euismod vel ligula</em>. Aliquam venenatis mollis massa et bibendum.</p>
  2.  
  3. <img src="./images/img1.jpg" alt="Sample Image">
  4.  
  5. <p><a href="https://sourcecodester.com">SourceCodester</a></p>
  6.  
  7. <p><span style="font-family: arial, helvetica, sans-serif; color: rgb(0, 0, 0); font-size: 14px;">Duis tempus nisi at vehicula rutrum. Donec faucibus eros sed magna venenatis hendrerit. Duis nec arcu mollis, interdum ligula at, aliquet lectus. Proin fringilla, odio ac lacinia blandit, neque augue bibendum urna, <em>non mattis urna orci a mi</em>. Suspendisse vitae convallis quam. Integer at vestibulum erat, sit amet porta ante. Integer vel cursus ante. <span style="background-color: rgb(251, 238, 184);"><em>Integer eu massa sit amet lectus ullamcorper imperdiet</em></span>. Donec porttitor felis eu quam eleifend iaculis. In eget nulla orci. <strong>Cras fringilla</strong> dapibus justo, id mattis sapien tempus vitae. Cras congue ipsum eu consectetur tincidunt. Nam dictum rhoncus dui nec porta. </span></p>

about.html

  1.     <img src="./images/img2.jpg" alt="">
  2.     <p><span style="font-family: arial, helvetica, sans-serif; color: rgb(0, 0, 0); font-size: 14px;">Duis tempus nisi at vehicula rutrum. Donec faucibus eros sed magna venenatis hendrerit. Duis nec arcu mollis, interdum ligula at, aliquet lectus. Proin fringilla, odio ac lacinia blandit, neque augue bibendum urna, <em>non mattis urna orci a mi</em>. Suspendisse vitae convallis quam. Integer at vestibulum erat, sit amet porta ante. Integer vel cursus ante. <span style="background-color: rgb(251, 238, 184);"><em>Integer eu massa sit amet lectus ullamcorper imperdiet</em></span>. Donec porttitor felis eu quam eleifend iaculis. In eget nulla orci. <strong>Cras fringilla</strong> dapibus justo, id mattis sapien tempus vitae. Cras congue ipsum eu consectetur tincidunt. Nam dictum rhoncus dui nec porta. </span></p>
  3.    
  4.    
  5.    
  6.     <p><span style="color: rgb(0, 0, 0); font-family: arial, helvetica, sans-serif; font-size: 12px;">Sed eu venenatis felis. Morbi elementum ornare magna ut venenatis. Nunc eget congue sem. Fusce porttitor, lacus sed ultrices laoreet, erat leo bibendum est, id iaculis erat est vel ante. Curabitur congue nulla eget nulla accumsan rhoncus. Sed a tellus libero. Nam varius id nisl tempus finibus. Nullam metus purus, pellentesque a venenatis eu, faucibus at eros. Donec eleifend neque bibendum, pulvinar sapien sit amet, maximus ligula. Etiam aliquet ante enim, molestie condimentum velit gravida in. Lorem ipsum dolor sit amet, consectetur adipiscing elit. </span></p>
  7.    
  8.     <img src="./images/img3.jpg" alt="">

The given HTML scripts are the sample HTML script that contains the different contents to filter. Don't forget to change the sample image source on each HTML file that is available on your end.

Next, in your source code root folder, create a new PHP file named index.php and paste the following script. The code below is a combined PHP and HTML script that lists the page contents.

  1. <!DOCTYPE html>
  2. <html lang="en">
  3. <head>
  4.     <meta charset="UTF-8">
  5.     <meta http-equiv="X-UA-Compatible" content="IE=edge">
  6.     <meta name="viewport" content="width=device-width, initial-scale=1.0">
  7.     <title>PHP - HTML Content Filtering</title>
  8.     <link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.2.0/css/all.min.css" integrity="sha512-xh6O/CkQoPOWDdYTDqeRdPCVd1SpvCA9XXcUnZS2FmJNp1coAFzvtCN9BmamE+4aHK8yyUHUSCcJHgXloTyT2A==" crossorigin="anonymous" referrerpolicy="no-referrer" />
  9.     <link rel="stylesheet" href="https://cdn.jsdelivr.net/npm/[email protected]/dist/css/bootstrap.min.css" integrity="sha384-Zenh87qX5JnK2Jl0vWa8Ck2rdkQ2Bzep5IDxbcnCeuOxjzrPF/et3URy9Bv1WTRi" crossorigin="anonymous">
  10.     <link rel="stylesheet" href="style.css">
  11.    
  12.     <script src="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.2.0/js/all.min.js" integrity="sha512-naukR7I+Nk6gp7p5TMA4ycgfxaZBJ7MO5iC3Fp6ySQyKFHOGfpkSZkYVWV5R7u7cfAicxanwYQ5D1e17EfJcMA==" crossorigin="anonymous" referrerpolicy="no-referrer"></script>
  13.     <script src="./jquer-3.6.1.min.js"></script>
  14.     <script src="https://cdn.jsdelivr.net/npm/[email protected]/dist/js/bootstrap.bundle.min.js" integrity="sha384-OERcA2EqjJCMA+/3y+gxIOqMEjwtxJY7qPCqsdltbNJuaOe923+mo//f6V8Qbsw3" crossorigin="anonymous"></script>
  15. </head>
  16. <body>
  17.     <nav class="navbar navbar-expand-lg navbar-dark bg-primary bg-gradient">
  18.         <div class="container">
  19.             <a class="navbar-brand" href="./">PHP - HTML Content Filtering</a>
  20.             <div>
  21.                 <a href="https://sourcecodester.com" class="text-light fw-bolder h6 text-decoration-none" target="_blank">SourceCodester</a>
  22.             </div>
  23.         </div>
  24.     </nav>
  25.     <div class="container-fluid px-5 my-3">
  26.         <div class="col-lg-8 col-md-10 col-sm-12 mx-auto py-5">
  27.             <h2 class="text-center"><b>HTML Content Filtering Befor Displaying using PHP</b></h2>
  28.             <div class="col-lg-2 col-md-3 col-sm-4 col-xs-4 mx-auto">
  29.                 <hr class="bg-primary border-primary opacity-100" style="border-top-width: 4px;">
  30.             </div>
  31.         </div>
  32.         <div class="col-lg-6 col-md-8 col-sm-12 mx-auto">
  33.             <div class="card rounded-0">
  34.                 <div class="card-header">
  35.                     <div class="card-title text-muted"><b>List of Sample HTML Content</b></div>
  36.                 </div>
  37.                 <div class="card-body rounded-0">
  38.                     <div class="container-fluid">
  39.                         <div class="list-group rounded-0">
  40.                             <div class="list-group-item list-group-item-action">
  41.                                 <div class="d-flex w-100 justifuc-content-between align-items-center">
  42.                                     <div class="col-auto flex-shrink-1 flex-grow-1">
  43.                                         <b>About Us Page Content</b>
  44.                                     </div>
  45.                                     <div class="col-auto flex-shrink-1">
  46.                                         <a href="view.php?page=about" class="text-muted text-docoration-none"><i class="fa fa-eye"></i></a>
  47.                                     </div>
  48.                                 </div>
  49.                             </div>
  50.                             <div class="list-group-item list-group-item-action">
  51.                                 <div class="d-flex w-100 justifuc-content-between align-items-center">
  52.                                     <div class="col-auto flex-shrink-1 flex-grow-1">
  53.                                         <b>Welcome Page Content</b>
  54.                                     </div>
  55.                                     <div class="col-auto flex-shrink-1">
  56.                                         <a href="view.php?page=welcome" class="text-muted text-docoration-none"><i class="fa fa-eye"></i></a>
  57.                                     </div>
  58.                                 </div>
  59.                             </div>
  60.                         </div>
  61.                     </div>
  62.                 </div>
  63.             </div>
  64.         </div>
  65.     </div>
  66. </body>
  67. </html>

Next, let's create the sample filtering function for this application. The script below is named filtering.php. It contains the function that is being called when filtering the original content and returns the altered content.

  1. <?php
  2.  
  3. function filter_content_before($content=""){
  4.     // If HTML content is not empty, filter content
  5.     if(!empty($content)){
  6.  
  7.         /**
  8.             * Forcing Links to view in new window
  9.             */
  10.         $links_match_count = preg_match_all('/\<a(.*?)\/a\>/', $content, $matches, PREG_OFFSET_CAPTURE);
  11.         if($links_match_count > 0){
  12.             // Loop matched links
  13.                 foreach($matches[0] as $link){
  14.                 // Check of links is already set to open into a new window or not
  15.                 $link_in_window_count = preg_match_all('/(:?target\s?=\s?["\']_blank["\'])/si', $link[0], $new_window_matches, PREG_OFFSET_CAPTURE);
  16.                 if($link_in_window_count <= 0){
  17.                     $new_link = preg_replace('/(<a\b[^><]*)/i', '$1 target="_blank"', $link[0]);
  18.                     $content = str_replace($link[0], $new_link, $content);
  19.                 }
  20.  
  21.             }
  22.         }
  23.        
  24.         /**
  25.             * Labeling Paragraphs
  26.             */
  27.         $paragraph_match_count = preg_match_all('/<p(.*?)>(.*?)<\/p>/si', $content, $paragraph_matches, PREG_OFFSET_CAPTURE);
  28.  
  29.         /**
  30.             * Add label befor and after paragraphs
  31.             */
  32.         if($paragraph_match_count > 0){
  33.             $pi = 1;
  34.             foreach($paragraph_matches[0] as $paragraph){
  35.                 $new_paragraph = "<div class=\"text-muted text-center\"><em>{Start of Paragraph #{$pi}}</em></div>\n";
  36.                 $new_paragraph .= $paragraph[0]."\n";
  37.                 $new_paragraph .= "<div class=\"text-muted text-center\"><em>{End of Paragraph #{$pi}}</em></div>\n";
  38.                 // $new_paragraph .= "<div class=\"text-muted text-center\"><hr></div>\n";
  39.                 $content = str_replace($paragraph[0], $new_paragraph, $content);
  40.                 $pi++;
  41.             }
  42.         }
  43.         // print_r($paragraph_match_count);exit;
  44.  
  45.         /**
  46.             * Labeling Images
  47.             */
  48.         $img_match_count = preg_match_all('/<img(.*?)(\/?)>/si', $content, $img_matches, PREG_OFFSET_CAPTURE);
  49.  
  50.         /**
  51.             * Add label befor and after Images
  52.             */
  53.         if($img_match_count > 0){
  54.             $ii = 1;
  55.             foreach($img_matches[0] as $img){
  56.                 $new_img = "<div class=\"text-muted text-center\"><em>{Start of Image #{$ii}}</em></div>\n";
  57.                 $new_img .= $img[0]."\n";
  58.                 $new_img .= "<div class=\"text-muted text-center\"><em>{End of Image #{$ii}}</em></div>\n";
  59.                 $content = str_replace($img[0], $new_img, $content);
  60.                 $ii++;
  61.             }
  62.         }
  63.     }
  64.    
  65.     return $content;
  66. }

Lastly, let's create the view page which displays the filtered content of the selected HTML scripts. The file below is named view.php. It contains a 2-panel window that represents the original output of the HTML content scripts and the altered one. On the left panel, it previews the original output of the content while the altered content preview is on the right panel.

  1. <?php
  2. require_once('filtering.php');
  3. $original_conent = "No Page Content Selected";
  4. $filtered_conent = "No Page Content Selected";
  5. $page = isset($_GET['page']) ? ucwords($_GET['page']).".html" : "";
  6. if(!empty($page)){
  7.     ob_start();
  8.     include_once('./pages/'.$page);
  9.     $original_conent = ob_get_clean();
  10.     $filtered_conent = filter_content_before($original_conent);
  11.     // echo "<pre>";
  12.     // print_r($filtered_conent);
  13.     // echo "</pre>";
  14.     // exit;
  15. }
  16. ?>
  17. <!DOCTYPE html>
  18. <html lang="en">
  19. <head>
  20.     <meta charset="UTF-8">
  21.     <meta http-equiv="X-UA-Compatible" content="IE=edge">
  22.     <meta name="viewport" content="width=device-width, initial-scale=1.0">
  23.     <title> <?= isset($_GET['page']) ? ucwords($_GET['page'])." Page Content" : "No Page Content Selected" ?> | PHP - HTML Content Filtering</title>
  24.     <link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.2.0/css/all.min.css" integrity="sha512-xh6O/CkQoPOWDdYTDqeRdPCVd1SpvCA9XXcUnZS2FmJNp1coAFzvtCN9BmamE+4aHK8yyUHUSCcJHgXloTyT2A==" crossorigin="anonymous" referrerpolicy="no-referrer" />
  25.     <link rel="stylesheet" href="https://cdn.jsdelivr.net/npm/[email protected]/dist/css/bootstrap.min.css" integrity="sha384-Zenh87qX5JnK2Jl0vWa8Ck2rdkQ2Bzep5IDxbcnCeuOxjzrPF/et3URy9Bv1WTRi" crossorigin="anonymous">
  26.     <link rel="stylesheet" href="style.css">
  27.    
  28.     <script src="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.2.0/js/all.min.js" integrity="sha512-naukR7I+Nk6gp7p5TMA4ycgfxaZBJ7MO5iC3Fp6ySQyKFHOGfpkSZkYVWV5R7u7cfAicxanwYQ5D1e17EfJcMA==" crossorigin="anonymous" referrerpolicy="no-referrer"></script>
  29.     <script src="./jquer-3.6.1.min.js"></script>
  30.     <script src="https://cdn.jsdelivr.net/npm/[email protected]/dist/js/bootstrap.bundle.min.js" integrity="sha384-OERcA2EqjJCMA+/3y+gxIOqMEjwtxJY7qPCqsdltbNJuaOe923+mo//f6V8Qbsw3" crossorigin="anonymous"></script>
  31.  
  32.     <style>
  33.         img{
  34.             width: 100%;
  35.             /* max-height:35vh; */
  36.             object-fit:scale-down;
  37.             object-position:center center;
  38.         }
  39.     </style>
  40. </head>
  41. <body>
  42.     <nav class="navbar navbar-expand-lg navbar-dark bg-primary bg-gradient">
  43.         <div class="container">
  44.             <a class="navbar-brand" href="./">PHP - HTML Content Filtering</a>
  45.             <div>
  46.                 <a href="https://sourcecodester.com" class="text-light fw-bolder h6 text-decoration-none" target="_blank">SourceCodester</a>
  47.             </div>
  48.         </div>
  49.     </nav>
  50.     <div class="container-fluid px-5 my-3">
  51.         <div class="col-lg-8 col-md-10 col-sm-12 mx-auto py-5">
  52.             <h2 class="text-center"><b>HTML Content View Page</b></h2>
  53.             <div class="col-lg-2 col-md-3 col-sm-4 col-xs-4 mx-auto">
  54.                 <hr class="bg-primary border-primary opacity-100" style="border-top-width: 4px;">
  55.             </div>
  56.             <h4 class="text-center"><b><?= isset($_GET['page']) ? ucwords($_GET['page'])." Page Content" : "No Page Content Selected" ?></b></h4>
  57.         </div>
  58.         <div class="row">
  59.             <!-- Original Page Content Preview Container -->
  60.             <div class="col-lg-6 col-md-6 col-sm-12 col-xs-12">
  61.                 <div class="card shadow-sm rounded-0">
  62.                     <div class="card-header rounded-0">
  63.                         <div class="card-title"><b>Original Page Content Preview</b></div>
  64.                     </div>
  65.                     <div class="card-body rounded-0">
  66.                         <div class="container-fluid">
  67.                             <?php echo $original_conent ?>
  68.                         </div>
  69.                     </div>
  70.                 </div>
  71.             </div>
  72.             <!-- End of Original Page Content Preview Container -->
  73.             <!-- Filtered Page Content Preview Container -->
  74.             <div class="col-lg-6 col-md-6 col-sm-12 col-xs-12">
  75.                 <div class="card shadow-sm rounded-0">
  76.                     <div class="card-header rounded-0">
  77.                         <div class="card-title"><b>Filtered Page Content Preview</b></div>
  78.                     </div>
  79.                     <div class="card-body rounded-0">
  80.                         <div class="container-fluid">
  81.                             <?php echo $filtered_conent ?>
  82.                         </div>
  83.                     </div>
  84.                 </div>
  85.             </div>
  86.             <!-- End of Filtered Page Content Preview Container -->
  87.         </div>
  88.     </div>
  89. </body>
  90. </html>

Output Snapshots

Here are the snapshots of the output of the script I have provided above.

"About Us" Page Content

Content Filtering in PHP

Content Filtering in PHP

"Welcome" Page Content

Content Filtering in PHP

Content Filtering in PHP

That's it! You can now test the sample web applications on your end. I have also provided the source code zip file that I created for this tutorial. You can download it by clicking the Download Button located below this article/content.

There you go! I hope this Filtering HTML Content Before Loading to Page using PHP Tutorial helps you with what you are looking for and that you'll find this useful for your current and future PHP Projects.

Explore more on this website for more Tutorials and Free Source Codes.

Happy Coding =)

Add new comment