Search
Recommended Products
Related Links


 

 

Informative Articles

Let's Flash!
Most of you have visited web sites filled with video-like animation, sound effects and music synchronized to the animation, enhanced interactivity, and stunning graphics - all of which appear to load and play almost instantly. These sites seem to...

Paid vs. Free Web Hosting
It really depends on what your needs are and what you are trying to accomplish. There are advantages and disadvantages for both paid and free web hosting. Free Web Hosting Advantages: - The most obvious...

Shopping Carts For The Weary
To choose the means whereby we put our products on the world-wide-web, we proceed by a process of elimination. The chief criteria for judging a shopping cart is the number of credit card processors and shipping services it supports, and the...

Web Site Management: Statistics
Statistics are your most important resource! Used properly, web site statistics will tell you who is visiting your site, where they came from, what search engines they used, their browser types and even their monitor resolutions. These statistics...

What Makes For A Good Host?
I've had to change web hosts a number of times. In fact, I spent most of this week changing from one host to another. Believe me, it is a major pain, although I have made sure that my site is always ready to move if necessary. One thing I've become...

 
The Proper Way To Use The robot.txt File

When optimizing your web site most webmasters don’t consider using the robot.txt file. This is a very important file for your site. It let the spiders and crawlers know what they can and can not index. This is helpful in keeping them out of folders that you do not want index like the admin or stats folder.

Here is a list of variables that you can include in a robot.txt file and there meaning:

  1. User-agent: In this field you can specify a specific robot to describe access policy for or a “*” for all robots more explained in example.
  2. Disallow: In the field you specify the files and folders not to include in the crawl.
  3. The # is to represent comments

Here are some examples of a robot.txt file

User-agent:  *
Disallow:  

The above


would let all spiders index all content.

Here another

User-agent:  *
Disallow:  /cgi-bin/

The above would block all spiders from indexing the cgi-bin directory.

User-agent:  googlebot
Disallow:  

User-agent:  *
Disallow:  /admin.php
Disallow:  /cgi-bin/
Disallow:  /admin/
Disallow:  /stats/

In the above example googlebot can index everything while all other spiders can not index admin.php, cgi-bin, admin, and stats directory. Notice that you can block single files like admin.php.

About The Author

Jimmy Whisenhunt is the webmaster at VIP Enterprises http://www.vipenterprises.org

vipenter@vipenterprises.org

 

 

 

Our Partners

Online Matrimonial Website
http://www.ManMel.com

Online Free Job Portal
http://www.EJobPost.com

Online free Video
http://www.IndiaStudio.in

Software & Web Development
http://www.AasthaComputers.com

Online free Video
http://www.SmartVideoClips.com

Social Networking Site
http://www.IndiaZone.in

Domain and Hosting Solution
http://www.AasthaInfoMark.com

Free Dating
http://www.IndiaExperts.in

Graphic & Web Designing
http://www.Aastha.in

Online Matrimonial Website
http://www.HastMelap.com