Web Design Articles

Search-engine-crawling


Technical level: Basic/Beginner || Date: 5th November 2002 || Author: Nigel Peck

Background

First a little background about Search Engines. Search Engines attempt to bring order to the chaos that the Internet essentially is. Through the use of a Search Engine you can locate web sites related to your chosen topic.

This is done by entering one or more words or phrases that relate to the topic you are looking for. For you using a Search Engine is very simple, for the Search Engine it's a lot harder.

To give you an idea of the scale of running a Search Engine, Google currently runs on 10,000 servers and employs 50 or more PHD level Software Engineers to constantly work on the Search Engine Software.

And what does the Software do? We are only looking at a small part of it, the part that actually goes out and finds the web pages on the Internet.

The Bot

A program called the GoogleBot visits your website and reads each page by following the links within your site. It makes a note of all the words used on your pages that can later be used to find your pages when searching for those words.

This process is called "crawling" or "spidering" because of the way in which the robot (GoogleBot in this case) finds it's way through your site.

This does not just apply to Google's robot; most Internet Search Engines work in the same way.

Once GoogleBot has examined your site the information is passed on to another part of the software which goes on to analyse the words and phrases to get them ready to be added to the index.

After your site has been crawled, you just need to wait for the next Google Update.

Related Pages

We provide Search Engine Optimisation Services




 

 

why-css-is-good-for-google
using-relatve-font-sizes
random-content-rotation
web-design-xhtml-1-1
web-design-resources
accessibility-intro
web-design-xhtml-2-1
why-internet-marketing
xhtml-latin-1-character-references
google-updates
google-dance
css-positioning-properties
fancy-paragraphs
bob-regan-macromedia-accessibility
web-design-technologies
wrox-beginning-php-4-chapter-3-1
julie-howell-rnib-accessibility
handy-hints-web-design
mod_accessibility
pagerank-1
search-engine-crawling
Definition lists - misused or misunderstood.html
Accessible Data Tables
Developing sites for users with Cognitive disabilities and learning difficulties
An Accessibility Frontier Cognitive disabilities and learning difficulties
Inline elements and padding
Basic webstandards Workshop
Internet Explorer and column collapse
Building a page template in CSS - a step by step tutorial
Remote control CSS
Colored boxes - one method of building full CSS layouts
Replicating a Tree table
Creating a graph using percentage background images
Simple, accessible external links
Simple, accessible more links
Styling abbreviations and acronyms
 Web standards checklist
Floated items inside containers
Liquid layouts the easy way
Two columns with color
CSS Centering - fun for all!
Body padding and margin
List inheritance and Descendant Selectors
Taming the Taming lists model
Headings as images - The Lindsay method
Ideal line length for content
Validating Australian Museum Online
Styling the hr element
Styling and font family names that contain whitespace
Sample CSS Page Layouts
how_to_find_good_freelancer
using_colors_on_website
annoying-website-design
 
cross_browser_compatibility
banner-design-success-techniques
 
craig-tanner-freelance
ecommerce-website-design
 
ppc
separating-content-from-presentation
 
alternative
web_page_optimization
 
personalization
communicating_needs_web_designer
 
buzz
design_it_yourself_or_hire_pro
 
social_bookmarking2
w3c_validation.php
social_bookmarking
web_design_guidelines
 
digg
photo_optimization
 
buzz2
website_templates
 
viral_video
web_design_versus_web_development
 
controversy
graphic_formats
 
 
good_website_navigation_is_important