Onetomarket

23
Jun
2010

Google’s crawl order

auteur: Rolf Broer in Featured, SEO

tags:

It’s very interesting to know in which order Googlebot crawls your website. If you know the order, you can manipulate Google to crawl your most important pages first. Especially on new large content websites, you want Google to crawl the important sections of your site first.

To test this, we took a domain with no history (never registered, no backlinks) and made a page with 250 links on it. Those links refer to pages with also 250 on it (and so on…). The linktext and URL’s were numbered from 1 till 250, In the same order as they appeared in the source code. We submitted the URL via “addurl” and waited.

On the first visit of Googlebot it only visited the root page (http://example.com/). After a few hours it returned and visited all the 250 pages it found on the root page. At first it seemed that Google was dividing all the links on the pages into three blocks:

  • Block1: links 1 till 9
  • Block2: links 10 till 99
  • Block3: links 100 till 250

If Googlebot visits a page there is a good chance that it will follow links from one or more of the blocks. The links are crawled in a batch per block in a random order. The chances that block 1 is crawled is three times greater then block 2 and even 6 times greater then block 3. In block 3 there are two links which have a slightly higher chance to be crawled: link /100/ and /200/.

Afbeelding 1 Scriptie 300x202 Google’s crawl order

With only the results of this test it is to early to conclude that Google always divides a page into these blocks. It is possible that Google bases its blocks on the length of the link text, on the length of a URL, on the position of the link etcetera. To exclude al the other possibilities we have set up some other tests.

The test which gave us the decisive answer was a test with the length of URL’s in random order on a page. Those links linked to similar pages with also links with various lengths in random order (etcetera). Googlebot seems to crawl links in order of length. A good thing to know if you want to do some advanced site structure sculpting.

Afbeelding 2 Scriptie 300x154 Google’s crawl order

Takeaways

With these insights choosing the length of your URL’s becomes more important. It is a good way to influence Googlebot so you should choose the length of each URL wisely. Google crawls short URL’s before it crawls the longer URL’s. It doesn’t help to make all your URL’s short. If they all have an equal length they will get crawled at random. Always consider the need for indexation and the need to be crawled if you choose your URL.

So it’s good to take it into account when you are designing your site structure. Probably it is even more important for linkbuilding. We have not tested it yet, but there is a good chance Google crawls external links the same way. This would mean that if your URL is on a page with hundreds of other links, you may increase your crawl chance if you have the shortest URL.

note: This insight is part of a thesis, more to come soon. (link will be placed here.)

Rolf Broer

Over Rolf Broer

Rolf Broer is Search Engine Optimization Consultant at Onetomarket
meer reacties Plaats uw reactie!

Gerelateerde artikelen

4 reacties to “Google’s crawl order”

  • Reactie van z3zim4 op 9 July 2010 at 08:48

    I think it’s not a good thing that Google crawls links in lenght order. And I think Google doesn’t want that site owners could decide the crawling order. Makes no sense to me.

  • Reactie van Paul op 22 July 2010 at 17:16

    Hi Rolf
    Interesting study.

    Just because the url is shorter and has been indexed, does not mean it will be ranked for the topic/keywords/phrase…..

  • Reactie van Laurent Colas(athlete enddurance) op 22 July 2010 at 21:04

    Very interesting

  • Reactie van Rolf Broer op 30 July 2010 at 14:28

    @z3zim4
    In the end Google will probably always try to crawl all the links on a page. The reason they’ve chosen to do that in order of length will probably have something to do with levels of a website. (a deeper level will probably have a longer URL.) I think it’s just their way to figure out the link structure of a website.

    @Paul
    So true, you always should decide for a page if it’s should rank well or is meant to be a page used for building your sitestructure. In the last case you could use a short URL to get the page crawled faster.

    @Laurent Colas
    thnx!

Reageer!

Naam (verplicht)
E-mail (wordt niet getoond) (verplicht)
Website


Subscribe by e-mail

We won’t share your address with others, we will keep you informed about everything that is in your company’s best interest.

Related articles

Related services

Get in touch with us






Knowledgebase

7 reasons why link building needs patience

Lets just paraphrase this article; link building costs time. Lots of time. Because a lot of people want immediate results I will focus a lot of time and energy on this post to clarify why it is important to have loads of patience. To name a few: it takes time to become an authority it takes time [...]
more information

Linkbait: what it is, tips & tricks

An compelling explanation of a phenomenon that is called linkbait, how it works, how you can recognize it and what it can do for your business.
more information

Preventing 404s in 7 steps

In this article I will explain you as a webmaster or marketer how you could prevent these messages on the basis of a simple roadmap.
more information

Take me to the: knowlegdebase

Why work with us?

You are looking for a reliable partner. Onetomarket understands the importance of a good partnership. This partnership is based on trust, performance, communication and a pro-active work ethic.

Imagine your hassle taken away and you are completely able to focus on a strategic level.

If you like the sound of this we recommend you take action right now!

Get in contact with one of our