Google has found an intelligent way to arrange the results for a search query. But an interesting question is: where can we find that intelligence? A lot of people have researched the indexing process, and even more have tested the weight of individual ranking factors, but we wondered how smart Googlebot itself is. To make a start, we took some common statements and principles and tested how Googlebot handles them. Some results are questionable and should be verified across a few hundred domains to be sure, but they can give you some ideas.
Speed of the Crawler
The first thing we tested was Matt Cutts’s statement: “... the number of pages that we crawl is roughly proportional to your PageRank”.
This touches on one of the challenges large content sites face: getting all of their pages indexed. Imagine Amazon.com were a new website; it would take Google a while to crawl all 48 million pages, and if Matt Cutts’s statement is true, it would be impossible without any incoming links.
To test it, we took a domain with no history (never registered, no backlinks) and made a page with 250 links on it. Those links refer to pages that also have 250 links (and so on…). The links and URLs were numbered from 1 to 250, in the same order as they appeared in the source code. We submitted the URL via “addurl” and waited. Because the domain has no incoming links, it has no (or at least a negligible) PageRank. If Matt Cutts’s statement is correct, Googlebot should soon stop crawling.
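For those who want to reproduce the set-up, here is a minimal sketch of the idea: a tiny dynamic site where every URL returns 250 links to the next level down. Flask and the exact URL scheme are my own choices for illustration, not necessarily what ran on the test domains.

```python
# Minimal sketch: serve an "endless" link tree where every page contains 250
# numbered links to the next level. Flask and the URL scheme are assumptions
# for illustration, not the exact set-up used on the test domains.
from flask import Flask

app = Flask(__name__)
LINKS_PER_PAGE = 250

@app.route("/", defaults={"path": ""})
@app.route("/<path:path>/")
def node(path):
    prefix = f"/{path}/" if path else "/"
    # Links are numbered 1..250, in the same order as they appear in the source.
    links = "\n".join(
        f'<a href="{prefix}{i}/">{i}</a>' for i in range(1, LINKS_PER_PAGE + 1)
    )
    return f"<html><body>{links}</body></html>"

if __name__ == "__main__":
    app.run()
```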
As you can see in the graph, Googlebot started crawling the site at a rate of approximately 2,500 pages per hour. After three hours it slowed down to approximately 25 pages per hour and maintained that rate for months. To verify this result we ran the same test on two other domains. Both came up with nearly the same results; the only difference was a lower peak at the beginning of Googlebot's visit.
Impact of Sitemaps
During the tests, the sitemap proved to be a very useful tool for influencing the crawl rate. We added a sitemap with 50,000 uncrawled pages in it (indexation level 0). Googlebot placed the pages added via the sitemap at the top of the crawl queue, which means those pages got crawled before the F-level pages. What's really remarkable, though, is the extreme increase in crawl rate. At first the number of visits had stabilized at 20-30 pages per hour. As soon as the sitemap was uploaded through Webmaster Central, the crawler accelerated to approximately 500 pages per hour, and in just a few days it reached a peak of 2,224 pages per hour. Where the crawler at first visited 26.59 pages per hour on average, it grew to an average of 1,257.78 pages per hour, an increase of no less than 4,630.27%. The increase in crawl rate isn't limited to the pages included in the sitemap; the other F- and 0-level pages benefit from it as well.
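As an aside, generating a sitemap of that size is trivial; a minimal sketch (the domain and URL pattern are placeholders) looks like this:

```python
# Minimal sketch of building a 50,000-URL XML sitemap for submission through
# Webmaster Central. The domain and URL pattern are placeholders.
from xml.etree.ElementTree import Element, SubElement, ElementTree

urlset = Element("urlset", xmlns="http://www.sitemaps.org/schemas/sitemap/0.9")
for i in range(1, 50001):  # 50,000 URLs, the maximum allowed per sitemap file
    SubElement(SubElement(urlset, "url"), "loc").text = f"https://example.com/{i}/"

ElementTree(urlset).write("sitemap.xml", encoding="utf-8", xml_declaration=True)
```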
It’s quite remarkable that Google suddenly uses more of its crawl capacity on the website. At the point where we submitted the sitemap, the crawl queue was filled with F-pages. Google apparently attaches a lot of value to a submitted sitemap.
This brings us back to Matt Cutts’s statement. After only 31 days, Googlebot had crawled about 375,000 pages of the website. If that is proportional to its PageRank (which is 0), it would mean Google will crawl 140,625,000,000 pages of a PageRank 1 website in just 31 days; remember that PageRank is exponential. In other words, you would never have to worry about your PageRank, even if you owned the largest website on the web. So don’t simply accept everything Matt says.
Number of Links
Rand Fishkin says: “…you really can go above Google’s recommended 100 links per page; with a PageRank of 7.5 you can think about 250-300 links” ( https://moz.com/blog/whiteboard-friday-flat-site-architecture )
The 100-links-per-page advice has always been a hot topic, especially for websites with a lot of pages. The reason the advice was originally given is that Google used to index only 100 kilobytes per page; on a 100 KB page, 100 links seemed reasonable, and if a page was any longer, there was a chance Google would truncate it and not index the whole thing. These days Google will index more than 1.5 MB, and user experience is the main reason Google keeps the “100 links” recommendation in its guidelines.
As described in the previous section, Google does crawl 250 links per page, even on sites with no incoming links. But is there a limit? We used the same set-up as the 250-link websites described above, but with 5,000 links per page. When Googlebot visited that website, something remarkable happened: it requested the following pages:
- https://example.com/1/
- https://example.com/10/
- https://example.com/100/
- https://example.com/1000/
On every level Google visits, we see the same page requests. It seems Googlebot doesn't know how to handle such a large number of links and falls back on a mechanical approach, requesting the first link of each URL length.
Semantic Intelligence
One of the SEO myths applied on almost every optimised website is placing links in heading tags. Recently it was mentioned again as one of the factors in the “reasonable surfer” patent. If Google respects semantics, it should attach more value to those “heading” links. We had our doubts and put it to the test: we took a page with 250 links on it, wrapped some of them in heading tags, and did this a few levels deep. After a few weeks of waiting, nothing pointed in the direction that Googlebot preferred the “heading” links. This doesn't mean Googlebot doesn't use semantics in its algorithm; it just doesn't appear to use headings to give links more weight than others.
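For clarity, the mark-up difference we are talking about boils down to something like the sketch below; the 250 links and the “every 50th link” ratio are illustrative, not the exact test pages.

```python
# Illustrative sketch of a test page where most links are plain anchors and a
# few are wrapped in heading tags. The ratio and URLs are assumptions.
def render_links(n_links=250, heading_every=50):
    rows = []
    for i in range(1, n_links + 1):
        anchor = f'<a href="/{i}/">page {i}</a>'
        # Wrap every 50th link in an <h2> to see whether Googlebot crawls
        # "heading" links sooner or more often than the plain ones.
        rows.append(f"<h2>{anchor}</h2>" if i % heading_every == 0 else anchor)
    return "\n".join(rows)

print(render_links())
```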
Crawling JavaScript
Google says it keeps getting better at recognizing and executing JavaScript. Although JavaScript is not a good technique to use if you want to be sure Google follows your links, it's used quite a lot for the opposite goal: when used for PageRank sculpting, the point of JavaScript links is to make them visible to users only. If you use the technique for that purpose, it's good to keep up to date on what Google can and can't recognize and execute. To test Googlebot's JavaScript capabilities, we took the JavaScript snippets described in “The professional’s guide to PageRank optimization” and put them to the test.
The only code Googlebot executed and followed during our test was the link in a simple “document.write” line. This doesn't rule out the possibility that Googlebot is capable of recognizing and executing the more advanced scripts; it may be that Google needs an extra trigger (like incoming links) before it puts more effort into crawling JavaScript.
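To illustrate the difference (these are assumed examples, not the exact snippets from the guide): a plain document.write link exposes the whole anchor in one string, while a more advanced variant assembles the link at runtime. Only the first kind was followed in our test.

```python
# Assumed examples for illustration; not the exact snippets from the cited guide.
SIMPLE_JS_LINK = """
<script type="text/javascript">
  // The full anchor appears literally in the page source.
  document.write('<a href="/js-simple/">simple JS link</a>');
</script>
"""

ADVANCED_JS_LINK = """
<script type="text/javascript">
  // The href is assembled at runtime, so the URL never appears as one string.
  var parts = ['/js-', 'advanced', '/'];
  var a = document.createElement('a');
  a.href = parts.join('');
  a.appendChild(document.createTextNode('advanced JS link'));
  document.body.appendChild(a);
</script>
"""

print(SIMPLE_JS_LINK + ADVANCED_JS_LINK)
```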
Crawling Breadcrumbs
Breadcrumbs are a typical page element created primarily for users, but sometimes they are used to support the site structure as well. Last month we ran into some situations where Googlebot was not able to crawl its way up the structure, so we did some tests.
We made a page a few levels deep with some content and links to the higher levels on it ( https://example.com/lvl1/lvl2/lvl3/ ). We gave the page some incoming links and waited for Googlebot. Although the deep page itself was visited three times by the crawler, the higher pages didn't get a single visit.
To verify this result, we did the same test on another domain. This time the test page was a few levels deeper in the site structure ( https://example.com/lvl1/lvl2/lvl3/lvl4/lvl5/ ), and this time Googlebot did follow some of the links pointing to pages higher in the structure. Even so, it doesn't seem to be a good method for supporting a site structure: after a few weeks Google still hadn't crawled all the higher pages. It looks like Googlebot would rather crawl deeper into the site structure than crawl upwards.
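For reference, the bottom-up links on the deep test pages were essentially a breadcrumb trail like the sketch below (the “lvl” names are the placeholders from the URLs above; the real pages also contained some content).

```python
# Sketch of a breadcrumb trail that links from a deep page back up the structure.
# The "lvl" segment names match the placeholder URLs used in the text.
def breadcrumb(path="/lvl1/lvl2/lvl3/lvl4/lvl5/"):
    segments = [s for s in path.split("/") if s]
    crumbs, href = [], ""
    for seg in segments[:-1]:      # every ancestor level gets a link upward
        href += f"/{seg}"
        crumbs.append(f'<a href="{href}/">{seg}</a>')
    crumbs.append(segments[-1])    # the current page stays plain text
    return " &raquo; ".join(crumbs)

print(breadcrumb())
```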
Takeaways
In short, the lesson learned is that you can influence the crawl rate with a sitemap. This doesn't mean you should always upload a sitemap for your websites: you only want to increase the crawl rate if the bulk of your crawled pages actually get indexed. It takes longer for the crawler to return to an “F”-level page than to an indexed page, so if most of your pages get crawled but are then dropped from the index, you might want to get more incoming links before using a sitemap. The best thing to do is to monitor, for every page, when Googlebot last visited it; that way you can always identify problems in your site structure.
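A minimal sketch of that kind of monitoring, assuming a standard Apache/Nginx “combined” access log (the file name and regular expression are assumptions):

```python
# Record the last Googlebot visit per URL from a "combined"-format access log,
# so pages that never get re-crawled stand out. File name and format are assumed.
import re

LOG_LINE = re.compile(
    r'\S+ \S+ \S+ \[(?P<ts>[^\]]+)\] "(?:GET|HEAD) (?P<url>\S+)[^"]*" '
    r'\d+ \S+ "[^"]*" "(?P<ua>[^"]*)"'
)

last_visit = {}
with open("access.log") as fh:
    for line in fh:
        m = LOG_LINE.match(line)
        if m and "Googlebot" in m.group("ua"):
            # Log lines are chronological, so the latest match wins per URL.
            last_visit[m.group("url")] = m.group("ts")

for url, ts in sorted(last_visit.items()):
    print(ts, url)
```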
The number of links Googlebot will follow isn't limited to 250 per page (even if you have no incoming links), although 5,000 seems to be too many. We haven't found the exact limit yet, but if we do, we'll give you an update.
Putting links in heading tags for crawl purposes seems to be a waste of time. You can still use them for usability, because you're used to them, or because WordPress does it anyway, and maybe, if you're lucky, it's still a ranking factor.
Another conclusion is that Googlebot isn't very good at crawling breadcrumbs, so don't rely on them for site structure purposes; Google just doesn't crawl up as well as it crawls down. In contrast to breadcrumbs, you can use JavaScript for sculpting purposes: Googlebot isn't top of the bill when it comes to recognizing and executing JavaScript links. Keep yourself updated on this subject, but for now you can definitely use some “advanced” JavaScript to do sculpting.
A last result that came up while researching the crawl process was the influence of URL length: a short URL gets crawled earlier than a long one, so always consider the need for indexation and crawling when you choose your URLs.
Although I may not agree with all of your conclusions, I really liked the data collection attempts and the close look at crawler behavior. It's very important to keep in mind the 3 major steps in the process:
(1) Crawling
(2) Indexation
(3) Ranking
I will say that something I see a lot with XML sitemaps is that they seem to jump-start crawling, but, if the site is huge without the PR to support it, pages get crawled and indexed in the short-term but then quickly dropped. I don't think Matt's statement is all or none. The indexation cap is proportional to PR/authority, but it isn't directly proportional. Even new sites can get some love.
Regarding breadcrumbs, I have a hunch that what you might be seeing is that Google doesn't always crawl back up (at least, not in a way you'd see in your log files), because it recognizes that those pages have already been crawled. There may still be a PR-flow component to that link, though. Now that Google is using site breadcrumbs to produce mini-breadcrumbs in the SERPs, it's clear they're paying attention to these architecture cues.
Great point, Dr. Pete! I would love to see a follow-up where Google's index is checked for the inclusion of the pages that were visited once (whether by the initial crawl or by adding the sitemap) and then never crawled again.
At some point, I hope to test the value of a Tweet-link for crawling purposes. It's fairly low on my "must answer" list, but I have a hypothesis that Tweeting a link to new content will be one of the fastest ways to get a page included in the index - faster than a sitemap, submitting a URL, getting a link from another site, etc. Even though Tweet-links are nofollow, I think there can be some serious value in them... but I have to prove it first :)
First, great post - I think it should be promoted. That said, I have a few questions: what content was put on the pages? Was it just the links? I think the behavior Google is showing here is it testing for "infinite space" https://j.mp/agCQPU (WMT blog post).
As far as breadcrumbs go, I think they may be like site-links... I have a hunch that list-based navs have a lot to do with Google being able to identify navigational elements (might not be the only signal, but a strong one that's easy to identify), but I haven't had time to crawl some results and compare their code for similarities.
I'm lame and have no domains earning site-links, so please let me in on whatever insight y'all have :)
There was a little bit more content on the pages than just the links; this was done to make every page unique. The only content on each page was the number of the page fully written out. So, for example, for https://example.com/200/3/25/ the content is: two hundred / three / twenty-five.
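For illustration, that kind of spelled-out content can be generated with something like the sketch below (num2words is used here just as an example library, not necessarily what runs on the test domains).

```python
# Turn each numeric URL segment into words to give every page unique content.
# The num2words package is an assumption for illustration purposes.
from num2words import num2words

def page_content(path="/200/3/25/"):
    numbers = [int(s) for s in path.split("/") if s]
    return " / ".join(num2words(n) for n in numbers)

print(page_content())  # -> "two hundred / three / twenty-five"
```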
I don’t think Google was doing an infinite space test at that point; I think it was just being stupid and taking the first link of every URL length (see the remark above about the influence of URL length).
I do think Google did an infinite space test by requesting a bunch of made-up pages (pages like https://example.com/tpylnqbqyjlo.html ), although it's possible Google uses that test to look at the status codes; I'm not sure.
In all tests Google got a 404 in return when it requested those pages, but we also ran a test where it always got a 200 back, even for those made-up pages. The only difference was that Google didn't stop making those strange requests; Googlebot got at least 18 levels deep on that test. So even if it is doing an infinite space test, it just keeps crawling anyway.
"two hundred / three / twenty-five"
Only returns this current SEOmoz page for me.
Sorry, it was a fictitious example; I'd like to use the domains for some other tests for a while.
You are right about the three steps in the process. For this post we only looked at the crawling part, but I'd like to give you some information about the indexing part:
Domain 1 now has 4,310 indexed pages (no sitemap)
Domain 3 now has 241,000 indexed pages (with sitemap)
The number of indexed pages on both is still growing. I think that's a whole lot of love for a new website, and not really proportional to the content it has.
About the breadcrumb pages: I think Google recognizes that those pages are higher in the site structure, not that they were crawled before (Google had never seen a link to those higher pages before; we only linked from bottom to top instead of top-down). Of course there's probably PR flow through those links, but then again the focus was crawling, not indexing or PageRank.
Very interesting study! In my YOUmoz post, I found that PageRank shared only a 0.30 correlation with the number of pages indexed by Google (although a 0.52 correlation with the number indexed by Bing). This relationship was approximated by an exponential equation with a base of 1.78 raised to PageRank. Much of what I've seen suggests that PageRank transforms with a base somewhere between 1.5 and 2.5. I think you may be overestimating the difference in pages crawled between a PR = 0 site and a PR = 1 site. My study goes into detail about why PageRank may be a metric worth considering.
I definitely enjoyed your post and hope that you will continue to contribute to the community.
LOL. I love reading your comments Sean. Even though I can usually only understand the opening and closing sentences! You are a statistics animal dude!
Haha, thanks GNC. It's tricky to share statistical information in a way that satisfies the hardcore, methodology-critiquing data enthusiasts, while also being accessible to those who aren't as passionate about such things. I'm still trying to figure out the best way to satisfy both groups, while staying focused on the utility and practical applicability of research.
Not to bash academia, but it's part of why I left that world after grad. school. If we spend all of our time arguing and none of our time sharing and communicating the results, then what's the point? I have a much greater appreciation for the people who attempt to explain these things, even if they aren't 100% perfect.
It will be interesting to see how academia, and in particular, the journal/publishing/peer-review process evolves with the ever-increasing free flow of information on the web. I think that there is a huge opportunity for the current system to evolve into something that will truly benefit all mankind. Then again, I could say the same thing about journalism, and most of them are still dragging their feet...
As someone doing an MA in social media and working for an academic software company in that space, I can assure you change will only come at a slow pace, with rivers of tears and much screaming.
Having been part of the community here for a while, I can say I learn more on a daily basis here than clueless academics can teach in a year.
Try writing a social media paper when the published literature is 5 years old and written by someone with little understanding of the subject in the first place.
/rant :P
Great post, found some gems in the comments as well.
Thanks for the subject matter.
Dr. Pete, I'd say there is an argument for adding seasonality as a (4) to your list.
I have been trying to find out whether, and how fast, the Google index moves with respect to seasonality.
When a site is crawled and pages very soon drop out of the rankings, that can be because:
a) your content sucks
or possibly
b) there isn't enough traffic on your keywords for Google to keep your pages in the main index at the low point of the season
That's really interesting, especially if you track it over a year or more (and it's not just an algorithm change). I think generally Google is getting harsher about low-value, long-tail pages, and that Mayday was part of that change in philosophy, but it would be interesting to see if traffic and other cues affected indexation.
TBH, my data in a growing site is very noisy (especially when I often have no idea what the noise is).
We certainly did get hit by the May update. I've been tracking for 4 months now; I'll let you know in 20 months' time ;)
Thanks loads for sharing the results of your tests with us. This is the kind of post that I thought of when reading Dr. Pete's post about 7 types of SEO evidence. This would fall into the Secondhand Evidence section for me.
While not exhaustively researched, it gives me more than enough direction to head towards while doing my own testing and research.
Thanks rolfbroer. Excellent post.
Thank you very much for this info. It certainly shows some insight.
Great post, I love these test data examples, thanks.
Whoah, I didn't mean to comment 3 times. I clicked 'post comment' and the Chrome wheels went turning. I clicked some more out of impatience and ended up commenting 3 times. Anyone know how to delete comments? If so, please remove my redundancy and repetition. :)
I got ya covered :)
Hi, I have a website and I have created a links directory page. However, adding the links to the directory will take hours or DAYS or YEARS!! So anyway, I want to know if I can download or get some HTML code or something for a thing a bit like Googlebot which automatically pulls in links. Is this possible? PS. I don't need a directory filled with hundreds of millions of links - just a few thousand would do.
williamsjohn333
Dear rolfbroer,
Thank you for the very interesting article. I have found some very good recommendations concerning better site indexation, and I am going to check how it works with Russian-language sites.
Excellent post, really nice way to explain how Googlebot crawls web pages.
Excellent analysis of how Googlebot crawls web pages. I really liked your insight about Googlebot crawling links within JavaScript. So with more incoming links, Google will put more focus on crawling JavaScript, eh?
You explain this in a very nice way; anyone can understand it easily.
Thanks, buddy.
This is a great share. I think it sheds some light on why some blog sites interlink as deeply as they do.
Really great information here - thank you for sharing your clearly extensive research with the rest of the SEO community.
It should be noted that not all Googlebot visits in your logs are actually from Googlebot.* We recently conducted a study on this subject, and it showed that 16.3% of all Googlebot visits were fake; 75% of those were also harmful. You can read more here: Is this really a Googlebot crawling my site?
* Webmaster Tools is still reliable, of course.
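For anyone who wants to check their own logs: a reverse DNS lookup followed by a forward confirmation is the usual way to separate real Googlebot visits from fakes. A minimal sketch (the IP below is just a placeholder from a typical Googlebot range):

```python
# Reverse-plus-forward DNS check to tell real Googlebot visits from fakes.
# The example IP is only a placeholder.
import socket

def is_real_googlebot(ip):
    try:
        host = socket.gethostbyaddr(ip)[0]                 # reverse DNS lookup
    except socket.herror:
        return False
    if not host.endswith((".googlebot.com", ".google.com")):
        return False
    try:
        return ip in socket.gethostbyname_ex(host)[2]      # forward confirmation
    except socket.gaierror:
        return False

print(is_real_googlebot("66.249.66.1"))
```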
I appreciate the section on the sitemap. We just added a second sitemap to help index the real estate properties we have recently included as content to our website. It's good to know Google will be influenced by this.
Food for Thought
Great summary, thanks
Thanks for the research on the sitemaps. Good to see it is a useful thing to do.
Not sure I agree about using JavaScript for sculpting purposes though. We need to think about usability; write for the audience, not the search engines, as we always hear.
This would seem to make it clear that an XML sitemap is one of those "more than 200" signals Google's ranking algorithm is using. It would be interesting if SEOmoz (or one of the expert posters here) were brave enough to come up with a best guess at the entire list of them ;)
Someone could probably come up with a list... then tomorrow, it'd be a different list.
OK, I probably oversimplified the statement. I agree it'd be different over time, but I doubt they ever really drop anything; it'd just get longer (as it evidently just has with speed).
I really like this post - but I have to push back on your recommendation not to use breadcrumbs. You should take into account that breadcrumbs are very useful for users... even if they don't help get your pages indexed (although I had a different experience with one of my sites), they certainly help users understand your site architecture.
What I'm trying to say is that you shouldn't rely only on breadcrumbs for your site structure.
Sure, they can help with crawling your site, and yes, you should definitely always offer them to your users. But what we've experienced on a few large sites is that Google wasn't using the breadcrumbs very well during the crawl process.
That's the reason I tried to reproduce it. I'm not saying "don't use breadcrumbs"; I'm trying to say: be aware of this and make sure you always link top-down as well.
Hello,
I would say it's a decent case study, but I have some different experience with breadcrumbs: when we make inner pages, Googlebot always fetches those pages through its "real-time search" method, and within a minimal span of time it fetches the higher pages as well. So based on my experience, I would say Googlebot indexation depends on fresh content, images and videos...
Apart from that, the case study is appreciated.
Really informative post. If you find the total number of links that Google allows, do post an update.
Thanks.
Asim
Appreciate the way this article was written and presented - thanks for taking the time to share your test data. Interesting about the increase in spidering per Sitemap submission :-)
Hi,
First, thank you for this lovely data and piece of work. Congratulations!
Despite this close watch of the Google crawler, I am still a little skeptical about this data and your assumption......
https://example.com/1
https://example.com/10/
https://example.com/100/
https://example.com/1000/
Because I have a site, https://www.fragranceville.com/
The root domain of this site is crawled every 6-7 days. I have some HTML pages on the site, like https://www.fragranceville.com/women-perfumes.html
and https://www.fragranceville.com/miniature-perfumes.html
Some of them haven't been crawled in the last 2 months, while other deep pages are crawled by the bot. So what would you say about that, and what should I do about it?
Thanks,
Allen Pradhan