Left to their own devices, search engine spiders will often perceive important pages as junk, index content that shouldn't serve as a user entry point, generate duplicate content, and cause a slew of other issues. Are you doing everything you can to guide bots through your website and make the most of each visit from search engine spiders?
It is a little like child-proofing a home. We use child safety gates to block access to certain rooms, add inserts to electrical outlets to ensure nobody gets electrocuted, and place dangerous items out of reach. At the same time we provide educational, entertaining, and safe items within easy access. You wouldn't open the front door of your unprepared home to a toddler, then pop out for a coffee and hope for the best.
Think of Googlebot as a toddler (if you need a more believable visual, try a really rich and very well-connected toddler). Left to roam the hazards unguided, you'll likely have a mess and some missed potential on your hands. Remove the option to access the troublesome areas of your website, and bots are more likely to focus on the good quality options at hand instead.
Restricting access to junk and hazards while making quality choices easily accessible is an important and often overlooked component of SEO.
Luckily, there are a number of tools that allow us to make the most of bot activity and keep bots out of trouble on our websites. Let's look at the four main robot restriction methods: the Meta Robots Tag, the Robots.txt file, the X-Robots-Tag, and the Canonical Tag. We'll quickly summarize how each method is implemented, cover the pros and cons of each, and provide examples of how each one can be best used.
CANONICAL TAG
The canonical tag is a page-level meta tag that is placed in the HTML header of a web page. It tells the search engines which URL is the canonical version of the page being displayed. Its purpose is to keep duplicate content out of the search engine index while consolidating your pages' strength into one 'canonical' page.
The code looks like this:
<link rel="canonical" href="https://example.com/quality-wrenches.htm"/>
There is a good example of this tag in action over at MyWedding. They used this tag to take care of the tracking parameters that are important to the marketing team. Try this URL - https://www.mywedding.com/?utm_source=whatever-they-want-to-track. Right-click on the page, then view the source. You'll see the rel="canonical" entry on the page.
Pros
- Relatively easy to implement. Your dev group can move on to bigger fish.
- Can be used to source content across domains (see the sketch at the end of this section). This may be a good solution if you have syndication deals in the works but don't want to compromise your own search engine presence.
Cons
- Relatively easy to implement incorrectly (see catastrophic canonicalization)
- Search engine support can be spotty. The tag is a signal more than a command.
- Doesn't correct the core issue.
Example Uses
- There are usually other ways to canonicalize content, but sometimes this is a solid solution given all variables.
- Cindy Krum, a Moz associate, recommends canonical tag use if you run into a sticky situation and your mobile site version is outranking your traditional site.
- If you don't want to track your referral parameters with a cookie, the canonical tag is a good alternative.
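To illustrate the cross-domain use mentioned in the pros above, here is a minimal sketch; the domains and path are hypothetical. The tag sits in the head of the syndicated copy on the partner's site and points back at your original article:
<link rel="canonical" href="https://www.your-original-site.com/quality-wrenches.htm"/>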
ROBOTS.TXT
Robots.txt allows for some control of search engine robot access to a site; however, it does not guarantee a page won't be indexed. It should be employed only when necessary. I generally recommend using the meta robots tag 'noindex' for keeping pages out of the index instead.
Pros
- So easy a monkey could do it.
- Great place to point out XML Sitemap files.
Cons
- So easy a monkey could do it (see Serious Robots.txt Misuse)
- Serves as a link juice block. Search engines are restricted from crawling the page content, so (internal) links aren't followed and can't pass the value they deserve.
Example Uses
- I recommend only using the robots.txt file to show that you have one. It shouldn't really restrict anything, but it serves to point to the XML Sitemaps or an XML Sitemap directory file (see the sketch after this list).
- Check out the SEOmoz robots.txt file. It is fun and useful.
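Here is a minimal sketch of that "restrict nothing, point to the Sitemap" approach; the domain and sitemap location are placeholders:
User-agent: *
Disallow:
Sitemap: https://www.example.com/sitemap.xml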
META ROBOTS TAG
The Meta robots tag creates page-level instructions for search engine bots. The Meta robots tag should be included in the head section of the HTML document. Here is how the tag should look in your code (see the sketch below).
The Meta Robots Tag is my very favorite option. By using 'noindex', you keep content out of the index, but the search engine spiders will still follow the links and pass the link love.
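A minimal sketch of the tag, placed inside the head element; the 'noindex, follow' combination is the one described above:
<meta name="robots" content="noindex, follow">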
Pros
- Use of 'noindex' keeps a page out of the search index better than other options like a robots.txt file entry.
- As long as you don't use the 'nofollow' tag, link juice can pass. Woot!
- Fine-tune your entries in the SERPs by specifying NOSNIPPET, NOODP, or NOYDIR. (You're getting all fancy on me now!)
Cons
- Many quite smart folks use 'noindex, nofollow' together and miss out on the important link juice flow piece. :(
Example Uses
- Imagine that your log-in page is the most linked to (and powerful) page on your website. You don't want it in the index, but you certainly don't want to add it to the robots.txt file because that is a link juice block.
- Search result sort pages.
- Paginated versions of pages.
X-ROBOTS-TAG
Since 2007, Google and other search engines have supported the X-Robots-Tag as a way to inform the bots about crawling and indexing preferences via the HTTP header used to serve the file. The X-Robots-Tag is very useful for controlling indexation of non-HTML media types such as PDF documents.
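As an example of what that looks like, a PDF served with a noindex instruction would carry the directive in its HTTP response headers, roughly like this (a trimmed-down sketch, not a complete response):
HTTP/1.1 200 OK
Content-Type: application/pdf
X-Robots-Tag: noindex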
Pros
- Allows you to control indexation of unusual content like Excel files, PDFs, PPTs, and whatever else you've got hanging around.
Cons
- This kind of unusual content can be troublesome in the first place. Why not publish an HTML version on the web for indexation and offer the secondary file type for download?
Example Uses
- You offer product information on your site in HTML, but your marketing department also wants to make a beautiful PDF version available. You'd add the X-Robots-Tag to the PDFs.
- You have an awesome set of Excel templates that are link bait. If you're bothered by the Excel files outranking your HTML landing pages, you could add noindex to your X-Robots-Tag in the HTTP header (see the sketch after this list).
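On an Apache server with mod_headers enabled, a site-wide rule for those file types could look roughly like this (a sketch under that assumption; the file extensions are just examples):
<FilesMatch "\.(pdf|xls|xlsx|ppt|pptx)$">
Header set X-Robots-Tag "noindex"
</FilesMatch>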
Let's Turn this Ship Back Around
What was all the baby talk you started out with, Lindsay? Oh, that's right. Thanks. In your quest to bot-proof your website, you have a number of tools at your disposal. These differ greatly from those used for baby-proofing, but the end result is the same. Everybody (babies and bots) stays safe, on track, out of trouble, and focused on the most important stuff that is going to make a difference. Instead of baby gates and electric socket protectors, you've got the Meta Robots Tag, Robots.txt files, the X-Robots-Tag, and the Canonical Tag.
In my personal order of preference, I'd go with...
- Meta Robots Tag
- Canonical Tag
- X-Robots-Tag
- Robots.txt file
Your Turn!
I would love, love, love to hear how you use each of the above robot control protocols for effective SEO. Please share your uses and experience in the comments and let the conversation flow.
Happy Optimizing!
Stock Photography by Photoxpress
LOL - "So easy a monkey could do it" is definitely a warning sign, in SEO and in life.
Well... sometimes monkeys do things better than me :)
A nice overview of content indexation and crawling control. We use the robots.txt file to prevent directories and a few inconsequential pages from being seen by the bots. You can see it all here.
I would agree with your opinion on the meta robots tag, especially in the case of one-off pages. I can think of two challenges the meta robots tag presents. First, in the case of a site built on a framework, it can be difficult to get such indexation exceptions onto individual pages. Coming from the Marketing side, it can take a lot of work to get such an update to the top of the Engineering queue. Second, once said exceptions are in place, managing the meta robots tags can become a challenge if they start to add up (unless a management tool is added to the CMS as part of the framework update).
We also use the 'rel canonical' markup for all our pages as we have many, many inbound links with tracking parameters.
Just a couple thoughts I had. All in all, an informative piece. I liked the baby analogy (I have two baby girls).
Working with the robots is always a dangerous thing. I've just seen too many uninformed people deny access to their whole site and then wonder what happened. I guess now I have a good post to send them to.
Bit by bit I have learned to use the meta robots tag, the canonical tag, and the robots.txt file the right way - mainly through useful posts like this one. I have never used the X-Robots-Tag - I have to admit I didn't know it even existed. I have to take a closer look at it...
So do I... I never knew the X-Robots-Tag even existed. Great post, this is truly educational to me!
Thanks for that!
Great article and very timely, as I'm about to launch a redesigned site. I will now go and double-check for those noindex,nofollow boo-boos.
WTH - a thumbs down because I thanked a writer?
I think someone thought it was spam.
I'll give you an extra thumb up just to fix it. =)
Thanks for the article... I have noticed that Google tends to ignore robots.txt if there are too many directories being listed... usually a few are OK, but if you have a long list then Google will ignore them... so the trick is to consolidate all the pages you don't want indexed into as few directories as possible.
And you can hide a nice Easter Egg in your robots.txt tag for those curious enough to look...as they do at https://www.searchenginefriendlyhosting.com/robots.txt
Great post, you have gotten my mind thinking through our current indexing strategy, and now I'm wondering if it is as tidy as it could be.
You are correct, it's a lot like having kids. You think your house is pretty clean and child-proof and then you head over to a friend's for a play date, only to find that your definition of clean and tidy sucks... thanks, I now think my house sucks! ;)
Lindsay, you get a thumbs up just for mentioning the SEOmoz robots.txt file!
Seeing it, I had a great laugh. Thanks.
If you liked the SEOmoz robots.txt, you'll love the https://explicitly.me/robots.txt by Rishi
I have recently written a blog entry about the importance of SEO for start-up businesses and really appreciate the information to make this process more successful. Thanks for the info.
https://takecareof.biz/seo-is-about-people-not-robots/
Great post, thank you!
I use the robots.txt file (I know, shame on me) to block a large number of pages on a large ecommerce site I work on. I'm pretty sure it's the best option in this case because the site architecture uses a generic www.mysite.com/Browse.aspx URL for all pages when certain filtering elements on category pages are clicked.
For instance, if someone wanted to sort a category by manufacturer, the site redirects to /Browse.aspx but keeps the identical page content. This created thousands of duplicate pages with the identical URL - /Browse.aspx!
I used robots.txt to block this URL and soon saw indexed versions of pages dropping from the index. Since then we have seen a considerable increase in long tail keyword traffic.
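The rule in question would look something like this (a sketch based on the URL described above, not the actual file):
User-agent: *
Disallow: /Browse.aspx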
Anyways, I just wanted to share an example of a robots.txt working well. We could not have used the canonical URL tag here because lots of different content had the same URL, not the other way around.
That said, do you think the meta robots tag would be a better solution? If so, why?
Great Article!! The article discusses several great ways of controlling what can be indexed without killing the link juice from those pages. Thanks for all the great info!!
Thanks for the X-Robots suggestion. I have been wondering how to block PDF files from being indexed (besides in the robots.txt file).
Now, I need help with the right .htaccess code to block the indexing of PDF files (all or just certain ones). Any suggestions?
Hello,
I would be interested to hear what the best recommended practice is for handling URLs containing campaign tags. I looked at the HTML suggestions in Webmaster Tools and I realized that a lot of my URLs are flagged as having duplicate title tags because Google has a record of both the 'canonical' URL and URLs tagged with my Google Analytics / Webtrends tracking variables (utm_source, WT.mc_id, gclid, etc...).
I absolutely must maintain those tags otherwise I lose tracking of my PPC campaigns - but obviously they are causing some duplicate content issues for Google.
I read example #2 in the post about the canonical meta tag where it is suggested to 'record the referral' and 301 the tagged URL to the canonical URL. My concern is that this solution doesn't seem to be compatible with the way most analytics packages work. Google Analytics and other tracking technologies use client-side scripting to call the web analytics server once the page has loaded and send information about the current page viewed by the user, grabbing various parameters added to the URL. If I 301 the tagged URL to the canonical URL, the web analytics tag will fire only on the canonical page and it will not be able to grab the campaign tracking variables, which have been removed via the 301.
What would be the recommended solution then?
For now, I have tried going into the parameter handling tab in Google's Webmaster Tools and setting all my campaign variables to be ignored.
Lothaire
What about the basket and checkout pages of an e-commerce site?
Robots.txt or noindex?
Hello Lindsay,
Today, I was searching for a solution to set up Meta Robots NOINDEX, Follow on an e-commerce website. I found one helpful YouTube video from the Google Webmaster Tools help desk and your blog post on SEOmoz!
https://www.youtube.com/watch?v=ZjRGkc__FwQ
https://www.gunholstersunlimited.com/airguns.html#!/p=clear&manufacturer=228&order=name
https://www.knobdeco.com/cabinet-hardware/cabinet-knobs.html?line=edwardian
I have a big question regarding the dynamic pages which are compiled by the Narrow by Search or Shop By sections on my e-commerce website.
Now, I have a strong conclusion regarding rel=canonical: we don't need to implement rel=canonical on those dynamic pages.
So, I can set Meta Robots NOINDEX, Follow on all the pages which I have described above. But, I would like to double confirm before making it happen on the live website.
What do you think about it? Can you please give me more ideas on it?
Oh, nice info for robots.txt. I will include this on my website.
Why aren't there other posts like this out there?!? Unfortunately, not all the SEOmozzers are experts. It is difficult for someone like me, an SEO novice, to really understand how the tags work and how to implement them after reading articles about them. Thank god, there are people like Lindsay who explain technical concepts in a simple manner and then provide examples of how to implement them in a practical way. I think even someone like me will now be able to use such fundamental tags for a positive on-page optimization activity.
Thanks for sharing this post as I found it extremely helpful! I loved your comparison of Google Bot to a Toddler. Hilarious.
Great guide to all the different methods, but I'm still somewhat unsure about which pages I shouldn't be allowing access to on a massive, dynamic site.
Anyone seen any good guides on this?
Thank you for the information about robots.txt and the meta robots tag. I am using a robots.txt file, but I didn't know this much depth about it.
Most forums also have a huge amount of pages and links that should be noindexed and nofollowed.
Robots.txt is old school, and as in jazz, old school is the best!
There are a few limitations that can be "helped" with the X-Robots-Tag.
Other than those two, I don't really see the need for the rest - but this is just me.
Thanks for this... I had just asked a question related to this last week. You have given me more confirmation! Here is the link to the question:
https://www.seomoz.org/q/right-now-i-have-my-categories-as-noindex-should-i-change-them-to-index-or-let-the-individual-pages-retain-all-the-juice
Tony ;~)
Canonical tags have proven very useful on our sites; they eliminate tons of duplicate content issues. The robots.txt file, however, is very tricky to implement and risks important pages not being indexed. Better to keep a clean robots file and use meta robots instead. :)
Nice post.
As an SEO for an e-commerce site, I always have the following pages noindexed:
Privacy, Shipping info, Return policy, Terms...
The list can go on, but you see what I mean :)
Those pages are mostly "useless" and rarely read by users. Hence they shouldn't show up in the SERPs for any search phrases.
Nice list, I would also add the Basket links (add to, view) to that. - Jenni
I use the Meta Robots tag on pages when I put a website up for clients to view before we're done with it. This way it's live and they can access it, but there's no indexation.
Just have to remember to remove it when going live ;)
Hi Lindsay,
Where was this post last night when I needed it!! ;) A bit of a technical question here, since I've heard different opinions from some very intelligent SEOs.
Here's the scenario:
Below is a sample of the URLs you want blocked, but you only want to block /beerbottles/ and anything past it:
To remove the pages from the index should you?:
If that's successful, to block Googlebot from crawling again - should you?:
"To add the * or not to add the *, that is the question"
Thanks!
Dave
Hey Dave:
I just took the liberty of posting your question to the new public Q&A so don't forget to check in there in a bit to see if anyone nailed it.
Cool move GNC
Hi,
Nice article. Since Farmer/Panda we have also been thinking about using noindex/follow more on our website. We are an e-commerce website with 80% affiliate products, meaning large amounts of duplicate content or empty pages. We are gradually making more products unique by writing unique content for them, but this is not something that goes quickly.
Would you recommend noindexing all these pages to create more uniqueness and therefore better rankings for our website? We were hit quite severely when Panda rolled out in the US, mainly on our unique products, because these were the pages which ranked the best.
By the way, we are talking about noindexing thousands and thousands of pages in this process. I already calculated that these "low quality" pages only bring in 3% of our SEO traffic and only 1% of revenue.
The only fear that I have is that Google will "punish" us for suddenly noindexing this large amount of our pages at once...
Great summary of all the possibilities. Personally, I never use the X-Robots-Tag.
Great article! I especially like the analogy of baby proofing your site for a rich and well connected toddler! Think I might use that in future with clients when explaining on-site SEO.
I never knew that the X-Robots header existed. I could definitely have used that a few times!
Thanks for the heads up! Great post.
I appreciate your report, and I agree with your point that many are using "nofollow, noindex" together. I hope your post will help them out...
Hi, I usually use the robots.txt for:
My preferred choice is, whenever possible, to use the Meta Robots tag (for instance, for paginated pages of product categories).
I use the canonical tag in order to avoid product page duplications (for instance, if a product is in multiple categories and those are shown by the CMS in the URLs).
Finally, the classic 301, which is not strictly a way to restrict robot access, but a perfect way to tell a robot where to go (see the sketch below).
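A minimal sketch of that kind of redirect in an Apache .htaccess file (assuming mod_alias is available; the paths and domain are hypothetical):
Redirect 301 /old-product-url https://www.example.com/new-product-url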
We used the robots.txt to block search engines from crawling duplicate pages, and our domain authority went up.
So you have us adding a noindex meta tag to allow link juice to flow, but don't you want to supplement this with a canonical tag if it's a duplicate URL, so that links to the duplicate version get counted for the canonical version?
Excellent article Lindsay. Great analogy between a developing child and bots. I have made the same comparison a few times lately myself to better explain how a web site should develop and why.
Until now, I just watched how my blog was indexed by the search engines without blocking unneeded pages in robots.txt. I do use the All in One SEO Pack plugin, which automatically adds the articles.
Everything indexed smoothly, without problems, until one important article ended up in the index under an address that did not match the original one. The indexed address pointed to the article's editing preview, and so it lost an important property - the exact occurrence of a key phrase in the article's address - which could have put it right in first place.
Conclusion: Do not neglect any of the methods described above.
P.s. I apologize for my bad English.
Needless to say, I go for the Meta Robots tag. This post gave me insight into the X-Robots-Tag. Canonical tags have always been a mystery to me and I decided to give up on them :)
Thanks for the share.
Yes, yes.. Happy Optimizing!
Great article! I use the robots.txt file to exclude CMS core files and the robots meta tag to control content indexation, so it looks like I'm on the right track!
I confess I have rarely used the canonical tag... I try not to duplicate content. However, a website I am managing is using UTM codes to track clicks, and it's something I now need to implement.
I have also never heard of the X-Robots-Tag - but it's a good thing to know about! Personally, I hate finding PDFs and .docs in the SERPs!
Good info on the robots.txt file - just cleared up a whole lot of junk that was getting indexed.
Nice article, gonna fix my robots.txt now.
SEOmoz keeps impressing me again and again :) Their robots.txt was really unexpected :)
I am an Internet marketer, but I don't know much about on-page SEO. This article helped me a lot. Thanks. DPS qualitypointtech.net