XML Sitemaps: The Most Misunderstood Tool in the SEO's Toolbox

Comments 83

Please keep your comments TAGFEE by following the community etiquette.

E-mail me when new comments are posted

Sort by:

Comments are closed on posts more than 30 days old. Got a burning question? Head to our Q&A section to start a new conversation.

Associate

Michael Cottam
Associate

2017-04-11T00:08:36-07:00

Has anyone got an interesting example of using XML sitemaps to diagnose what they needed to do to their content to get Google to start indexing a certain class of pages?

I'd love to see breakpoint stats on something like minimum image size, or original vs. stock photo image, adding a video to get the page indexed, or internal linking or clicks-from-home-page minimums.

8 0

 Has anyone got an interesting example of using XML sitemaps to diagnose what they needed to do to their content to get Google to start indexing a certain class of pages? I'd love to see breakpoint stats on something like minimum image size, or original vs. stock photo image, adding a video to get the page indexed, or internal linking or clicks-from-home-page minimums.
Cancel
- JibbedSEO
 
 2017-04-11T09:25:48-07:00
 
 Worth pointing out that "XML" sitemaps don't have to be XML format. They can just be a text file of URLs separated by a new line, and they're just as valid and trusted as actual XML. A very good option to avoid complex dev work and you can even make them on your own machine if it's a smallish site.
 
 However, if you want to implement hreflang via XML sitemap then it'll need to be true XML
 
 JibbedSEO edited 2017-04-11T09:40:01-07:00
 3 0
 
 Worth pointing out that "XML" sitemaps don't have to be XML format. They can just be a text file of URLs separated by a new line, and they're just as valid and trusted as actual XML. A very good option to avoid complex dev work and you can even make them on your own machine if it's a smallish site. However, if you want to implement hreflang via XML sitemap then it'll need to be true XML
 Cancel
 - Michael Cottam
 
 2017-04-11T10:08:29-07:00
 
 Good point. But if you have a small site, you might as well use the free version of Screaming Frog and let it generate a complete XML sitemap for you. Then you can tweak priorities, last update dates, etc. as needed.
 
 1 0
 
 Good point. But if you have a small site, you might as well use the free version of <a href="https://www.screamingfrog.co.uk/seo-spider/" rel="nofollow">Screaming Frog</a> and let it generate a complete XML sitemap for you. Then you can tweak priorities, last update dates, etc. as needed.
 Cancel
 - JibbedSEO
 
 2017-04-11T10:18:50-07:00
 
 Seems priorities are ignored according to Google: https://twitter.com/methode/status/846796737750712...
 
 1 0
 
 Seems priorities are ignored according to Google: <a href="https://twitter.com/methode/status/846796737750712320" rel="nofollow">https://twitter.com/methode/status/846796737750712...</a>
 Cancel
 - Michael Cottam
 
 2017-04-11T13:31:55-07:00
 
 Interesting--I guess they must have seen very few people using them in a helpful way!
 
 4 0
 
 Interesting--I guess they must have seen very few people using them in a helpful way!
 Cancel
AndreK

2017-04-11T13:39:14-07:00

Great Post!

I would like to add one little thing: I also exclude every URL that contains a Canonical to another Page -because it tells Google, that I do not want this 'non-canonical' Page to be indexed.

Greetings from Germany!

AndreK edited 2017-04-11T13:43:02-07:00
7 0

Great Post! I would like to add one little thing: I also exclude every URL that contains a Canonical to another Page -because it tells Google, that I do not want this 'non-canonical' Page to be indexed. Greetings from Germany!
Cancel
- Michael Cottam
 
 2017-04-11T13:53:11-07:00
 
 Excellent point!
 
 2 0
 
 Excellent point!
 Cancel
Arun Sarathy

2017-04-12T02:44:04-07:00

Thanks Michael for nailing the point down in plain language without much of the technical jargon we usually see on blog posts about XML sitemaps.

Now, apart from that, I also want to point out another important thing when it comes to XML Sitemaps - some sites don't realize that the XML Sitemaps actually gather all the pages from the site - including the ones that were crafted with care - like the lead generation pages (lead magnets) where PDF downloads are offered in exchange for reader's email IDs.

So, one could essentially grab the PDF without the site owner's requirement of providing the email address. I have come across hundreds of sites with this problem and have personally emailed them to fix it. Heck, I wrote a book on this. (I am not including its link here and end up appearing spammy) Nevertheless, this appears to be seldom addressed by site owners. Perhaps the seemingly technical stuff scares them out, though it just boils down to plain common sense.

ArunSarathy edited 2017-04-12T14:02:22-07:00
6 0

Thanks Michael for nailing the point down in plain language without much of the technical jargon we usually see on blog posts about XML sitemaps. Now, apart from that, I also want to point out another important thing when it comes to XML Sitemaps - some sites don't realize that the XML Sitemaps actually gather all the pages from the site - including the ones that were crafted with care - like the lead generation pages (lead magnets) where PDF downloads are offered in exchange for reader's email IDs. So, one could essentially grab the PDF without the site owner's requirement of providing the email address. I have come across hundreds of sites with this problem and have personally emailed them to fix it. Heck, I wrote a book on this. (I am not including its link here and end up appearing spammy) Nevertheless, this appears to be seldom addressed by site owners. Perhaps the seemingly technical stuff scares them out, though it just boils down to plain common sense. 
Cancel
- Michael Cottam
 
 2017-04-12T14:10:47-07:00
 
 That's a great point, Arun. If the do this, not only could a savvy web developer see through this and get the PDFs directly, but it would also be encouraging Google to index those PDFs directly, so non-developers might get to them without going through the paywall...directly from search results!
 
 3 0
 
 That's a great point, Arun. If the do this, not only could a savvy web developer see through this and get the PDFs directly, but it would also be encouraging Google to index those PDFs directly, so non-developers might get to them without going through the paywall...directly from search results!
 Cancel
 - Arun Sarathy
 
 2017-04-14T03:10:28-07:00
 
 Totally agree!
 
 Aptly named, here's the name of the book I mentioned earlier (just realized, I didn't give the name) - The Backdoor in Your Blog.available on Amazon.
 
 3 0
 
 Totally agree! Aptly named, here's the name of the book I mentioned earlier (just realized, I didn't give the name) - The Backdoor in Your Blog.available on Amazon.
 Cancel
Arav Rai

2017-04-11T02:17:41-07:00

A well explained guide of XML-sitemaps. I was also having some of the myths discussed in this post, as it helps to get page index, in fact, I have been taught in my SEO Training, but now got clear. This is how we learn many things from MOZ blogs.

4 0

A well explained guide of XML-sitemaps. I was also having some of the myths discussed in this post, as it helps to get page index, in fact, I have been taught in my SEO Training, but now got clear. This is how we learn many things from MOZ blogs. 
Cancel
Praveen Sharma

2017-04-11T00:39:06-07:00

Great post Michael,

Have done something similar for an eCommerce website in past. And after optimizing the sitemap and robots.txt, we saw better crawling stats in GSC.

The issue was something like this, the eCommerce has created specific pages for all their categories, yet they were allowing dynamic search URLs get indexed, even they had these URLs in sitemap, which is dynamically generated.

Another issue was, they have user profiles on their website, that only contains order history and related stuff, these URLs were also a part of the sitemap in big number.

We had a discussion with our client over the importance of such user profiles for search users, and we decided to remove them from the sitemap after that. Then, we got all these profile and dymanic search URLs deindexed from the search engine followed by blocking them from robots.txt. Within days, we saw improved crawl stats for the website.

Thanks

4 0

Great post Michael, Have done something similar for an eCommerce website in past. And after optimizing the sitemap and robots.txt, we saw better crawling stats in GSC. The issue was something like this, the eCommerce has created specific pages for all their categories, yet they were allowing dynamic search URLs get indexed, even they had these URLs in sitemap, which is dynamically generated. Another issue was, they have user profiles on their website, that only contains order history and related stuff, these URLs were also a part of the sitemap in big number. We had a discussion with our client over the importance of such user profiles for search users, and we decided to remove them from the sitemap after that. Then, we got all these profile and dymanic search URLs deindexed from the search engine followed by blocking them from robots.txt. Within days, we saw improved crawl stats for the website. Thanks
Cancel
- Michael Cottam
 
 2017-04-11T08:31:44-07:00
 
 Thanks Praveen...this is probably one of the biggest problems e-commerce sites have: where the very helpful UX gives you filtering, sorting, and user options that cause incredible numbers of variations on what really is pretty much the same page of content.
 
 3 0
 
 Thanks Praveen...this is probably one of the biggest problems e-commerce sites have: where the very helpful UX gives you filtering, sorting, and user options that cause incredible numbers of variations on what really is pretty much the same page of content.
 Cancel
WineisVino.com

2017-04-19T00:00:07-07:00

Wow! Very helpful information. Now I should check how to generate my dynamic xml sitemaps on my magento with only my important pagrs.

Thank you!

3 0

Wow! Very helpful information. Now I should check how to generate my dynamic xml sitemaps on my magento with only my important pagrs. Thank you!
Cancel
- Michael Cottam
 
 2017-04-19T06:24:04-07:00
 
 You don't need Magento, really....just any server-side programming language that can access your Magento database.
 
 2 0
 
 You don't need Magento, really....just any server-side programming language that can access your Magento database.
 Cancel
Banshe hogar

2017-04-28T16:12:57-07:00

Hello Michael!

Thanks for your advice, I will keep it in mind from now on.

2 0

Hello Michael! Thanks for your advice, I will keep it in mind from now on.
Cancel
SergioB1717

2017-04-27T16:50:33-07:00

Sometimes we have the best information in front of our eyes and we do not realize that

I've learned a lot about XML sitemaps in a single post, clarifying several ideas

I'll share the link so others can read it

SergioB1717 edited 2017-04-27T16:51:28-07:00
2 0

Sometimes we have the best information in front of our eyes and we do not realize that I've learned a lot about XML sitemaps in a single post, clarifying several ideas I'll share the link so others can read it 
Cancel
Akash Srivastava

2017-04-12T02:56:20-07:00

Excellent post Michael, I use Yoast plugin and that helps me solve most of these problems.

2 0

Excellent post Michael, I use Yoast plugin and that helps me solve most of these problems. 
Cancel
- Luis Alvarez
 
 2017-04-16T12:53:32-07:00
 
 What about All in Seo?
 
 1 0
 
 What about All in Seo?
 Cancel
 - Michael Cottam
 
 2017-04-17T07:58:20-07:00
 
 Not sure. I used All in 1 SEO several years ago, but I've since switched all of my sites and my clients' sites to Yoast.
 
 1 0
 
 Not sure. I used All in 1 SEO several years ago, but I've since switched all of my sites and my clients' sites to Yoast.
 Cancel
Tim Wilson

2017-04-11T08:11:01-07:00

What a great recommendation about the utility pages. I have been wondering if the no value pages for search on a site and more of a user tool should be ignored or indexed, and you just answered that thought. I really think you touched on some great points in this read by talking about both the value of sitemaps and how Google and other search engines have a pre-compiled algorithm that will determine if the page is work indexing.

Last note on the e-commerce indexing fantastic when a person is wondering why there are so many products not being consumed by the index bot.

Thanks for the contribution to the Moz community.

2 0

What a great recommendation about the utility pages. I have been wondering if the no value pages for search on a site and more of a user tool should be ignored or indexed, and you just answered that thought. I really think you touched on some great points in this read by talking about both the value of sitemaps and how Google and other search engines have a pre-compiled algorithm that will determine if the page is work indexing. Last note on the e-commerce indexing fantastic when a person is wondering why there are so many products not being consumed by the index bot. Thanks for the contribution to the Moz community. 
Cancel
- Michael Cottam
 
 2017-04-11T08:41:56-07:00
 
 Thanks Tim!
 
 1 0
 
 Thanks Tim!
 Cancel
Nicholas White

2017-04-12T19:44:01-07:00

Great summary on XML Sitemaps Michael! I'd be lying if I said I didn't have a couple misconceptions about them throughout the years, but you summed it up quite nicely and this will be great to refer back to. Also, I most definitely agree that understanding the difference between a utility page and a search landing page for your website is crucial.

2 0

Great summary on XML Sitemaps Michael! I'd be lying if I said I didn't have a couple misconceptions about them throughout the years, but you summed it up quite nicely and this will be great to refer back to. Also, I most definitely agree that understanding the difference between a utility page and a search landing page for your website is crucial.
Cancel
- Michael Cottam
 
 2017-04-12T20:21:56-07:00
 
 Thanks Nicholas!
 
 2 0
 
 Thanks Nicholas!
 Cancel
AnikTM-SEO

2017-04-12T01:07:09-07:00

Excellent Post Michael! Very helpful!

2 0

Excellent Post Michael! Very helpful!
Cancel
Sergey Grybniak

2017-04-11T02:51:49-07:00

Great post, Michael. Thanks!

XML is always a problem. A mismatch between xml and robots.txt is real.

Sergey_Grybniak edited 2017-04-11T09:40:19-07:00
2 0

Great post, Michael. Thanks! XML is always a problem. A mismatch between xml and robots.txt is real. 
Cancel
- Michael Cottam
 
 2017-04-11T10:10:57-07:00
 
 Thanks! And I agree. It gets even worse when meta robots doesn't line up with robots.txt and that doesn't line up with the XML sitemap.
 
 1 0
 
 Thanks! And I agree. It gets even worse when meta robots doesn't line up with robots.txt and that doesn't line up with the XML sitemap.
 Cancel
Sindhu SEO

2017-04-11T05:27:53-07:00

Oh my god... Seriously, all these years i thought just add xml sitemap is enough to get an attention from Google. honestly i never know there are so many things in xml sitemap. Good that at least i have learned now. Thanks a lot

2 0

Oh my god... Seriously, all these years i thought just add xml sitemap is enough to get an attention from Google. honestly i never know there are so many things in xml sitemap. Good that at least i have learned now. Thanks a lot
Cancel
Abhishek Singh Rao

2017-04-11T01:36:53-07:00

I want to add 2 important things which needs to be understood along with this great article!

1) HTML Sitemap: As Michael explained XML Sitemap is like giving clue to Google that these pages are important for Indexing whereas HTML sitemaps are usually give clue to visitors to have a better and easier site experience.

2) XML Sitemap Priority: I often saw that client assigns a high priority(1.0) to all of the URLs on sitemap but It won't help ever. It's only a a hint to Search Crawler to select between URLs on the same site.

2 0

I want to add 2 important things which needs to be understood along with this great article! 1) HTML Sitemap: As Michael explained XML Sitemap is like giving clue to Google that these pages are important for Indexing whereas HTML sitemaps are usually give clue to visitors to have a better and easier site experience. 2) XML Sitemap Priority: I often saw that client assigns a high priority(1.0) to all of the URLs on sitemap but It won't help ever. It's only a a hint to Search Crawler to select between URLs on the same site. 
Cancel
- Alireza Zahedi
 
 2017-04-11T07:52:45-07:00
 
 I Have this website with over 300,000 index pages that the users add content themselves. how am I supposed to make XML sitemap with too many links? Also, What if the users delete some content and it remains in my map?
 
 here is the site
 
 1 0
 
 I Have this website with over 300,000 index pages that the users add content themselves. how am I supposed to make XML sitemap with too many links? Also, What if the users delete some content and it remains in my map? <a href="https://www.nanotejarat.com" rel="nofollow">here is the site</a> 
 Cancel
 - Shiv Jaiswal
 
 2017-04-12T05:18:51-07:00
 
 For big xml sitemaps, you can break them into part and upload them separately.
 
 2 0
 
 For big xml sitemaps, you can break them into part and upload them separately. 
 Cancel
 - Michael Cottam
 
 2017-04-12T14:05:50-07:00
 
 I agree with Shiv--break it into many smaller sitemaps. Google limits you to 50,000 URLs per sitemap, in fact. You should be generating your sitemap automatically, or at least on a very regular basis, from the actual content in your CMS.
 
 1 0
 
 I agree with Shiv--break it into many smaller sitemaps. <a href="https://stackoverflow.com/questions/2887358/limitation-for-google-sitemap-xml-file-size" rel="nofollow">Google limits you</a> to 50,000 URLs per sitemap, in fact. You should be generating your sitemap automatically, or at least on a very regular basis, from the actual content in your CMS.
 Cancel
 - Abhishek Singh Rao
 
 2017-04-13T02:04:17-07:00
 
 Hi Alireza, I reviewed your website and I recommend you to to make category wise sitemaps. i.e Electronic Components has separate sitemap and others have the same. Please let me know if there is any follow-up question.
 
 1 0
 
 Hi Alireza, I reviewed your website and I recommend you to to make category wise sitemaps. i.e Electronic Components has separate sitemap and others have the same. Please let me know if there is any follow-up question.
 Cancel
- Michael Cottam
 
 2017-04-11T08:34:36-07:00
 
 Definitely agree. If you have an HTML sitemap, and you're finding a lot of users are resorting to the sitemap to find what they're looking for, then this is a good indication that you need to improve your main navigation!
 
 Agreed on the sitemap priority number. People need to understand that it's there for you to give Google a clue as to which of two or more pages about the same topic is the more important one, i.e. your category page about purple widgets vs. a blog post about purple widgets. It's not going to affect how your page ranks against pages from another website.
 
 2 0
 
 Definitely agree. If you have an HTML sitemap, and you're finding a lot of users are resorting to the sitemap to find what they're looking for, then this is a good indication that you need to improve your main navigation! Agreed on the sitemap priority number. People need to understand that it's there for you to give Google a clue as to which of two or more pages about the same topic is the more important one, i.e. your category page about purple widgets vs. a blog post about purple widgets. It's not going to affect how your page ranks against pages from another website.
 Cancel
 - Abhishek Singh Rao
 
 2017-04-13T02:10:36-07:00
 
 Well said. Google clearly tells that "Design as much as possible user friendly and responsive website, it'll automatically add SEO value". HTML sitemap plays big role in user friendliness. Thanks Michael!
 
 2 0
 
 Well said. Google clearly tells that "Design as much as possible user friendly and responsive website, it'll automatically add SEO value". HTML sitemap plays big role in user friendliness. Thanks Michael!
 Cancel
I.Marketing

2017-04-11T01:06:58-07:00

Great technical article! Very useful for Seos without a technical background like me.

I.Marketing edited 2017-04-11T09:40:23-07:00
2 0

Great technical article! Very useful for Seos without a technical background like me. 
Cancel
- Michael Cottam
 
 2017-04-11T10:12:05-07:00
 
 Thanks Mark!
 
 1 0
 
 Thanks Mark!
 Cancel
Frank_A

2017-04-17T05:00:49-07:00

Very, very helpful and ready for immediate application after I resolve some areas of ignorance. For example, I now understand that any pages behind password protection should be noindexed as they are not landing pages. But, over half of my pages are PHP action pages with no HTML block. Do these pages need to be noindexed?

1 0

Very, very helpful and ready for immediate application after I resolve some areas of ignorance. For example, I now understand that any pages behind password protection should be noindexed as they are not landing pages. But, over half of my pages are PHP action pages with no HTML block. Do these pages need to be noindexed? 
Cancel
- Michael Cottam
 
 2017-04-17T08:00:13-07:00
 
 Any pages that are password-protected shouldn't really need noindex, unless there's actually a way for Google to find a link to them and get the content without logging in as one of your users. If that's the case, then you probably need to work on your login security :-).
 
 For your PHP pages that have no HTML on them, I'd block those in robots.txt. There's no point in letting Google crawl those as they have no outbound links to send link juice to other pages on your site.
 
 1 0
 
 Any pages that are password-protected shouldn't really need noindex, unless there's actually a way for Google to find a link to them and get the content without logging in as one of your users. If that's the case, then you probably need to work on your login security :-). For your PHP pages that have no HTML on them, I'd block those in robots.txt. There's no point in letting Google crawl those as they have no outbound links to send link juice to other pages on your site.
 Cancel
colemanconcierge

2017-04-16T16:04:26-07:00

I have been weighing the benefits of publishing a series of small articles to boost regular content but I am worried about producing regular content. Would I use XML Sitemaps to keep the crawlers focused on higher quality content?

1 0

I have been weighing the benefits of publishing a series of small articles to boost regular content but I am worried about producing regular content. Would I use XML Sitemaps to keep the crawlers focused on higher quality content?
Cancel
- Michael Cottam
 
 2017-04-17T07:57:19-07:00
 
 If you only have your more important articles in your XML sitemap, it MAY cause Google to crawl those first, especially if you resubmit that sitemap.
 
 1 0
 
 If you only have your more important articles in your XML sitemap, it MAY cause Google to crawl those first, especially if you resubmit that sitemap. 
 Cancel
jamiehennings

2017-04-16T03:56:37-07:00

Hello,

I have used XML sitemaps plugin on my WordPress site from 1 years, it was working fine, but a few days ago I found some spamming issue in it, when i am try to click on "XML sitemap", and I have started ping my site manually.

jamiehennings edited 2017-04-16T03:59:22-07:00
1 0

Hello, I have used XML sitemaps plugin on my WordPress site from 1 years, it was working fine, but a few days ago I found some spamming issue in it, when i am try to click on "XML sitemap", and I have started ping my site manually.
Cancel
Martin Beneš

2017-04-18T05:56:14-07:00

Hi Michael, thanks for the article. What do you think about uploading sitemaps regularly based on the months with the latest pages?

For example "sitemap-2017-april.xml" etc.

Thanks. Cheers, Martin

1 0

Hi Michael, thanks for the article. What do you think about uploading sitemaps regularly based on the months with the latest pages? For example "sitemap-2017-april.xml" etc. Thanks. Cheers, Martin 
Cancel
- Michael Cottam
 
 2017-04-18T09:05:39-07:00
 
 When you submit an XML sitemap in Search Console, it's a hint/suggestion to Google that you've either updated that content or it's new. So, if you've got new articles in that sitemap, then that can be a good idea.
 
 But that sounds like a lot of manual work to me :-).
 
 I'd do something programmatically that pulled the latest 10 days worth of articles, generated a newest-articles.xml sitemap, setting the modification frequency to daily on all the URLs.
 
 1 0
 
 When you submit an XML sitemap in Search Console, it's a hint/suggestion to Google that you've either updated that content or it's new. So, if you've got new articles in that sitemap, then that can be a good idea. But that sounds like a lot of manual work to me :-). I'd do something programmatically that pulled the latest 10 days worth of articles, generated a newest-articles.xml sitemap, setting the modification frequency to daily on all the URLs.
 Cancel
nachete92

2017-05-01T12:56:01-07:00

I think this article could help so much people like me, because I just use the XML sitemap plugin by Arne Brachold and i do not configure anything. It is also true that my sites are so small and therefore the importance of this tool could be less than for huge projects.

1 0

I think this article could help so much people like me, because I just use the XML sitemap plugin by Arne Brachold and i do not configure anything. It is also true that my sites are so small and therefore the importance of this tool could be less than for huge projects.
Cancel
Rohit Chugh

2017-04-19T03:47:20-07:00

I get the below error for my XML sitemap in Search Console. Not able to resolve it :/

Your Sitemap or Sitemap index file doesn't properly declare the namespace. Expected: https://www.w3.org/1999/xhtml Found: https://www.sitemaps.org/schemas/sitemap/0.9

Parent tag: url

Tag: link

1 0

I get the below error for my XML sitemap in Search Console. Not able to resolve it :/ Your Sitemap or Sitemap index file doesn't properly declare the namespace. Expected: <a href="https://www.w3.org/1999/xhtml" rel="nofollow">https://www.w3.org/1999/xhtml</a> Found: <a href="https://www.sitemaps.org/schemas/sitemap/0.9" rel="nofollow">https://www.sitemaps.org/schemas/sitemap/0.9</a> Parent tag: url Tag: link
Cancel
- Michael Cottam
 
 2017-04-19T11:27:56-07:00
 
 I'm betting a special character in there somewhere is messing up the XML. What did you edit it with or create it with?
 
 2 0
 
 I'm betting a special character in there somewhere is messing up the XML. What did you edit it with or create it with?
 Cancel
Interstate.Tenant

2017-04-13T13:25:58-07:00

Do you have any advice for small sites? I have a sitemap that updates daily and Google still only indexes weird pages. I even used the Googe (XML) Sitemaps Generator Plugin for Wordpress and it's still a mess. Our blog doesn't show up at all and pages that don't exist, like "portfolio tag" and "branding tag" show up constantly no matter how many times I block them.

Interstate.Tenant edited 2017-04-13T13:26:32-07:00
1 0

Do you have any advice for small sites? I have a sitemap that updates daily and Google still only indexes weird pages. I even used the Googe (XML) Sitemaps Generator Plugin for Wordpress and it's still a mess. Our blog doesn't show up at all and pages that don't exist, like "portfolio tag" and "branding tag" show up constantly no matter how many times I block them. 
Cancel
Topsydneyseo

2017-04-27T21:28:36-07:00

Hey Micheal, Just to touch on what you said regarding utility I often ask myself before posting anything on one of our websites for example "Is it relevant?". It sounds kind of odd, but when your writing content 11 hours a day 4 days a week it does get tiresome and easy to drift off topic. But with regards to this post and XML sitemaps, your absolutely right.

I often tell my clients, we have to fix your layout, drop some keywords and make your website even and consistent. Many SEO agencies here is Australia often forget to write their content for humans to read and search engines to rank. If a human wont read my content, why would a search engine?

Out of all the posts, pages and back links, I have submitted to google, the one issue that gives me anxiety is: Sitemap. I didn't think about dynamic sitemaps before until now, it makes a lot of sense. Thanks for sharing.

FeliciaCrawford edited 2017-04-28T16:18:25-07:00
1 0

Hey Micheal, Just to touch on what you said regarding utility I often ask myself before posting anything on one of our websites for example "Is it relevant?". It sounds kind of odd, but when your writing content 11 hours a day 4 days a week it does get tiresome and easy to drift off topic. But with regards to this post and XML sitemaps, your absolutely right. I often tell my clients, we have to fix your layout, drop some keywords and make your website even and consistent. Many SEO agencies here is Australia often forget to write their content for humans to read and search engines to rank. If a human wont read my content, why would a search engine? Out of all the posts, pages and back links, I have submitted to google, the one issue that gives me anxiety is: Sitemap. I didn't think about dynamic sitemaps before until now, it makes a lot of sense. Thanks for sharing. 
Cancel
gearexperten

2017-04-23T13:16:41-07:00

Don’t underestimate an xml sitemap. And make sure it is setup and working in the Google Search Consol and Bing Webmaster Tools. To many forget the xml sitemap importance.

gearexperten edited 2017-04-23T13:17:00-07:00
1 0

Don’t underestimate an xml sitemap. And make sure it is setup and working in the Google Search Consol and Bing Webmaster Tools. To many forget the xml sitemap importance. 
Cancel
BJ Wright

2017-04-12T11:20:13-07:00

Michael, excellent content thanks for posting! We have a Wordpress site with 50k+ indexed pages. I've been advised against using Yoast to manage our XML-Sitemap for our site specifically and am currently using ScreamingFrog to manually create the XML sitemap.

At the end of the day, do we need to build our our xml sitemap based around the rules you mapped out above specific to our content? Or is there another tool/process you'd recommend? Right now our process is very manual and I want to find a more automated/optimized route to handling our XML sitemap.

1 0

Michael, excellent content thanks for posting! We have a Wordpress site with 50k+ indexed pages. I've been advised against using Yoast to manage our XML-Sitemap for our site specifically and am currently using ScreamingFrog to manually create the XML sitemap. At the end of the day, do we need to build our our xml sitemap based around the rules you mapped out above specific to our content? Or is there another tool/process you'd recommend? Right now our process is very manual and I want to find a more automated/optimized route to handling our XML sitemap. 
Cancel
- Michael Cottam
 
 2017-04-12T14:08:17-07:00
 
 For large sites, I recommend building internal processes for generating your sitemaps. Break your content down into various types, and generate a separate sitemap for each type. For my travel site, for instance, I have an XML sitemap for just hotel pages, another for travel specials, another for static pages, and a set of them (Yoast-generated for these) for the blog pages (only the blog part of my site is WordPress). It's a relatively simple thing to iterate over all of a certain type of record in your database and spit out the URLs for those types of entities, in XML sitemap format.
 
 1 0
 
 For large sites, I recommend building internal processes for generating your sitemaps. Break your content down into various types, and generate a separate sitemap for each type. For my travel site, for instance, I have an XML sitemap for just hotel pages, another for travel specials, another for static pages, and a set of them (Yoast-generated for these) for the blog pages (only the blog part of my site is WordPress). It's a relatively simple thing to iterate over all of a certain type of record in your database and spit out the URLs for those types of entities, in XML sitemap format.
 Cancel
sweta-patel

2017-04-11T22:57:00-07:00

Good Information.

But How To Find Which page are index and Which are pending in Google?

sweta-patel edited 2017-04-12T14:02:42-07:00
1 0

Good Information. But How To Find Which page are index and Which are pending in Google?
Cancel
- Michael Cottam
 
 2017-04-12T14:16:21-07:00
 
 Break your sitemap into many smaller sitemaps. You can then look for sitemaps that have a low indexation rate, and then that's where your problems lie. You can then take THOSE problem sitemaps, and break them into smaller sitemaps even further, based on whatever hypothesis you have on why some of those URLs aren't getting indexed and others are.
 
 1 0
 
 Break your sitemap into many smaller sitemaps. You can then look for sitemaps that have a low indexation rate, and then that's where your problems lie. You can then take THOSE problem sitemaps, and break them into smaller sitemaps even further, based on whatever hypothesis you have on why some of those URLs aren't getting indexed and others are.
 Cancel
Alireza Zahedi

2017-04-11T07:55:52-07:00

What if the indexed pages by google are higher in number than any possible XML site map we can create? do we stilll need one?

1 0

What if the indexed pages by google are higher in number than any possible XML site map we can create? do we stilll need one?
Cancel
- Michael Cottam
 
 2017-04-11T08:41:37-07:00
 
 Absolutely. In fact, this is an indication that you have a big problem with indexation, in that Google is finding and indexing pages that you don't think are important or potential search landing pages! Likely that means they're very light on content...and if Google ends up indexing them, then from an overall site perspective, Google is seeing the average content quality per page as lower than they should.
 
 As an example, let's say you have a page for sharing a URL from your website. Let's say this page takes some parameter that indicates the page to be shared, and at the top shows the heading from the page and a snippet from the content, plus the usual form fields for sharing...just enough content so that Google does decide to index it. You're not going to put all of those pages in your XML sitemap, of course. If Google is indexing those, and you have 1000 pages of real content on your site, you've now got Google indexing 1000 good pages + 1000 share-this pages of non-content. And so Google will see half your site as pretty marginal content.
 
 3 0
 
 Absolutely. In fact, this is an indication that you have a big problem with indexation, in that Google is finding and indexing pages that you don't think are important or potential search landing pages! Likely that means they're very light on content...and if Google ends up indexing them, then from an overall site perspective, Google is seeing the average content quality per page as lower than they should. As an example, let's say you have a page for sharing a URL from your website. Let's say this page takes some parameter that indicates the page to be shared, and at the top shows the heading from the page and a snippet from the content, plus the usual form fields for sharing...just enough content so that Google does decide to index it. You're not going to put all of those pages in your XML sitemap, of course. If Google is indexing those, and you have 1000 pages of real content on your site, you've now got Google indexing 1000 good pages + 1000 share-this pages of non-content. And so Google will see half your site as pretty marginal content. 
 Cancel
 - Vjay Vasu
 
 2017-04-12T11:57:03-07:00
 
 Hello Mike, so pretty much an index bloat, which in the long run is going to affect how Google sees a website, ie is it a quality site, or low quality ( Low EAT) site.
 
 This means that even though a lot of pages are indexed, the crawl rate will go down, the over all rankings will be affected, or worse make it harder to do clean and propoer SEO?
 
 Also, I have created hundereds of sitemaps using screaming frog paid liscence , , inculded sub domains, images, videos, etc but never set priorities. This may be a good idea but googlebot ultimately will do what it things is best, which pages it feels is most relevant.
 
 I have never created a dynamic site map - can you please point me to a resource or tool?
 
 Thank you and this is terrific post.
 
 @seogrowthhacker from San Francisco
 
 1 0
 
 Hello Mike, so pretty much an index bloat, which in the long run is going to affect how Google sees a website, ie is it a quality site, or low quality ( Low EAT) site. This means that even though a lot of pages are indexed, the crawl rate will go down, the over all rankings will be affected, or worse make it harder to do clean and propoer SEO? Also, I have created hundereds of sitemaps using screaming frog paid liscence , , inculded sub domains, images, videos, etc but never set priorities. This may be a good idea but googlebot ultimately will do what it things is best, which pages it feels is most relevant. I have never created a dynamic site map - can you please point me to a resource or tool? Thank you and this is terrific post. @seogrowthhacker from San Francisco
 Cancel
 - Michael Cottam
 
 2017-04-12T12:10:39-07:00
 
 Hello Vjay,
 
 I think you're exactly right on the index bloat/quality comments.
 
 For dynamic sitemaps, I don't know that there's a tool for that. What I have done is written database queries to return the values I need to figure out all page URLs for a given type, and then form the URLs the same way I'd form them on the web pages that list links to those pages....but instead, spit out XML in the sitemap syntax.
 
 1 0
 
 Hello Vjay, I think you're exactly right on the index bloat/quality comments. For dynamic sitemaps, I don't know that there's a tool for that. What I have done is written database queries to return the values I need to figure out all page URLs for a given type, and then form the URLs the same way I'd form them on the web pages that list links to those pages....but instead, spit out XML in the sitemap syntax.
 Cancel
ChrisHemmingway

2017-04-11T13:18:35-07:00

Hi, great post and very helpful. We have a few websites but one of them https://flyusanywhere.com/ has yoast and I tried to activate the google xml tool as well but it won't allow me to run both as it says they will get confused. Is it better to deactivate the Yoast one and run the Google version or what do you think is best?

Many thanks

FeliciaCrawford edited 2017-04-25T16:59:55-07:00
1 0

Hi, great post and very helpful. We have a few websites but one of them https://flyusanywhere.com/ has yoast and I tried to activate the google xml tool as well but it won't allow me to run both as it says they will get confused. Is it better to deactivate the Yoast one and run the Google version or what do you think is best? Many thanks 
Cancel
- Michael Cottam
 
 2017-04-11T13:55:48-07:00
 
 There's no problem in Search Console in submitting a number of different sitemaps. Even if some URLs are included in more than one, that should be just fine. I do this all the time.
 
 Having said that, there might possibly be a conflict between the two plugins, i.e. something simple like they're both trying to write out to sitemap_index.xml or something like that.
 
 1 0
 
 There's no problem in Search Console in submitting a number of different sitemaps. Even if some URLs are included in more than one, that should be just fine. I do this all the time. Having said that, there might possibly be a conflict between the two plugins, i.e. something simple like they're both trying to write out to sitemap_index.xml or something like that.
 Cancel
Mark Marino

2017-04-11T12:46:37-07:00

This was a great post. I'd also include canonical URLs in Bucket #2. I've instances where a product feed - that generates the XML Sitemaps - has dynamic parameters to reference SKU's or unique ID's, canonical to the clean URL, but only the SKU URLs added tot he sitemap.

1 0

This was a great post. I'd also include canonical URLs in Bucket #2. I've instances where a product feed - that generates the XML Sitemaps - has dynamic parameters to reference SKU's or unique ID's, canonical to the clean URL, but only the SKU URLs added tot he sitemap. 
Cancel
- Michael Cottam
 
 2017-04-11T13:33:42-07:00
 
 Absolutely agree. Good point Mark!
 
 2 0
 
 Absolutely agree. Good point Mark!
 Cancel
Lance-Zeidman

2017-04-11T23:10:49-07:00

Thanks for sharing and so just a quick question for an insurance website. Please also forgive I'm still a layman but If I have agents/brokers that access a training or sensitive information section that is not intended for public eyes or indexing, isn't this where no index no follow could apply?

Also does yoast plug in automatically update xml with meta in their no index page option mentioned above?

1 0

Thanks for sharing and so just a quick question for an insurance website. Please also forgive I'm still a layman but If I have agents/brokers that access a training or sensitive information section that is not intended for public eyes or indexing, isn't this where no index no follow could apply? Also does yoast plug in automatically update xml with meta in their no index page option mentioned above?
Cancel
- Michael Cottam
 
 2017-04-12T09:39:53-07:00
 
 I wouldn't use just noindex for those, I'd make sure those pages are password-protected instead. Otherwise not-very-well-behaved bots and scrapers will still be able to see (and perhaps copy) those pages.
 
 Important note with Yoast configuration: you MUST make sure that what you're including in your XML sitemaps aligns with what you're indexing/noindexing on the pages themselves. It doesn't do this for you automatically.
 
 1 0
 
 I wouldn't use just noindex for those, I'd make sure those pages are password-protected instead. Otherwise not-very-well-behaved bots and scrapers will still be able to see (and perhaps copy) those pages. Important note with Yoast configuration: you MUST make sure that what you're including in your XML sitemaps aligns with what you're indexing/noindexing on the pages themselves. It doesn't do this for you automatically.
 Cancel
Sandy Maquilin

2017-04-11T07:36:46-07:00

Is it good to use a plain simple straight forward sitemap or a tree like sitemap in an e-commerce website?

Google does not follow the sitemap at all, they crawl more than what the sitemap says, they sort of juice out everything they can find in your domain, which is worthwhile and fresh. This is the main issue for so many duplicate contents especially in e-commerce platforms. Google should be considering the use of sitemap strictly especially in e-commerce websites.

1 0

Is it good to use a plain simple straight forward sitemap or a tree like sitemap in an e-commerce website? Google does not follow the sitemap at all, they crawl more than what the sitemap says, they sort of juice out everything they can find in your domain, which is worthwhile and fresh. This is the main issue for so many duplicate contents especially in e-commerce platforms. Google should be considering the use of sitemap strictly especially in e-commerce websites. 
Cancel
- Michael Cottam
 
 2017-04-11T10:10:16-07:00
 
 I generally recommend for e-com sites creating a bunch of separate sitemaps for similar pages. Note I said "similar" and not "related"...I wouldn't create a sitemap for all types of pages in one product group, for instance...instead, I'd create a sitemap for blog posts, one for all category pages, one for all subcategory pages, and then one or more for all product pages. You want to be able to see what types of pages are giving you indexation nightmares.
 
 2 0
 
 I generally recommend for e-com sites creating a bunch of separate sitemaps for similar pages. Note I said "similar" and not "related"...I wouldn't create a sitemap for all types of pages in one product group, for instance...instead, I'd create a sitemap for blog posts, one for all category pages, one for all subcategory pages, and then one or more for all product pages. You want to be able to see what types of pages are giving you indexation nightmares.
 Cancel
Mario Lurig

2017-04-12T11:03:01-07:00

You were unclear as to when it was a good idea to use noindex,nofollow so I thought I'd provide an example.

I use noindex on pages that shouldn't ever be seen (such as a web app) in search engines. While 99% of the time they are accessed by a user/pw wall, I also have a custom HREF and script that will log you into the demo account, thus an avenue where a crawler could find themselves on a page that should never be in the index.

I use nofollow when the majority of links on that page are to other noindex pages, such as in the web app.

1 0

You were unclear as to when it was a good idea to use noindex,nofollow so I thought I'd provide an example. I use noindex on pages that shouldn't ever be seen (such as a web app) in search engines. While 99% of the time they are accessed by a user/pw wall, I also have a custom HREF and script that will log you into the demo account, thus an avenue where a crawler could find themselves on a page that should never be in the index. I use nofollow when the majority of links on that page are to other noindex pages, such as in the web app. 
Cancel
- Michael Cottam
 
 2017-04-12T12:08:20-07:00
 
 Hi Mario, I think I covered that pretty well in the Consistency section? I wouldn't use nofollow on a page unless 100% of the outbound links are to noindexed pages....otherwise, you're just throwing away link juice.
 
 1 0
 
 Hi Mario, I think I covered that pretty well in the Consistency section? I wouldn't use nofollow on a page unless 100% of the outbound links are to noindexed pages....otherwise, you're just throwing away link juice.
 Cancel
 - Mario Lurig
 
 2017-04-12T12:22:26-07:00
 
 You did, "you’re being a tease." So you're contradicting yourself by saying it's okay to be a tease as long as there is at least 1 link that should be followed, when that's just not true.
 
 1 0
 
 You did, "you’re being a tease." So you're contradicting yourself by saying it's okay to be a tease as long as there is at least 1 link that should be followed, when that's just not true. 
 Cancel
 - Michael Cottam
 
 2017-04-12T12:26:37-07:00
 
 I think you misunderstood. I think it's perfectly fine to tell Google you'd like the outbound links from a page to be counted, but that you don't think the page itself is index-worthy content.
 
 1 0
 
 I think you misunderstood. I think it's perfectly fine to tell Google you'd like the outbound links from a page to be counted, but that you don't think the page itself is index-worthy content. 
 Cancel
tay.nau

2017-04-12T09:43:02-07:00

Hi Michael,

Thanks for the article.

I have a question regarding how to differentiate between utility pages and high quality search landing pages.

My company is currently working on creating a new ecommerce site for one of our clients who runs a local business. This store has hundreds of products, and I've noticed that all of the product descriptions are word for word the same with just the name of the product being different.

I understand that in an ideal world, we would create unique descriptions for each product, but this client doesn't have the time or money to devote to such an effort for his hundreds of different products.

Since there is so much duplicate content on these pages, would it be a bad idea to noindex, follow these product pages?

With the current site, these pages are being indexed, and I'm wondering if we couldn't improve our client's rankings quicker by not indexing them in the new iteration of his site vs. spending the time, effort, and money to create unique product descriptions with quality content (which isn't a viable option currently).

What are your thoughts on an approach such as this?

Thanks,

Taylor

1 0

Hi Michael, Thanks for the article. I have a question regarding how to differentiate between utility pages and high quality search landing pages. My company is currently working on creating a new ecommerce site for one of our clients who runs a local business. This store has hundreds of products, and I've noticed that all of the product descriptions are word for word the same with just the name of the product being different. I understand that in an ideal world, we would create unique descriptions for each product, but this client doesn't have the time or money to devote to such an effort for his hundreds of different products. Since there is so much duplicate content on these pages, would it be a bad idea to noindex, follow these product pages? With the current site, these pages are being indexed, and I'm wondering if we couldn't improve our client's rankings quicker by not indexing them in the new iteration of his site vs. spending the time, effort, and money to create unique product descriptions with quality content (which isn't a viable option currently). What are your thoughts on an approach such as this? Thanks, Taylor
Cancel
- Michael Cottam
 
 2017-04-12T14:15:00-07:00
 
 Hi Taylor,
 
 I think what I would do is this: look at search traffic in aggregate to those product pages--try using URL patterns in Search Analytics in Search Console to see this. If you're not getting search traffic to those pages anyway, then I'd noindex them, as you're right....they may be dragging down your rankings for other pages on the site. If you ARE getting search traffic to them, leave them alone else you're cutting off traffic from Google.
 
 Note that I believe that Google has some sort of overall site quality ranking factor that affects your best pages based on something like the average quality of pages on your site....I believe this based on what I've seen happen on clients' sites when they've pruned off a lot of thin content. But, I don't recall ever seeing any statement from Google backing this up, so it's just my gut feel based on patterns I think I've seen.
 
 1 0
 
 Hi Taylor, I think what I would do is this: look at search traffic in aggregate to those product pages--try using URL patterns in Search Analytics in Search Console to see this. If you're not getting search traffic to those pages anyway, then I'd noindex them, as you're right....they may be dragging down your rankings for other pages on the site. If you ARE getting search traffic to them, leave them alone else you're cutting off traffic from Google. Note that I believe that Google has some sort of overall site quality ranking factor that affects your best pages based on something like the average quality of pages on your site....I believe this based on what I've seen happen on clients' sites when they've pruned off a lot of thin content. But, I don't recall ever seeing any statement from Google backing this up, so it's just my gut feel based on patterns I think I've seen.
 Cancel
Joseph de Souza

2017-04-11T02:32:07-07:00

Great post explaining XML sitemaps. However, I have noticed that if you want fast indexing, submitting to Google via Google search console is the fastest to get a page indexed. And if you domain has got reasonable amount of authority .... the page may start appearing in search results within hours.

Next for category pages . In case you want a category page to get indexed and rank in the search results make sure there is enough amount of relevant , unique text around 1,000 words or more the better. And then submit to Google with the option to index the linked pages as well

Regarding the internal linking issue..... For a domain with a reasonable amount of authority having a lot of internal links will definitely help in getting the page indexed faster.

2 1

Great post explaining XML sitemaps. However, I have noticed that if you want fast indexing, submitting to Google via Google search console is the fastest to get a page indexed. And if you domain has got reasonable amount of authority .... the page may start appearing in search results within hours. Next for category pages . In case you want a category page to get indexed and rank in the search results make sure there is enough amount of relevant , unique text around 1,000 words or more the better. And then submit to Google with the option to index the linked pages as well Regarding the internal linking issue..... For a domain with a reasonable amount of authority having a lot of internal links will definitely help in getting the page indexed faster. 
Cancel
- Michael Cottam
 
 2017-04-11T08:38:08-07:00
 
 Good point, Joseph. Submitting (or resubmitting, if you've made a major update) a page in Search Console is a hint to Google that you think it's important and worth crawling before whatever would normally be in the queue to crawl from your website.
 
 Category pages: Google appears to be less fond than it used to be of plain old category archives pages where there's an H1 heading and then a list of either products or blog posts. Fair enough: really all that page is is a list of links (and that's what Google wants to be!). Improving the content on a category page by adding an overview, some images or videos--that makes for a better page about that topic, for sure. From a UX perspective, many users just want to see the products (or blog posts) because they're familiar with the topic overall, and so often people will put a snippet of the overview up top and hide the majority of it initially, and supply a "Read more" link or button.
 
 2 0
 
 Good point, Joseph. Submitting (or resubmitting, if you've made a major update) a page in Search Console is a hint to Google that you think it's important and worth crawling before whatever would normally be in the queue to crawl from your website. Category pages: Google appears to be less fond than it used to be of plain old category archives pages where there's an H1 heading and then a list of either products or blog posts. Fair enough: really all that page is is a list of links (and that's what Google wants to be!). Improving the content on a category page by adding an overview, some images or videos--that makes for a better page about that topic, for sure. From a UX perspective, many users just want to see the products (or blog posts) because they're familiar with the topic overall, and so often people will put a snippet of the overview up top and hide the majority of it initially, and supply a "Read more" link or button.
 Cancel
 - Aidan Healy
 
 2017-04-17T00:28:05-07:00
 
 That's great! Category pages will be helpful in "short tail keywords". Yes, I can relate your explanation on how a category page should be with IKEA's category pages.
 
 2 0
 
 That's great! Category pages will be helpful in "short tail keywords". Yes, I can relate your explanation on how a category page should be with IKEA's category pages.
 Cancel
Shiv Jaiswal

2017-04-12T05:22:20-07:00

For better performance, we must configure frequencies and priorities of each urls in XML sitemap. Do not use invalid URLs in XML sitemap and must validate them in Google search console.

ShivJaiswal edited 2017-04-12T05:23:23-07:00
1 0

For better performance, we must configure frequencies and priorities of each urls in XML sitemap. Do not use invalid URLs in XML sitemap and must validate them in Google search console.
Cancel
Ron Spinabella

2017-04-11T10:36:43-07:00

Thanks Michael, lots of useful info in here, thanks for the help. Any reccommendations on how to structure the sitemap besides how important the content is? Ive seen some sitemaps that tell google what the content is ex products, blog, articl

1 0

Thanks Michael, lots of useful info in here, thanks for the help. Any reccommendations on how to structure the sitemap besides how important the content is? Ive seen some sitemaps that tell google what the content is ex products, blog, articl
Cancel
- Michael Cottam
 
 2017-04-11T13:32:54-07:00
 
 I doubt Google pays attention to those other fields. See the comment above mentioned Gary Illyes' tweet saying even the priority field is "just noise".
 
 2 0
 
 I doubt Google pays attention to those other fields. See the comment above mentioned Gary Illyes' tweet saying even the priority field is "just noise". 
 Cancel
Juan Luis Toboso

2017-04-11T01:10:25-07:00

Great and useful information.

I have a few doubts about the application. If you want to avoid indexing pages like 'Who we are' or 'Contact us' and other irrelevant pages you recommend using meta robots "no index, follow" right?

An easy way to do it for a Wp web includes Yoast Seo plugin. Is it correct this way or is there a better one?

Is there any way to know if a page is A, B, C, D...?

Great post. Very useful for non technical seos. Thank you!

2 3

Great and useful information. I have a few doubts about the application. If you want to avoid indexing pages like 'Who we are' or 'Contact us' and other irrelevant pages you recommend using meta robots "no index, follow" right? An easy way to do it for a Wp web includes Yoast Seo plugin. Is it correct this way or is there a better one? Is there any way to know if a page is A, B, C, D...? Great post. Very useful for non technical seos. Thank you!
Cancel
- Abhishek Singh Rao
 
 2017-04-11T02:19:07-07:00
 
 If you don't care about potential recoil in website performance then robots.txt will be useful. But I recommend to do to noindex, follow because it indicates search engines that you do not want the pages to be indexed.
 
 2 0
 
 If you don't care about potential recoil in website performance then robots.txt will be useful. But I recommend to do to noindex, follow because it indicates search engines that you do not want the pages to be indexed. 
 Cancel
 - Michael Cottam
 
 2017-04-11T10:11:52-07:00
 
 How do you see robots.txt affecting performance? It's not processed by the web server with every request, like .htaccess is.
 
 1 0
 
 How do you see robots.txt affecting performance? It's not processed by the web server with every request, like .htaccess is.
 Cancel
- Michael Cottam
 
 2017-04-11T08:32:44-07:00
 
 I'm a big fan of the Yoast plug-in, and yes, there's a page setting that allows you to noindex specific pages. They've also got some very helpful settings like noindexing subpages of archives, noindexing tag archives, etc.
 
 4 0
 
 I'm a big fan of the Yoast plug-in, and yes, there's a page setting that allows you to noindex specific pages. They've also got some very helpful settings like noindexing subpages of archives, noindexing tag archives, etc.
 Cancel
 - Juan Luis Toboso
 
 2017-04-12T09:34:38-07:00
 
 Thank you Michael.
 
 3 0
 
 Thank you Michael.
 Cancel

Post Analytics

XML Sitemaps: The Most Misunderstood Tool in the SEO's Toolbox

Indexation

Consistency

Overall site quality

The hidden fluff

Noindex vs. robots.txt

Crawl bandwidth management

Indexation problem debugging

Dynamic XML sitemaps

Video sitemaps

Summary

Comments 83

Indexation

Consistency

Overall site quality

The hidden fluff

Noindex vs. robots.txt

Crawl bandwidth management

Indexation problem debugging

Dynamic XML sitemaps

Video sitemaps

Summary

Comments 83

Log in to Moz

Don't have an account?