What are the negative implications of listing URLs in a sitemap that are then blocked in the robots.txt?

richdan

In running a crawl of a client's site I can see several URLs listed in the sitemap that are then blocked in the robots.txt file.

Other than perhaps using up crawl budget, are there any other negative implications?

Dezzign

I highly doubt it would affect rankings through low-quality issues, but it will trigger sitemap warnings in your GWT console. The issue is technically classified under 'Warnings', not 'Errors'. The right thing to do in that scenario is to take the robots.txt block off and use a 'noindex' tag on the pages instead, since a crawler has to be able to fetch a page to see its noindex tag. That way the pages can stay in the sitemap but won't show up in the index. Otherwise, remove them from the sitemap if you don't want the warnings in GWT.
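If it helps to see which sitemap URLs are affected before deciding, here is a minimal sketch using only the Python standard library. It assumes you already have the sitemap XML and the robots.txt contents as strings; the function name `blocked_sitemap_urls`, the `example.com` URLs, and the sample rules are made up for illustration. Note that Python's `urllib.robotparser` does plain prefix matching and may not treat wildcard rules the way Googlebot does, so read the output as a rough check only.

```python
import xml.etree.ElementTree as ET
from urllib.robotparser import RobotFileParser

# Namespace used by the standard sitemap protocol.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def blocked_sitemap_urls(sitemap_xml, robots_txt, user_agent="Googlebot"):
    """Return the sitemap URLs that robots.txt disallows for user_agent.

    Hypothetical helper: both inputs are plain strings, nothing is fetched.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())

    root = ET.fromstring(sitemap_xml)
    urls = [
        loc.text.strip()
        for loc in root.findall("sm:url/sm:loc", SITEMAP_NS)
        if loc.text
    ]
    # Keep only the URLs the parser says this user agent may not fetch.
    return [url for url in urls if not parser.can_fetch(user_agent, url)]


# Made-up sample data for illustration only.
SAMPLE_ROBOTS = """User-agent: *
Disallow: /private/
"""

SAMPLE_SITEMAP = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.com/</loc></url>
  <url><loc>https://example.com/private/report.html</loc></url>
</urlset>"""

blocked = blocked_sitemap_urls(SAMPLE_SITEMAP, SAMPLE_ROBOTS)
for url in blocked:
    print("In sitemap but blocked by robots.txt:", url)
```

Each URL it reports is a candidate for either removal from the sitemap or for swapping the robots.txt block for a noindex tag.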

LesleyPaone

I personally do not think there is any penalty SEO-wise in doing it. I do think, though, that it will mess up the metric in GWT that shows how many pages have been submitted and how many have been indexed. I find that metric useful, and it stops being useful if a lot of the submitted pages are blocked by the robots.txt.
