3,511 Pages Indexed and 3,331 Pages Blocked by Robots

PeaSoupDigital

Morning,

So I checked our site's index status in WMT, and I'm being told that Google is indexing 3,511 pages and that 3,331 are blocked by robots. This seems slightly odd as we're only disallowing 24 pages in the robots.txt file. In light of this, I have the following queries:

Do these figures mean that Google is indexing 3,511 pages and blocking 3,331 other pages? Or does it mean that it's blocking 3,331 pages of the 3,511 indexed?
As there are only 24 URLs being disallowed in robots.txt, why are 3,331 pages being blocked? Will these be variations of the URLs we've submitted?
Currently, we don't have a sitemap. I know, I know, it's pretty unforgivable but the old one didn't really work and the developers are working on the new one. Once submitted, will this help?
I think I know the answer to this, but is there any way to ascertain which pages are being blocked?

Thanks in advance!

Lewis

PeaSoupDigital

Hi,

No more links than a standard e-commerce site should have...

I'm chasing the sitemap as we speak.

Cheers,

MonicaOConnor

The blocked URLs are probably nofollow links throughout the site. Do you have a lot of links pointing outward from pages?

Google is indexing 3,511 pages, of which 3,331 are blocked by robots.txt. I would check some of the internal/external links on those disallowed pages. I don't see how it could come up to 3,331 blocked pages, but it couldn't hurt to start there.
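One possible explanation for the gap between 24 rules and 3,331 blocked pages is that each Disallow line is a path prefix, so a single rule can match every URL variation beneath it. Here is a rough sketch using Python's standard `urllib.robotparser`. The rules, the example.com URLs and the `blocked_urls` helper are all made up for illustration, and Python's parser only does simple prefix matching, so it will not behave exactly like Googlebot.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules: two Disallow lines, not the site's real robots.txt.
ROBOTS_TXT = """\
User-agent: *
Disallow: /search
Disallow: /basket
"""


def blocked_urls(robots_txt, urls, user_agent="Googlebot"):
    """Return the URLs that the given robots.txt text disallows."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [url for url in urls if not parser.can_fetch(user_agent, url)]


# Hypothetical URLs of the kind an e-commerce site tends to generate.
urls = [
    "https://www.example.com/",
    "https://www.example.com/search?q=shoes",
    "https://www.example.com/search?q=shoes&page=2",
    "https://www.example.com/search/boots",
    "https://www.example.com/basket/add?id=17",
    "https://www.example.com/products/boots",
]

blocked = blocked_urls(ROBOTS_TXT, urls)
for url in blocked:
    print("blocked:", url)
```

Running a list of real URLs from a crawl through something like this would be one way to see which of the 24 rules account for most of the blocked pages.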

Definitely get a sitemap submitted asap. It will help for sure.

Whittie

Excuse the short reply.

Add the sitemap to your robots.txt and submit it to Google WMT.

You could use a free sitemap generator in the meantime if you're in the middle of development.
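For reference, referencing a sitemap from robots.txt is a single `Sitemap:` line with the full URL. A quick sketch to confirm the line is picked up, again with Python's standard `urllib.robotparser` (the `site_maps()` method needs Python 3.8 or later). The robots.txt content and the sitemap URL are placeholders, not the real site's.

```python
from urllib.robotparser import RobotFileParser

# Placeholder robots.txt with a Sitemap line; the URL is an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /basket
Sitemap: https://www.example.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() returns a list of sitemap URLs, or None if no Sitemap line exists.
sitemaps = parser.site_maps()
print(sitemaps)
```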
