Why is Roger crawling pages that are disallowed in my robots.txt file?

MeltButterySpread

I have specified the following in my robots.txt file:

Disallow: /catalog/product_compare/

Yet Roger is crawling these pages = 1,357 errors.

Is this a bug or am I missing something in my robots.txt file?

Here's one of the URLs that Roger pulled:

<colgroup><col width="312"></colgroup>
|

example.com/catalog/product_compare/add/product/19241/uenc/aHR0cDovL2ZyZXNocHJvZHVjZWNsb3RoZXMuY29tL3RvcHMvYWxsLXRvcHM_cD02/

Please let me know if my problem is in robots.txt or if Roger spaced this one. Thanks!

|

MeltButterySpread

Digging in further I discovered that rogerbot had blocked a portion of these URL variations, but 2/3 slipped through. I sent an email to support. Thanks for the suggestion.

blu42media

Digging back through the Q&A... I'm several posts reporting this sort of thing.

http://www.seomoz.org/dp/rogerbot

Perhaps you could try specifically blocking rogerbot? If that doesn't work, an email to the SEOmoz team may do the trick

MeltButterySpread

Yes, blocking all --> *

blu42media

Have you specified a User-Agent?

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

Why is Roger crawling pages that are disallowed in my robots.txt file?

Got a burning SEO question?

Browse Questions

Explore more categories

Related Questions

Why did Moz crawl our development site?

Why would my sub landing pages have a higher Moz Rank than my home page

1 page crawled ... and other errors

Has any on else experienced a spike in crawl errors?

Broken CSV files?

Article Directory Page Rank 7

How long is a full crawl?

How to check Page Authority in bulk?