Why is Roger crawling pages that are disallowed in my robots.txt file?

MeltButterySpread

I have specified the following in my robots.txt file:

Disallow: /catalog/product_compare/

Yet Roger is crawling these pages = 1,357 errors.

Is this a bug or am I missing something in my robots.txt file?

Here's one of the URLs that Roger pulled:

<colgroup><col width="312"></colgroup>
|

example.com/catalog/product_compare/add/product/19241/uenc/aHR0cDovL2ZyZXNocHJvZHVjZWNsb3RoZXMuY29tL3RvcHMvYWxsLXRvcHM_cD02/

Please let me know if my problem is in robots.txt or if Roger spaced this one. Thanks!

|

MeltButterySpread

Digging in further I discovered that rogerbot had blocked a portion of these URL variations, but 2/3 slipped through. I sent an email to support. Thanks for the suggestion.

blu42media

Digging back through the Q&A... I'm several posts reporting this sort of thing.

http://www.seomoz.org/dp/rogerbot

Perhaps you could try specifically blocking rogerbot? If that doesn't work, an email to the SEOmoz team may do the trick

MeltButterySpread

Yes, blocking all --> *

blu42media

Have you specified a User-Agent?

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

Why is Roger crawling pages that are disallowed in my robots.txt file?

Got a burning SEO question?

Browse Questions

Explore more categories

Related Questions

Keyword Stuffing - MOZ On-Page Grader

Crawl Diagnostics Summary Problem

Not all pages are being crawled

Crawl credits how to buy more?

SEOMOZ Crawl Test

Crawl test tool from SEOmoz - which URLs does it actually crawl?

Rogerbot Ignoring Robots.txt?

Page and Domain Authority and other bits