Why is Roger crawling pages that are disallowed in my robots.txt file?

MeltButterySpread

I have specified the following in my robots.txt file:

Disallow: /catalog/product_compare/

Yet Roger is still crawling these pages, which shows up as 1,357 errors.

Is this a bug or am I missing something in my robots.txt file?

Here's one of the URLs that Roger pulled:

example.com/catalog/product_compare/add/product/19241/uenc/aHR0cDovL2ZyZXNocHJvZHVjZWNsb3RoZXMuY29tL3RvcHMvYWxsLXRvcHM_cD02/

Please let me know if my problem is in robots.txt or if Roger spaced this one. Thanks!

MeltButterySpread

Digging in further, I discovered that rogerbot had honored the block for a portion of these URL variations, but about 2/3 of them slipped through. I sent an email to support. Thanks for the suggestion.

blu42media

Digging back through the Q&A, I see several posts reporting this sort of thing.

http://www.seomoz.org/dp/rogerbot

Perhaps you could try specifically blocking rogerbot? If that doesn't work, an email to the SEOmoz team may do the trick.

MeltButterySpread

Yes, the rule is set for all user agents (User-agent: *).

blu42media

Have you specified a User-Agent?
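
For anyone who wants to sanity-check the same setup locally, here is a minimal sketch using Python's standard urllib.robotparser. It only shows how that one parser reads the rules, not how rogerbot itself interprets them. The http:// scheme on the URL, the "rogerbot" user-agent token, and the dedicated rogerbot group are assumptions added for illustration.

```python
from urllib.robotparser import RobotFileParser

# A Disallow line with no User-agent line above it (as quoted in the question).
NO_AGENT = "Disallow: /catalog/product_compare/\n"

# The same rule under a wildcard group.
WILDCARD = "User-agent: *\nDisallow: /catalog/product_compare/\n"

# Wildcard group plus a dedicated group for rogerbot (assumed token).
WITH_ROGERBOT = (
    "User-agent: *\n"
    "Disallow: /catalog/product_compare/\n"
    "\n"
    "User-agent: rogerbot\n"
    "Disallow: /catalog/product_compare/\n"
)

# URL from the question; the http:// scheme is added here as an assumption
# so that the path can be parsed out of it.
URL = (
    "http://example.com/catalog/product_compare/add/product/19241/uenc/"
    "aHR0cDovL2ZyZXNocHJvZHVjZWNsb3RoZXMuY29tL3RvcHMvYWxsLXRvcHM_cD02/"
)


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if this parser would let user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for label, rules in [
        ("no User-agent line", NO_AGENT),
        ("User-agent: *", WILDCARD),
        ("* plus rogerbot group", WITH_ROGERBOT),
    ]:
        verdict = "allowed" if is_allowed(rules, "rogerbot", URL) else "blocked"
        print(f"{label}: {verdict}")
```

With this parser, a Disallow line that has no User-agent line above it is ignored, which is why the User-agent question matters. If the check reports the URL as blocked for both the wildcard and the rogerbot-specific rules yet the crawler still fetches it, that would point toward the crawler rather than the file, and emailing support is the reasonable next step.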
