Why is Roger crawling pages that are disallowed in my robots.txt file?

MeltButterySpread

I have specified the following in my robots.txt file:

Disallow: /catalog/product_compare/

Yet Roger is crawling these pages = 1,357 errors.

Is this a bug or am I missing something in my robots.txt file?

Here's one of the URLs that Roger pulled:

<colgroup><col width="312"></colgroup>
|

example.com/catalog/product_compare/add/product/19241/uenc/aHR0cDovL2ZyZXNocHJvZHVjZWNsb3RoZXMuY29tL3RvcHMvYWxsLXRvcHM_cD02/

Please let me know if my problem is in robots.txt or if Roger spaced this one. Thanks!

|

MeltButterySpread

Digging in further I discovered that rogerbot had blocked a portion of these URL variations, but 2/3 slipped through. I sent an email to support. Thanks for the suggestion.

blu42media

Digging back through the Q&A... I'm several posts reporting this sort of thing.

http://www.seomoz.org/dp/rogerbot

Perhaps you could try specifically blocking rogerbot? If that doesn't work, an email to the SEOmoz team may do the trick

MeltButterySpread

Yes, blocking all --> *

blu42media

Have you specified a User-Agent?

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

Why is Roger crawling pages that are disallowed in my robots.txt file?

Got a burning SEO question?

Browse Questions

Explore more categories

Related Questions

Website blocked by Robots.txt in OSE

Moz campaign works around my robots.txt settings

Functionality of SEOmoz crawl page reports

On-Page Report Card B grade because its a PPC landing page

Page authority questions?

Page Penalization

How to get rid of the message "Search Engine blocked by robots.txt"

Can I exclude pages from my Crawl Diagnostics?