I'm getting a Crawl error 605 Page Banned by robots.txt, X-Robots-Tag HTTP Header, or Meta Robots Tag

TumbleweedPDX

The website is www.bigbluem.com and is a wordpress site.

I'm getting the following error:

605 Page Banned by robots.txt, X-Robots-Tag HTTP Header, or Meta Robots Tag

But what is weird is the domain it lists below that is http://None/BigBlueM.com

Any advice?

DavidLee

I can now resolve the www version of the site but not reach the root domain which will continue to return the 605 error so there is something about the root domain configuration that is blocking our bot. A workaround would be to create a new campaign for www.bigbluem.com instead of bigbluem.com

TumbleweedPDX

Thanks David!

I noticed that this morning it was showing the correct domain all of a sudden. Thanks for looking into that further.

I've made that change to the robots.txt file so go ahead and test when you can.

Thanks again!

DavidLee

Hello!

Sorry for the confusion. For your site there were two issues, one on our side with our crawler failing between August 21st - 29th crawls trying to reach http://None/yourdomain.com and the site not responding to robots.txt

The crawl issue has been resolved but for some reason your site is still blocking our user-agent.

Rogerbot can't follow allow directives well so you could try updating it with Disallow:

User-agent: Rogerbot
Disallow:

Let me know if this helps! Once you make the change I can run a quick test to see if it will resolve.

TumbleweedPDX

Thanks!

I've made that change, although I still don't know why it would have the URL as http://None/BigBlueM.comThat's concerning and makes me think that it isn't crawling because it's trying to crawl that URL which doesn't exist.

Do you know if I have to wait another week for Moz to attempt a crawl or can I force that to see if it's working?

TumbleweedPDX

Thanks for piping in here! I will definitely rely more heavily on GWMT and will check out Screaming Frog SEO spider. Thanks!

Ray-pp

I've consistently experienced problems with the Moz crawler - to the point of I no longer put much value into it.

I'm getting this error and nothing has changed in my robots.txt.

Instead, use GWMT and Screaming Frog SEO spider - that's all you need and does more than the Moz crawler.

chrissmithps

I have noticed underneath rogerbot you have dissallow change it to

User-agent: rogerbot

Allow: /

Then let me know how you get on crawling the home page.

chrissmithps

No problem, I will have a look into that issue for you now it does seem strange.

TumbleweedPDX

Hey! Thanks for the response.

I didn't have Disallow set up for the root folder at all, just for /wp-admin/.

I went ahead and added the User-agent: rogerbot

The one thing I am still concerned about is that Moz is saying "We were unable to access your homepage" and then has the URL http://None/BigBlueM.com

Why does it think that is my homepage? That seems weird and it isn't that way on any of my other sites that are set up in Moz.

chrissmithps

In order to allow crawlers to to access the site, you would need to either remove the / after Disallow or change Disallow to Allow.

If you specifically want to allow the Moz crawler, you can insert the following directive above the current directive that is in the .htaccess

User-agent: rogerbot
Disallow:

TumbleweedPDX

If you happen to figure out a solution before someone posts here, let me know what it is!

Webformix

I am having the exact same problem, however Google webmaster tools is able to crawl the site just fine.

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

I'm getting a Crawl error 605 Page Banned by robots.txt, X-Robots-Tag HTTP Header, or Meta Robots Tag

605 Page Banned by robots.txt, X-Robots-Tag HTTP Header, or Meta Robots Tag

Got a burning SEO question?

Browse Questions

Explore more categories

Related Questions

How is Moz's DA score so high?

Who and how does one get in Fresh Alerts?

Can't delete items from the on page grader

Site crawl errors - download list of all urls

Duplicate page content/page titles on tages

Crwal errors : duplicate content even with canonical links

Clarify "broad keyword usage in page title"

Hi guys, how can I access the on-demand Top Pages Reports? Thanks!