What is the full user-agent for rogerbot?

uShip

IT is blocking AWS via a proxy in front of our server. We've tried allowing the "roberbot" user-agent but crawling functionality still isn't working in my Moz Pro account. Is there a more specific user-agent we can allow in our proxy software? Thank you.

SarahWJATL

We are having this same issue. I was hoping someone clarified it for you.

Did you ever get it sorted out?

BlueprintMarketing

User-agent: *

User-agent: dotbot
Disallow: /
User-agent: rogerbot
Disallow: /

if you want to prevent robots from crawling your site truly prevent them you will need to use either a password restriction or a tool similar to this

http://www.distilnetworks.com/

If you see what is being said by Google and Moz a robots.txt file can not guarantee blocking something that is linked to. if you want to do that you will have to block the referral using a WAF like distilnetworks

http://moz.com/help/guides/search-overview/crawl-diagnostics

https://moz.com/researchtools/ose/dotbot

&

https://support.google.com/webmasters/answer/6062608?rd=2

Also blocking link analysis user agents that are nothing but a drain on your resources is a good idea. Simple enough to do in htaccess with something like this:

Search Engine Blocked by Robots.txt

This page cannot be crawled by search engines due to the robots.txt protocol. If you're seeking to remove this page from search results, we recommend that you use meta robots (with noindex, follow values) instead of robots.txt. This will ensure that the page does not appear in the results but allows link juice to flow through the page's links and count towards the relevance/popularity of other pages on your site.

How to block DotBot from crawling your site

If you don't want dotbot crawling your site, we always respect the standard Robots Exclusion Protocol (aka robots.txt). If you would like to block dotbot, all you need to do is add our user-agent string to your robots.txt file.

If you want to ban dotbot from most areas of your site, it looks a little something like this:

      User-agent: dotbot
      Disallow: /admin/
      Disallow: /scripts/
      Disallow: /images/

below this I have placed what somebody has created that they state works I do not know if it works I told you that distill networks will work but I cannot guarantee the very bottom I think you will not have any trouble if you set up the robots.txt as configured at the top.

If you want to ban dotbot from crawling any part of your site, add this text instead:

      User-agent: dotbot
      Disallow: /

BEGIN

RewriteEngine On
RewriteCond %{HTTP_USER_AGENT} ^rogerbot [OR]
RewriteCond %{HTTP_USER_AGENT} ^exabot [OR]
RewriteCond %{HTTP_USER_AGENT} ^MJ12bot [OR]
RewriteCond %{HTTP_USER_AGENT} ^dotbot [OR]
RewriteCond %{HTTP_USER_AGENT} ^gigabot [OR]
RewriteCond %{HTTP_USER_AGENT} ^AhrefsBot
RewriteRule .* – [F]

SetEnvIfNoCase User-Agent .rogerbot. bad_bot
SetEnvIfNoCase User-Agent .exabot. bad_bot
SetEnvIfNoCase User-Agent .mj12bot. bad_bot
SetEnvIfNoCase User-Agent .dotbot. bad_bot
SetEnvIfNoCase User-Agent .gigabot. bad_bot
SetEnvIfNoCase User-Agent .ahrefsbot. bad_bot
SetEnvIfNoCase User-Agent .sitebot. bad_bot

Order Allow,Deny
Allow from all
Deny

END

Thomas

uShip

Thank you for your response but this doesn't answer my question. We aren't blocking rogerbot using robots.txt. We need to allow it through the proxy in front of our web server by using the exact user-agent (case sensitive) that is being sent by rogerbot. We've tried "rogerbot" but that isn't working. Based on the 3rd party documentation you linked to there seem to be a variety of possibilities:

roberbot/1.0
roberBot/1.0
RogerBot/1.0
rogerBot
RogerBot

It would be great if Moz provided clear documentation on this.

BlueprintMarketing

HiI know there are two crawlers that Moz uses Roger bot and open site Explorer uses dotbot

Make sure there is no forward slash "/" after e.g. (Disallow:/)

Mozused to have an awesome writeup on it but it just forwards to Moz.com/help now it could be that they have another great write up but the URL changed. For now here's the information

User-agent: rogerbot
Disallow:
User-agent: dotbot
Disallow:

User Agent Analyser
Mozilla/5.0 (compatible; DotBot/1.1; http://www.opensiteexplorer.org/dotbot, help@moz.com)

http://www.useragentstring.com/Dotbot1.1_id_16014.php

https://udger.com/resources/ua-list/bot-detail?bot=rogerbot

http://www.botopedia.org/user-agent-list/crawlers/item/369-rogerbot-seomoz

hope this helps,

Thomas

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

What is the full user-agent for rogerbot?

How to block DotBot from crawling your site

BEGIN

END

Got a burning SEO question?

Explore more categories

Products

Moz Solutions

Free SEO Tools

Resources

About Moz

Why Moz

Get Involved