Moz campaign works around my robots.txt settings
-
My robots.txt file looks like this:
User-agent: *
Disallow: /*?
Disallow: /search
So, it should block (deindex) all dynamic URLs.
If I check this url in Google:
site:http://www.webdesign.org/search/page-1.html?author=47
Google tells me:
A description for this result is not available because of this site's robots.txt – learn more.
So far so good.
Now, I ran a Moz SEO campaign and I got a bunch of duplicate page content errors.
One of the links is this one:
http://www.webdesign.org/search/page-1.html?author=47
(the same I tested in Google and it told me that the page is blocked by robots.txt which I want)
So, it makes me think that Moz campaigns check files regardless of what robots.txt say? It’s my understanding User-agent: * should forbid Rogerbot from crawling as well. Am I missing something?
-
That worked, thanks!
-
Thanks Abe.
I guess I'll try this:
Useragent: Rogerbot
Disallow: /*?
Because if I use Disallow: / I'll lose my current Moz reports because Rogerbot will just ignore all my file, right?
-
Hello Vince, thank you for reaching out to us! This seems quite odd, our crawler usually obeys all robots.txt files. Let's try this. Add this code to your robots.txt:
Useragent: Rogerbot
Disallow: /
This should specifically instruct us to follow these rules. Once you have tried this, if it does not work, please send an email to help@moz.com and we will have our engineers dig in a bit further. Sorry for the inconvenience, I hope the above fix works for you.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Unsolved Solve a Lost GA Connection - can't get it to work
Hi, I followed all steps on https://moz.com/help/moz-pro/getting-started/troubleshoot-google-analytics?_ga=2.15852085.2007735045.1676157945-122376721.1676157945 below Solve a Lost GA Connection multiple times but I keep getting the message "Our connection to your Google Analytics account was lost. Don't worry, you won't lose any data. Please follow the steps in our guide to reconnect.
Moz Pro | | silvansoeters
" I am on the free trial and would obviously like to connect to GA to see how everything is working. Any help would be greatly appreciated. Thanks.0 -
Does MOZ Provide an On Page Grader Embed?
I am looking to provide an on page grader on our website. Does MOZ provide this feature?
Moz Pro | | WebMarkets0 -
MOZ Pro - Page Grader Queries
Greetings! I started a new position last week and the extremely supportive director promised to give me anything I required to make my job easier. Of course, my first port of call was MOZ Pro. Having never used MOZ Pro before, I've just been getting to grips with it, fixing any pressing issues and giving the whole site a general SEO health check. A few fairly major issues have been flagged, which I'm in the process of fixing, and I'm currently putting our main landing pages through the MOZ Page Grader. After a little bit of tinkering, our iPhone 6 cases page has been graded B for the term 'iPhone 6 cases', but I have a few queries/concerns regarding some of the suggested fixes: **Avoid keyword stuffing in document **- The term 'iphone 6 cases' only appears thrice in the body, so it can't be that? The term, however, appears 23 times in the page's img alt tags. Could this be the issue? This is an ecommerce site that sells iPhone 6 cases, so the img alt tags are bound to contain that keyword. Each img alt tag is unique, so I don't really know what I can do here? "Show details for YouSave iPhone 6 0.6mm Clear Gel Case" is the example of the img alt tag of one of the products on that page, surely I can't remove the words 'iPhone 6 case'? Avoid too many internal links - MOZ suggests keeping the internal links to below 100 or, at a minimum, less than 100 links on the main navigation menu. I haven't counted, but I'd guess that page has more than 100 links, but not too many on the navigation menu. To me, this looks like a standard ecommerce page, with links to products and different pages via the top and bottom menus. Would I improve visibility if I reduced the amount of links by, say, reducing the number of products on the page? We currently have it set to 36, but can easily be reduced. Only One Canonical URL - We've put a fix in place for this issue and are just waiting for it to go live. For some reason rel=canonical tags have been duplicated on the majority of the pages. Like I say, this is being remedied, but I just wondered whether a duplicate tag negatively affect the page's visibility? The tags are identical and just point to the page they're on. I think that's about it for now! Thanks in advance and keep up the good work! Cheers, Lewis (Andrew is the name of the director) UPDATE Now I've sorted the rel=canonical issue, the pages are being graded A but still with the first two suggestions above.
Moz Pro | | PeaSoupDigital0 -
Website blocked by Robots.txt in OSE
When viewing my client's website in OSE under the Top Pages tab, it shows that ALL pages are blocked by Robots.txt. This is extremely concerning because Google Webmaster Tools is showing me that all pages are indexed and OK. No crawl errors, no messages, no nothing. I did a "site:website.com" in Google and all of the pages of the website returned. Any thoughts? Where is OSE picking up this signal? I cannot find a blocked robots tag in the code or anything.
Moz Pro | | ConnellyPartners0 -
Can I exclude a sub-domain from SEOMoz campaigns?
We have recently implemented a white label site that is on a sub-domain. The site employs noindex on most of the pages I imagine due to duplicate content concerns on other white label versions of the site. It has led to a spike of over 14 thousand notices on our report. Is there a way to exclude a sub-domain from the SEOMoz scans and reports?
Moz Pro | | TSDigital0 -
Usable to set up campaign because site cannot be
I don't understand this message. i never had problems with other sites and now I get problems with this message when trying to set a campaign twice for 2 different sites. I received the same message twice. What do I do? Help! We have detected that the root domain xxxxxxxxxxxxxxxxxxxx does not respond to web requests. Using this domain, we will be unable to crawl your site or present accurate SERP information. Thanks.
Moz Pro | | mcuneo0 -
Archiving Campaigns in SEOmoz
First off, I love the campaign archive feature. Very useful for my purposes. My question is: Is there a limit to how many campaigns I can archive? Thanks in advance!
Moz Pro | | CollinJarman0 -
Possible to have more then one SEO Moz login / account
We have an SEO Moz subscription and we were wondering if there was a way to setup multiple users attached to that subscription (i.e. separate logins).
Moz Pro | | Panjiva2