Why does SEOMoz crawler ignore robots.txt?
-
The SEOMoz crawler ignores robots.txt
It also "indexes" pages marked as noindex.
That means it is filling up the reports with things that don't matter.
Is there any way to stop it doing that?
-
Hi Alan,
The code should be ok
Try to "drive-test" it with a custom crawl from http://pro.seomoz.org/tools/crawl-test then you will see if it works well.
I am glad the link was useful.
Gr.,
Istvan
-
Thank you István
I added this:
User-agent: rogerbot
Disallow: /sendtoafriend/
Disallow: /photo/
Disallow: /pix/- because crawlers shouldn't go down those paths and roger is detecting pages without descriptions.
Is what I added OK?
-
Hi,
You can block RogerBot from Robots.txt
Check for further instructions on: http://www.seomoz.org/dp/rogerbot
"Please note: Adding this code will prevent our crawl test tool from being able to crawl your website."
Gr.,
Istvan
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Seomoz crawl: 4XX (Client Error) How to find were the error are?
I got eight 404 errors with the Seomoz crawl, but the report does not says where the 404 page is linked from (like it does for dup content), or I'm I missing something? Thanks
Moz Pro | | PaddyDisplays0 -
Does anybody really think that SEOMoz provides much value?
Crawl results lag so far behind as to be of questionable value for corrective purposes, and in the fast-paced world of SEO, the service seems right out of the horse-and-buggy era.. I corrected crawl errors two weeks ago, and yet SEOMoz' crawls are still not reflecting this. Furthermore, SEOMoz' idea of where my keywords rank has little to no bearing on reality. I am really disenchanted, and thinking of cancelling my subscription.
Moz Pro | | amadomon0 -
SEOmoz showing crawl errors but webmastertools says no errors, need help!
Hi this is my first question and i couldnt find a similar question on here. basically i have a clients website that is showing 150 duplicate page titles and content errors plus others. SEOmoz analysis is showing me for example is 3 duplicate hompage URLS: 1.www.domain.com 2.domain.com 3.www.domain.com/index.html all 3 are the same page. after explaining to the guy (who built the website) the errors, he ensured me that the main URL is URl 1. and the other 2 are 301 redirects. however SEOmoz analysis doesnt seem to change the results and webmastertools doesnt seem to show any errors at all. also if i try all 3 URL's there are no redirects to URL 1. any help or clarity would be awesome! Thanks e-bob
Moz Pro | | bobsnowzell0 -
SEOmoz directory list, discussing individual directories...
It would be nice if the directory list (http://www.seomoz.org/directories) had an area to discuss each individual directory. Some of the directories like Internet Library have great domain authority, but have a large number of thumbs-down from users. I would like to know why. Personally, I'm very skeptical about directories. I'd love to prove myself wrong, though. The SEOmoz directory list in its current form isn't all that helpful.
Moz Pro | | MicahMMG0 -
SEOMOZ Stats dont work out
Hi, When I check my mozstats for the homepage it says the PA is 50 but the DA is 30, how can that be? I would expect them to either be the same or at least the DA to be higher then the PA. Cheers
Moz Pro | | activitysuper0 -
SEOmoz crawl error questions
I just got my first seomoz crawl report and was shocked at all the errors it generated. I looked into it and saw 7200 crawl errors. Most of them are duplicate page titles and duplicate page content. I clicked into the report and found that 97% of the errors were going off of one page It has ttp://legendzelda.net/forums/index.php/members/page__sort_key__joined__sort_order__asc__max_results__20 http://legendzelda.net/forums/index.php/members/page__sort_key__joined__sort_order__asc__max_results__20__quickjump__A__name_box__begins__name__A__quickjump__E etc Has 20 pages of slight variations of this link. It is all my members list or a search of my members list so it is not really duplicate content or anything. How can I get these errors to go away and make search my site is not taking a hit? The forum software I use is IPB.
Moz Pro | | NoahGlaser780 -
Has SEOMoz Domain Authority calc been updated recently
As ever, my apologies if this question's been asked before -- I searched for it, and didn't find it. But, our SEOMoz 'Domain Authority' has recently really jumped (from 67, where it was for many months, to 77). And, our number of Linking Root Domains increased as well, from 1,986 to 3,349) Of course, none of this has translated into Google goodness, but if any of it's true, maybe it WILL in the future? Any thoughts/ideas? Thanks, Dave (www.TeachStreet.com)
Moz Pro | | daveschappell0 -
Help with SEOmoz API
Hi guys, I'm trying to make API requests from my webserver via PHP. I'd like to retrieve data from the SEOmoz URL Metrics API. Unfortunately I always get the error response "unauthorized" even when I copy and paste the Sample Valid API Signature generated by your system into the browser. Is Signed Authentication not longer supported? I even tried the sample PHP Code SignedAuth.php but there's the same problem, too. If signed authentication is not longer available, do you have a code example for the basic http authorization? Thanks, Brandon
Moz Pro | | thegreatpursuit1