Moz crawler showing pages blocked by robots.txt
-
I've blocked a large number of pages that Moz was flagging as duplicates or 404s, using rules like /?key and /?p in our robots.txt.
However, the Moz crawler is still reporting these pages as issues. I assumed Roger picked up the robots.txt file, or is that not the case?
-
I'm not really an expert in robots.txt, but you could always try to simulate Rogerbot using a tool like Screaming Frog (the tool is free if you crawl fewer than 500 pages).
You can enter a custom user agent, which for Rogerbot seems to be rogerbot/1.0 (http://moz.com/help/pro/what-is-rogerbot-, rogerbot-crawler+shiny@moz.com), and then try crawling the pages on your site. In default mode, the tool will only crawl pages that are allowed by the robots.txt. If the tool crawls your blocked pages, then something is wrong with the robots.txt. If it doesn't, then you should contact Moz staff and ask them to check.
-
Yep. Checked in the tester at the time and did some more spot checks this morning.
I used Disallow: /*?key, which I thought was generally recognised by bots?
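For anyone comparing the two rule styles mentioned in this thread (/?key versus /*?key), here is a rough sketch of how wildcard-aware crawlers commonly interpret them: a rule is a prefix match on the path plus query string, "*" matches any run of characters, and a trailing "$" anchors the end. The function name rule_matches and the sample URLs are made up for illustration, and whether Rogerbot applies exactly these semantics is an assumption you would need to confirm with Moz.

```python
import re


def rule_matches(pattern: str, path_and_query: str) -> bool:
    """Return True if a robots.txt path pattern matches a URL's path + query.

    Assumed semantics (common wildcard handling, not confirmed for Rogerbot):
    - the pattern is matched from the start of the path (prefix match)
    - '*' matches any sequence of characters, including none
    - a trailing '$' requires the URL to end where the pattern ends
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal chunk, then join the chunks with '.*' for each '*'.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += "$"
    # re.match anchors at the start, which gives the prefix behaviour.
    return re.match(regex, path_and_query) is not None


if __name__ == "__main__":
    # Hypothetical URLs, chosen only to show the difference between the rules.
    for rule in ("/?key", "/*?key"):
        for url in ("/?key=1", "/shop/item?key=1", "/shop/item?page=2"):
            print(f"{rule:8} {url:22} blocked={rule_matches(rule, url)}")
```

Under these assumptions, /?key only blocks the home page with a ?key query string, while /*?key also blocks deeper URLs such as /shop/item?key=1.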
-
Did you check these pages with the robots.txt tester in Webmaster Tools to be sure that they are really blocked for bots? Did you exclude all bots or only Googlebot?
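To illustrate why the "all bots or only Googlebot" question matters, here is a small sketch using Python's standard urllib.robotparser. The robots.txt contents, the /private/ path, and example.com are invented for the example, and it uses a plain prefix rule rather than a wildcard, so it only shows how user-agent groups are selected, not how Rogerbot itself parses the file.

```python
from urllib.robotparser import RobotFileParser


def allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Parse robots.txt text and report whether user_agent may fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Hypothetical file that only addresses Googlebot.
GOOGLE_ONLY = """\
User-agent: Googlebot
Disallow: /private/
"""

# Hypothetical file that addresses every crawler.
ALL_BOTS = """\
User-agent: *
Disallow: /private/
"""

if __name__ == "__main__":
    url = "https://example.com/private/page"
    for label, text in (("Googlebot-only", GOOGLE_ONLY), ("all bots", ALL_BOTS)):
        for agent in ("Googlebot", "rogerbot"):
            print(f"{label:15} {agent:10} allowed={allowed(text, agent, url)}")
```

With the Googlebot-only file, a crawler identifying as rogerbot is still allowed to fetch the page, which would explain pages that test as blocked in Google's tool but keep appearing in another crawler's report.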