Why doesn't Moz crawler follow robots.txt?
-
It is crawling the entire site, and there is stuff we do not want it to. Please advise.
-
Which I am ok with, but why am I getting duplicate content?
-
Yes, it doesn't tell them which pages not to crawl - just not to index them
-
It has been used correctly. The site is a Magento site and they have it built in. There are a lot of filters for products so it uses rel=canonical to tell Google which to index.
-
rel=canonical is not really an robots instruction file - rel=canonical is to help with duplicate copy where you have the same or similar pages and your telling search engines which pages is the preferred page.
If you don't want pages crawling you have to tell Search engines in the robots file
-
Hi There,
Rel=canonical tags tell robots, which page is actually to index out of many.
For SEOs, canonicalization refers to individual web pages that can be loaded from multiple URLs. This is a problem because when multiple pages have the same content but different URLs, links that are intended to go to the same page get split up among multiple URLs. This means that the popularity of the pages gets split up. Unfortunately for web developers, this happens far too often because the default settings for web servers create this problem.
https://moz.com/learn/seo/canonicalization
I feel you have not used it correctly, check the above article and see if it helps.
Thanks,
Vijay
-
So I made a mistake it isn't the robots.txt that is the issue. I am getting hit with a ton of duplicate content penalties so I figured that was it. The problem is that I have pages with rel=canonical tags that it is ignoring. Does Roger not read those?
-
Hi
Have to agree with the above, Rogerbot does listen to robot.txt file, unlike Bing - while they are getting better Bing ignores the robots.txt file frequently.
Ive analysed quite a few server logs over the years and Roger has always listened to the file - its usually a mistake the in the robots file.
There is an option to test your robots.txt file in GCS - while this is testing to see if Google will crawl the page - usually Roger has the same instructions as Google.
However if you are still pretty certain that Roger is ignoring robots.txt please DM your Server Logs and your website and I will take a look and analyse it for you (free of course).
Thanks
Andy
-
All major search engines, including Moz's crawler Rogerbot and Internet Archives, respect Robots.txt as a standard “robots exclusion protocol” to communicate with web crawlers and web robots.
In case you wish to exclude some specific information from all Search Engines, you can use the following sample code as reference to block specific directories.
User-agent: *
Disallow: /cgi-bin/
Disallow: /tmp/
Disallow: /junk/However, if you want to specifically block Mz's Rogerbot from crawling specific sections of your website. You may take the following reference code to block specific areas / directories in your website from rogerbot:
User-agent: Rogerbot
Disallow: /cgi-bin/
Disallow: /tmp/
Disallow: /junk/I hope this helps, If you have specific questions, please feel free to respond, I will be happy to answer them.
Regards,
Vijay
-
Hi there! Moz's crawler, rogerbot, does follow robots.txt. When he's not following robots.txt, it's usually because the robots.txt protocol is formatted improperly. Learn more about formatting your page here: https://moz.com/learn/seo/robotstxt
For more information on Roger, including how to block him, head here: https://moz.com/help/guides/moz-procedures/what-is-rogerbot
And if you want to test your formatting, try the Robots Checker here: https://support.google.com/webmasters/answer/6062598
If you're still unable to determine why rogerbot is crawling your site, feel free to write in to help@moz.com!
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Why is there difference in keywords of moz and ahrefs
Hello, my welding helmets website Helmet Adviser is about reviews and is causing me confusion. When I check my site keywords in Moz it shows lesser keywords and when I check in ahrefs it shows more. Same is for backlinks. But Moz keywords research is very authentic and i want to rely on it for keywords and backlinks too. Please elaborate for me why is there diff between Moz and ahref, so I can better utilize Moz by understanding what is going on in the background.
Link Explorer | | ericcampbell10 -
How Does Moz Spam Score Increase and Decrease?
Hello Moz Community, I'm facing spam score issue on my site General Queen according to MOZ my website spam score is 30% but anybody can explain how can i decrease it? Firstly I disavowed all low-quality and high spam score links (6 months ago) but my spam score was not decreased after that I created some high-quality and 0 spam score backlinks but still, my spam score is same 😞 According to Moz link explorer, my domain backlinks spam score ratio was below; Stats of 14 July 2019 .. Screen shot is attached 1-30% .. 45.3%
Link Explorer | | BloggerSEObd
31-60% .. 32.1%
61-100% .. 22.6% Stats of 13 June 2020 .. Screen shot is attached 1-30% .. 65.1%
31-60% .. 30.7%
61-100% .. 4.2% You can see a huge difference in the backlinks spam score in it. But why my spam score is still 30%? Looking forward to your reply. Thanks & Regards, abu1NiB XGLkr5z1 -
Why moz and ahrefs reported a remove link that is exist?
Hi in last few days I checked my website in moz link explorer and ahrefs and it showed lots of my backlink as removed! But when I checked this on the browser its OK is there any problem with these tools? Anyone has this problem? Thanks
Link Explorer | | namibiagonzo0 -
Moz's new Link Explorer displaying the DA marginally less than Site Explorer
Moz's new Link Explorer displaying the DA marginally less than Site Explorer. Old one is showing it 46 while new link explorer is showing the DA as 40.
Link Explorer | | dhananjay.kumar11 -
Error getting your data in moz ose
Why am I receiving a "There was an error getting your data" in moz ose? Everything worked fine yesterday but now I'm having trouble getting link metrics for my site.
Link Explorer | | TitanDigital1 -
Why moz is not showing My Website External Backlinks?
Google Webmaster showing my external links 41
Link Explorer | | djsamrock
Majestic showing my external links 28
Ahrefs showing my external links 13
&
Moz is showing my external links 0... Will anyone please explain Why moz is not showing My Website External Backlinks?0 -
Not sure why the data on the reports is stale. Meaning it hasn't been updated since my purchase date. Hard to know if I am making any progress.
I am a MOZ Pro subscriber and I am not sure why the data on the reports is stale, meaning it hasn't been updated since my purchase date. Hard to know if I am making any progress. How often does the data update?
Link Explorer | | mcorcelli1 -
OSE Stats - Number of Unique C IP's and PA/DA keep going up and down. Why?
For the past 3 months (if not more) I have been checking my sites stats on OSE and have noticed drastic decreases and increases in unique C-IP's, ranging from 600 to 1200. I have also noticed the PA & DA going up and down by 5 points. We have a talented team that work hard to market our business ethically with strong content and PR relationships, white-hat link building all the way, active social media and a dedicated on-site technical team. The data from MajesticSEO seems to be a lot more steady and consistent. Why is OSE showing drastic changes in 1 to 2 week intervals? Many thanks Ross
Link Explorer | | David_Connor0