How do you stop Moz crawling a page?
-
Hello,
I have a contact form which generates thousands of duplicate crawl errors. I'm going to use to block Google indexing these pages. Will this also block MOZ from crawling these pages and displaying the error?
Thanks!
-
this is all well and good and I am able to do these, but how do I keep Moz from crawling an index.php file. our site is http://4signs.com no index file there at all so I'm not sure why it would be crawled.
thoughts?
-
Hi guys,
Awesome discussion so far Yes, Chris is correct in that using noindex as a way to block Moz is not a effective way to do it. Since our tool is not a typical indexer (such as Google), we don't have some of the behavior of a normal spider. Instead, Roger is very good at rooting out issues that other crawlers might not notice. One thing Roger is also good at is obeying robots.txt.... you know him being a robot and all
You can find more information about our friend here:
http://moz.com/learn/seo/robotstxt http://moz.com/help/pro/rogerbot-crawler
So if you are looking to block it from looking at a page without making content changes to your code, I would definitely look into using robots.txt. You can even use a user-agent specific directive to make sure you don't end up telling other robots/spiders to do the same thing.
I hope that helps! Please let us know if you have questions
Peter
Moz Help Team. -
On http://moz.com/help/pro/rogerbot-crawler Moz gives an answer to the question "We are still seeing duplicate content on Moz even though we have marked those pages as "noindex, follow. Any idea why?
Moz is not a search engine index, it uses a crawler. If those pages are not blocked by the robots.txt file, then Moz will crawl them. They ignore the noindex tag because they don't index anything. Search engines will honor the noindex tag and not index a page if you specify with the robots meta tag. However, to remove pages from the crawl, disallow them in the robots.txt or metarobots.
Their answer is not exactly clear, but according to it, no, a meta noindex will not block rogerbot from crawling your page.
-
Hi Gary,
You may find the following link helpful - http://moz.com/learn/seo/robotstxt on top of this you can read how to stop the moz bot here - http://moz.com/help/pro/rogerbot-crawler
If you have blocked bots from your page this will include the Mozbot. Hope this helps.
-
Yes it will.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Crawl Diagnostics: How many pages (deep) will it crawl for dup content
Does anyone know how deep the crawl diagnostics will crawl when searching for dup content? Will it crawl the entire site, or will it only crawl "x" amount of pages? Thanks!
Moz Bar | | tdawson090 -
Performed a Moz Crawl Test - Says I have 107 External Links on Homepage??
Hello Mozzers! Exactly at the title suggests, I performed a crawl test on one our sites and the report says we have 107 external links on the homepage and another 34 on one of our internal category pages. On both of these pages I can only find 7 external links, anyone know why the crawl test is saying this? And if Moz is finding these external links could google be doing the same and punishing our site for the high number of external links? Any response appreciated! Richard
Moz Bar | | Richard-Kitmondo0 -
Moz Bar Not Showing DA?
Hello all, This could be something to do with our site or the Moz bar on Chrome, I just need to know which it is so if it is our website we can look into it further. On certain sections of our website the Moz bar doesn't display any Domain Authority, not even zero, the bar just isn't present. These types of pages are php which pull in data through a feed daily. Speaking to an SEO expert they said it could be where the page is being updated so frequently, or it could be something more sinister and technically not quite right. Does anyone have any ideas? Is the Moz bar just not working for these types os pages or is it more likely something to do with my site? Ironically it's these pages which I'm having trouble with that are not showing in SERPs! Thanks! 3Foorka
Moz Bar | | HB176 -
Is Moz going to provide mobile ranking tools?
With the mobilegeddon update quickly approaching us on April 21, I wanted to know if Moz is going to provide any insight into mobile rankings vs. desktop rankings? Are there any other tools we can use to benchmark and gain insight into this kind of data?
Moz Bar | | jgrammer2 -
Moz / more changes on the way?
I love Moz and the community and all the tools here. I admit I haven't rolled around in all the new things rolled out a few months ago. I thought there were more changes on the way but I wasn't sure if those already happened and I missed them or if I need to be patient? Affiliate program, client reporting? Thanks for any response. Have a great weekend! Matthew
Moz Bar | | Mrupp441 -
Confusing Moz Crawl?
Hi there, I am not sure if I am missing on something but the moz crawls are rather confusing. After singing in I have received 11 emails with crawls and today I have received again new, When I go to check there to the dashboard it shows 26 pages with issues. When I scroll down I see the pages with issue. Then when I click on the first page listed, to view the issues it says this: Rel Canonical
Moz Bar | | Rebeca1
Using rel=canonical suggests to search engines which URL should be seen as canonical. For this site: http://villasdiani.com/ but we have sorted out the canonical issues a long time ago. Is this a wrong information or is it really true that we do not specify the canonical for our site? Then the second page with issue is there listed http://villasdiani.com/beach-villas/ and it says: Duplicate Page Title
You should use unique titles for your different pages to ensure that they describe each page uniquely and don't compete with each other for keyword relevance. But it does not point out which page is duplicate with this one! I do not have any other page named the same way. It also says in Issues overview 26pages with issues, but it shows on the bottom only 5 under and when I click on view more it brings me to high priority issues where is 0. The most is freaking me out this report: When I click on links, there are listed on the bottom the pages with highest authority among which I found this http://villasdiani.com/db I have never created this kind of page! Funny enough when I click on it it really open that page! How this can be??? In issues overview it also shows on the bottom, right corner 11 page with duplicate content but when I click on it to review it it brings me to high priority issues windows where is not displayed anything Can somebody advice me regarding of this. I have sign up here to learn and sort out the problems with the site but so far I am only getting more confused here. Thank you very much for looking into this.0