Standard Syntax in robots.txt doesn't prevent Moz bot from crawling
-
A client is getting many false positive site crawl errors for things like duplicate titles and duplicate content on pages that include /tag/ in the URL. An example is https://needquest.com/place_tag/autism-spectrum-disorder/page/4/
To resolve this we have set up a disallow statement in the robots.txt file that says
Disallow: /page/
For some reason this appears not to work, as the site crawl errors continue to list pages like this. Does anyone understand why that would be and what we need to do to properly disallow crawling these pages?
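One likely cause is worth checking: robots.txt rules are matched from the start of the URL path, so Disallow: /page/ only covers paths that begin with /page/, not paths that contain /page/ further along. The sketch below uses Python's standard urllib.robotparser to illustrate this. The two-line robots.txt is a hypothetical minimal file holding only the rule from the question, not the site's real file, and the root-level URL is made up for contrast. rogerbot may evaluate rules differently from Python's parser.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical minimal robots.txt: only the rule quoted in the question.
ROBOTS_TXT = """\
User-agent: *
Disallow: /page/
"""


def is_allowed(url: str, agent: str = "rogerbot") -> bool:
    """Return True if `agent` may fetch `url` under ROBOTS_TXT."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(agent, url)


# The example URL from the question: /page/ appears mid-path.
paginated = "https://needquest.com/place_tag/autism-spectrum-disorder/page/4/"
# Made-up URL whose path starts with /page/, for contrast.
root_level = "https://needquest.com/page/4/"

print(is_allowed(paginated))   # True: the rule does not match, so crawling is allowed
print(is_allowed(root_level))  # False: the path starts with /page/
```

Under prefix matching, the paginated tag URL stays crawlable, which would be consistent with the errors continuing to appear.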
-
Thanks, Tawny,
If you look at Duplicate titles, check the first one (https://needquest.com/place_tag/autism-spectrum-disorder/). All the URLs with a duplicate title have /page/ in them. I will suggest they move the Allow statement and see if that helps.
-
I'm not seeing that URL coming up with Duplicate Title or Duplicate Content issues — when I search by that URL I see no Content issues at that URL. I do see that URL in the All Crawled Pages section, but I can't find it bringing up Content issues in the app.
That said, I took a look at your robots.txt file, and I think this could be a result of having an Allow command before the rest of the Disallow commands. If you move that Allow command to the end of the block of Disallow commands, rogerbot may then see the disallow for /page/ and stop crawling those URLs.
If you're still running into trouble, I would suggest writing in to us at help@moz.com so we can take a closer look at the Campaign and what could be going on there.
-
Any reason the Disallow: /page/ isn't preventing URLs like
https://needquest.com/place_tag/autism-spectrum-disorder**/page/**4/
from generating duplicate descriptions and title errors in our site crawl? It was my hope that those pages wouldn't be crawled at all.
-
Sorry, Tawny ... I did go back and correct my question. We did apply Disallow: /page/ to address this issue. The /place_tag/ is found in many pages we DO want to crawl and index ... and here we only want to disallow the page 2, page 3, page 4, etc. pages.
(We also disallowed /tag/, /category/, and a few other common issues that generate false positives in the site crawl.)
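For blocking only the paginated pages while keeping the first /place_tag/ page crawlable, a wildcard pattern such as Disallow: /*/page/ is one option to test. The sketch below is a simplified matcher written for illustration, assuming the common "*" (any run of characters) and trailing "$" (end of path) wildcard conventions. The helper names are hypothetical, and whether rogerbot honors wildcards should be confirmed against Moz's documentation.

```python
import re


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the path start.

    '*' matches any run of characters; a trailing '$' anchors the end of the path.
    """
    anchored_end = pattern.endswith("$")
    if anchored_end:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' where '*' appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored_end else ""))


def matches(pattern: str, path: str) -> bool:
    """Return True if the robots.txt pattern applies to the URL path."""
    return pattern_to_regex(pattern).match(path) is not None


first_page = "/place_tag/autism-spectrum-disorder/"
page_four = "/place_tag/autism-spectrum-disorder/page/4/"

print(matches("/page/", page_four))        # False: plain prefix rule misses it
print(matches("/*/page/", page_four))      # True: wildcard rule catches pagination
print(matches("/*/page/", first_page))     # False: first page stays crawlable
print(matches("/place_tag/", first_page))  # True: this rule would block page 1 too
```

The last line shows the trade-off with a blanket /place_tag/ rule: it would also block the pages meant to be crawled and indexed.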
-
Hey there!
Tawny from Moz's Help Team here.
Adding a disallow directive for /tag/ won't help with the example URL you've provided — that URL doesn't have /tag/ in the URL pathway. To block us from seeing content like that URL you listed, you'd need a disallow directive for /place_tag/.
If you include that disallow directive, that should stop us from seeing duplicate content on pages with /place_tag/ in the URL.
Hope that helps! If you've still got questions, feel free to shoot us a note over at help@moz.com and we'll do our best to sort things out with you.