Crawl Diagnostics Warnings - Duplicate Content
-
Hi All,
I am getting a lot of warnings about duplicate page content. The pages are normally 'tag' pages.
I have some news stories or blog posts tagged with multiple 'tags'.
Should I ask google not to index the tag pages? Does it really affect my site?
Thanks
-
Thanks Marcus.
It is wordpress I am using and already have the Yoast WP plugin. I'll try nondexing the author and date taxonomies too.
I have done so with the tags but they still show up on the SEOmoz report.
Good idea about varying the categories - will give that a go and see if anything changes
-
Hey Stacey
It all depends on how these tag pages are used and whether they factor as landing pages or are just a tool for people to view related content once on the site.
Are you using WordPress? If so, WordPress features a bunch of taxonomies, tags being one and where you have posts by a single author they may be duplicated on the homepage, date archive, author archive, categories, sub categories and tags so you can end up with a lot of pages that look pretty much the same.
This is fairly straightforward to resolve though and if you just install the Yoast WordPress SEO plugin and then noindex any pages that are really just for users to browse you can ensure your important pages remain indexed and there is not lots of duplication or competition.
Really, it is more than a technical problem and it comes down to how you organise your posts and content on the blog and a default blog root and specific indexed category pages (ideally with some additional, unique content) can work best (but again, the specifics depends on the blog and the content).
Anyhow, it is an easy change, try no indexing the tags, date archives, author archives etc and using some smart category organisation and see if it moves the dial at all for you. You can always put things back if you don't find it helps.
Hope that helps!
Marcus
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Drupal 8 tags and categories cause duplicate content shown in MOZ
Hi all, There is something difficult to trace that is causing duplicate content that is related to categories and tags i.e. https://example.com/contact Associated Pages https://example.com/tags/business https://example.com/taxonomy/term/41 example 2 https://example.com/category/example-category-1 Associated Pages https://example.com/category/occupiers-liability example 3 https://example.com/tags/test https://example.com/tags/test-2 Above two pages display same content (maybe due to similar posts feature) My question here is: Is this caused by Drupal website misconfiguration (or one of its modules) since website uses similar posts feature or it's something else. Duplicate content for example.com/index.php issue has been solved by redirects. Should something similar be done in case of tags / categories? Any discussion / suggestions on that matter are greatly appreciated. Thank you.
Moz Pro | | Optimal_Strategies0 -
WEbsite cannot be crawled
I have received the following message from MOZ on a few of our websites now Our crawler was not able to access the robots.txt file on your site. This often occurs because of a server error from the robots.txt. Although this may have been caused by a temporary outage, we recommend making sure your robots.txt file is accessible and that your network and server are working correctly. Typically errors like this should be investigated and fixed by the site webmaster. I have spoken with our webmaster and they have advised the below: The Robots.txt file is definitely there on all pages and Google is able to crawl for these files. Moz however is having some difficulty with finding the files when there is a particular redirect in place. For example, the page currently redirects from threecounties.co.uk/ to https://www.threecounties.co.uk/ and when this happens, the Moz crawler cannot find the robots.txt on the first URL and this generates the reports you have been receiving. From what I understand, this is a flaw with the Moz software and not something that we could fix form our end. _Going forward, something we could do is remove these rewrite rules to www., but these are useful redirects and removing them would likely have SEO implications. _ Has anyone else had this issue and is there anything we can do to rectify, or should we leave as is?
Moz Pro | | threecounties0 -
Duplicate Content
Hello, I'm managing a site which shows as having duplicate page issues (in the crawl analyser) for 3 pages. Basically the site is offering 3 different options of the same product so depending on which size you select, you are directed to the relevant page. These 3 pages are basically identical apart from a slight difference in copy regarding the size (small, medium, large) Is this likely to be a big issue regarding SEO, and what would the moz community suggest re this? Thank you!
Moz Pro | | wearehappymedia0 -
Tags on my website cause duplicate content
Hi I just recently started a website and I am new to MOZ pro. What Moz pro detected on my website under high priority is that "duplicate page content" and what I realize about these duplicate page content is regarding the tags i put on my post. Because it is a wordpress blog, we are allow to add tags on the side before we publish our post. And because of these tags, it linked to the same page but different url. for example website.com/tags/whatever website.com/tags/whatever 2 and both these url direct to the same page So how do i solve this? do i just stop tagging whenever i write a post? delete all tags while it is not necessary? i seen method like 301 redirect or rel=canonical but is there anyway to solve this problem so I do not face this issue whenever i make a new post in my blog? I mean it doesnt make sense to redirect 301 to every single tags i have whenever i write a new post right? thanks guys
Moz Pro | | andzon0 -
Lag time between MOZ crawl and report notification?
I did a lot of work to one of my sites last week and eagerly awaited this week's MOZ report to confirm that I had achieved what I was trying to do, but alas I still see the same errors and warnings in the latest report. This was supposedly generated five days AFTER I made the changes, so why are they not apparent in the new report? I am mainly referring to missing metadata, long page titles, duplicate content and duplicate title errors (due to crawl and URL issues). Why would the new crawl not have picked up that these have been corrected? Does it rely on some other crawl having updated (e.g. Google or Bing)?
Moz Pro | | Gavin.Atkinson0 -
False Pro reporting of duplicate titles
I am testing Pro. About 250 pages of content at my website. Pro says ALL of my pages have duplicate titles., but when I click on details, they display as unique titles. Ie: first page of results of Pro is as follows. While the content of my website is on one major topic the title meta tags are NOT identical. Is this an issue with Pro, or is Pro looking at something other than the title meta tags? Please advise ? Fiance Visa Help What is Adjustment Of Status from K1 Visa Adjustment of Status support Taiwan US Consulate Visa Interview Adjustment of Status Order Form How to Choose between K1 Fiancee or CR1 Marriage Visa Removal of Conditions on Residence support US Embassies + Consulates that process Fiancee and Spousal Visas | | | | |
Moz Pro | | microonae
| | | | |
| | | | |
| | | | |
| | | | |
| | | | |
| | | | |
| |0 -
Is it possible to exclude pages from Crawl Diagnostic?
I like the crawl diagnostic but it shows many errors due to a forum that I have. I don't care about the SEO value of this forum and would like to exclude any pages in the /forum/ directory. Is it possible to add exclusions to the crawl diagnostic tool?
Moz Pro | | wfernley2 -
Dynamic URL pages in Crawl Diagnostics
The crawl diagnostic has found errors for pages that do not exist within the site. These pages do not appear in the SERPs and are seemingly dynamic URL pages. Most of the URLs that appear are formatted http://mysite.com/keyword,%20_keyword_,%20key_word_/ which appear as dynamic URLs for potential search phrases within the site. The other popular variety among these pages have a URL format of http://mysite.com/tag/keyword/filename.xml?sort=filter which are only generated by a filter utility on the site. These pages comprise about 90% of 401 errors, duplicate page content/title, overly-dynamic URL, missing meta decription tag, etc. Many of the same pages appear for multiple errors/warnings/notices categories. So, why are these pages being received into the crawl test? and how to I stop it to gauge for a better analysis of my site via SEOmoz?
Moz Pro | | Visually0