I have a WP site which uses categories to display the same content in several locations. Which items should get a canonical tag to avoid a ding for duplicate content?
-
So...I have a Knowledge Center and press room that pretty much use the same posts. So...technically the content looks like its on several pages because the post shows up on the Category listing page.
Do I add a Canonical tag to each individual post...so that it is the only one that is counted?
Also...I have a LONG disclaimer that goes at the bottom of most of the posts. would this count as duplicate content? Is there a way to markup a single paragraph to tell the spiders not to crawl it?
-
Hi Kane,
Thank you so much for your quick response even in the holiday season, I really appreciate that. I don't think that there is any unique content on tag or category pages. I will go ahead and noindex them right away.
Once again, thanks for your help.
Happy Holidays...
Jatin
-
Hi Jatin,
If you have any kind of unique content on a tag or category page, then it's reasonable to index it. For example, a category page with 1-2 paragraphs of intro text explaining what types of content are found in that category. In this case, any posts or post teasers showing up on that category/tag page would not be unique content.
If there is no unique content on that page, then I'd suggest noindex for category and tag pages.
-
Hello Kane,
I am also facing this issue and found some answers here in Q&A section. Most of the members are recommending to make tag & categories page non-index. But, you are not recommending this. Do you think that in the matter of duplicate content issue, it is better to non-index the tags and categories page?
If not, then should I worry about the duplicate content issues about these pages?
-
Let's resolve the boilerplate disclaimer text first: it is fine to have a section of content that is duplicated on lots of blog posts - the most important consideration is that each page has a decent amount of unique text from all other URLs on the website. Now there are limits to this - if every post has 200-400 words unique content (not really enough) and the boilerplate is 500 words, then your balance is off in that ratio. If you have 300-1000+ unique words on post and boilerplate text that is <100 words, then I would not worry about that at all. However, you can use the tag that Nick mentioned if you want to - it shouldn't hurt anything.
Now, regarding the canonicalization of posts and how it relates to category/archive pages:
Each post will have its own canonical tag, for example:
- http://www.domain.com/blog/best-post-ever/
- http://www.domain.com/blog/big-announcement-new-product
- http://www.domain.com/blog/annual-company-report
Then each category page or archive page would have it's own canonical tag:
When you are viewing a category or tag or archive page that lists a bunch of posts - there is only one canonical tag visible in the code for that page, and it's for the page itself - not for the posts listed on the page.
I'm guessing you're asking this question because you read that duplicate content on category and tag pages was common, and that is true. However - canonicals are not involved in fixing this at all. The thing with Wordpress is that unless you built a theme yourself - you shouldn't have to touch any of this. Default Wordpress with Yoast SEO plugin installed will handle this for you. I have worked on hundreds of Wordpress sites for 10 years and can count the number of times I manually specified a canonical tag in Wordpress on one hand.
The duplicate content of two pages such as your knowledge center and press room should be mitigated by 1) reducing the number of posts that fall into both categories, 2) making sure there is some unique content (50-300 words) on the knowledge center and press room pages other than a list of blog posts, and 3) not stressing too hard about duplicate content on these pages.
Hope that helps, please feel free to respond with questions if I missed something.
-
Hi Lindsay,
I'd recommend limiting the number of categories you select for each post - generally my rule is 3 categories maximum. That being said, I've found that the most successful strategy is to create & categorize content in a way that easily satisfies your user's intent. Categories = broad topics/areas of interest your ideal buyer wants to read about (broad keyword search phrases). Articles/posts = focus on one specific question related to the broad category (longtail phrases).
For example: let's say you have a shoe company and you've created a style blog that discusses the latest trends. One option is to do what a lot of companies do, and choose generic blog categories like trends, inspiration, comfort, etc...
Let's say you research and decide write an article called: Best Shoes To Wear To Coachella (because it's a longtail keyword). How do you categorize it? It's definitely goes in trends, but it's also kind of inspirational, and you also have a section about comfortable shoes to wear to Coachella. You can't choose just one category, so you end up adding the post to 4-5 categories.
The biggest problem with this type of organization structure is not duplicate content - it's that users (a) can't easily find your content because they don't know what your categories mean and (b) they're confused about what content they've already read, because they see the same articles in multiple categories.
In my opinion, the better way to choose categories and article topics, as I sort of mentioned above, is to start with broad topics that people want to learn about.
Instead, you might choose categories based on popular search queries. For example: Festival Shoes, New This Season, Celebrity Favs, How To Wear It, etc. In this case, your article: Best Shoes To Wear To Coachella would go under the festival shoes category. You could also have articles about Stagecoach, SXSW, etc. This isn't the best example, but I hope this make sense!
Long story short: done correctly, this type strategy is helpful in a number of ways: (1) your user is able to easily understand where to find the information they're looking for, (2) you avoid duplicate content, because your articles are written to correspond with 1 (maybe 2) categories, and (3) your category pages will be hyper-optimized for lots of longtail keywords that are related to your main category keyword. This will make your category pages like mini-landing pages that have a higher probability of ranking broad/more competitive keywords.
I hope this helps!
-
Hi Lindsay, Good questions. My recommendation would be to place a cononical tag on your posts and consider setting your category page to noindex.
As for the disclaimer, you can wrap that in the following tags to tell Google that it should not index that specific content.
<code>This (X)HTML content will NOT be indexed by Google.</code>
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
PDF web traffic hitting our site
Hi there, Over the last few months our traffic has spiked due to irrelevant pdf documents sending us crap traffic, our bounce rate is sky high as well as other metrics. I don't want to just filter out this traffic in GA rather try and stop our site from being attacked. Any advice on a way forward would be great. Thanks
Reporting & Analytics | | ICMPmarketing0 -
Page Tracking using Custom URLs - is this viable?
Hi Moz community! I’ll try to make this question as easy to understand as possible, but please excuse me if it isn’t clear. Just joined a new team a few months ago and found out that on some of our most popular pages we use “custom URLs” to track page metrics within Google Analytics. NOTE: I say “custom URLs” because that is the best way for me to describe them. As an example: This page exists to our users: http://usnews.rankingsandreviews.com/cars-trucks/Ram_HD/2012/photos-interior/ But this is the URL we have coded on the page: cars-trucks/used-cars/reviews/2012-Ram-HD/photos-interior/ (within the custom variance script labeled as “var l_tracker=” ) It is this custom URL that we use within GA to look up metrics about this page. This is just one example of many across our site setup to do the same thing Here is a second example: Available page to user: http://usnews.rankingsandreviews.com/cars-trucks/Cadillac_ATS/2015/ Custom “var l_tracker=” /cars-trucks/2015-Cadillac-ATS/overview/ NOTE: There is a small amount of fear that the above method was implemented years ago as a work-around to a poorly structured URL architecture. Not validated, but that is a question that arose. Main Questions: Is the above implementation a normal and often used method to track pages in GA? (coming from an Omniture company before – this would not be how we handled page level tracking) Team members at my current company are divided on this method. Some believe this is not a proper implementation and are concerned that trying to hide these from Google will raise red flags (i.e. fake URLs in general = bad) I cannot find any reference to this method anywhere on the InterWebs - If method is not normal: Any recommendations on a solution to address this? Potential Problems? GA is currently cataloguing these tracking URLs in the Crawl Error report. Any concerns about this? The team wants to hide the URLs in the Robots.txt file, but some team members are concerned this may raise a red flag with Google and hurt us more than help us. Thank you in advance for any insight and/or advice. Chris
Reporting & Analytics | | usnseomoz0 -
How often does google content experiments stats update?
From my experience it seems to update once per day (every 24 hours), can anyone confirm this is the case or have a link to an official announcement which confirms how often the data updates? It would be handy to know when it updates so we can see the latest information as it comes in.
Reporting & Analytics | | Twist3600 -
Implementing demographics for a nopCommerce site
Hi Guys, I have been trying to implement demographics in Analytics for a site built in nopCommerce and I have run out of things to try to correct the problem and get some data! The code I am using is below: Any help would be greatly appreciated - it's driving me mad! Thanks, Dan
Reporting & Analytics | | SEOBirmingham810 -
Strange Traffic / Viewed content in Analytics
Hi Im looking at the content viewed in December for a clients site and there's a high number of page views for some non-existent pages which received multiple hits eg. domain.com/dr-capability/ with 241 page views (54 unique page views) How can you have multiple page views (and many unique page views) for pages that don't exist ? Is it possible page is being generated automatically or something ? if so how come ? Any ideas as to whats causing this much appreciated ? All Best Dan ps - also many above average home page views
Reporting & Analytics | | Dan-Lawrence0 -
How to put structured data on your site?
HI All, I Would like to know your views on structured data on your site. What are the qualities and point that should keep in mind when we talking about the "structured data on your site". how we can apply structured data. How we use data highlighter for the website which don't have any events. your thoughts please.
Reporting & Analytics | | lucidsoftech
thanks0 -
Has anyone used or have the lowdown on LeadLander?
Client received an email on LeadLander showing actual business names in traffic results. Has anyone used or have the lowdown on LeadLander? Is it spammy? Seems to not have much buzz online, mostly active in 2008.
Reporting & Analytics | | mxmo0 -
Should we "no-follow" archives or categories?
I'm reading some reports from my first crawl of 10K pages and I'm wondering if it's wise to mark the archives "no-follow." I have a WP tool that provides a tool that offers the no follow for categories or archives recommending to choose either one or the other but not both. What would be the best solution?
Reporting & Analytics | | JavaManOne0