I have a WP site which uses categories to display the same content in several locations. Which items should get a canonical tag to avoid a ding for duplicate content?
-
So...I have a Knowledge Center and press room that pretty much use the same posts. So...technically the content looks like its on several pages because the post shows up on the Category listing page.
Do I add a Canonical tag to each individual post...so that it is the only one that is counted?
Also...I have a LONG disclaimer that goes at the bottom of most of the posts. would this count as duplicate content? Is there a way to markup a single paragraph to tell the spiders not to crawl it?
-
Hi Kane,
Thank you so much for your quick response even in the holiday season, I really appreciate that. I don't think that there is any unique content on tag or category pages. I will go ahead and noindex them right away.
Once again, thanks for your help.
Happy Holidays...
Jatin
-
Hi Jatin,
If you have any kind of unique content on a tag or category page, then it's reasonable to index it. For example, a category page with 1-2 paragraphs of intro text explaining what types of content are found in that category. In this case, any posts or post teasers showing up on that category/tag page would not be unique content.
If there is no unique content on that page, then I'd suggest noindex for category and tag pages.
-
Hello Kane,
I am also facing this issue and found some answers here in Q&A section. Most of the members are recommending to make tag & categories page non-index. But, you are not recommending this. Do you think that in the matter of duplicate content issue, it is better to non-index the tags and categories page?
If not, then should I worry about the duplicate content issues about these pages?
-
Let's resolve the boilerplate disclaimer text first: it is fine to have a section of content that is duplicated on lots of blog posts - the most important consideration is that each page has a decent amount of unique text from all other URLs on the website. Now there are limits to this - if every post has 200-400 words unique content (not really enough) and the boilerplate is 500 words, then your balance is off in that ratio. If you have 300-1000+ unique words on post and boilerplate text that is <100 words, then I would not worry about that at all. However, you can use the tag that Nick mentioned if you want to - it shouldn't hurt anything.
Now, regarding the canonicalization of posts and how it relates to category/archive pages:
Each post will have its own canonical tag, for example:
- http://www.domain.com/blog/best-post-ever/
- http://www.domain.com/blog/big-announcement-new-product
- http://www.domain.com/blog/annual-company-report
Then each category page or archive page would have it's own canonical tag:
When you are viewing a category or tag or archive page that lists a bunch of posts - there is only one canonical tag visible in the code for that page, and it's for the page itself - not for the posts listed on the page.
I'm guessing you're asking this question because you read that duplicate content on category and tag pages was common, and that is true. However - canonicals are not involved in fixing this at all. The thing with Wordpress is that unless you built a theme yourself - you shouldn't have to touch any of this. Default Wordpress with Yoast SEO plugin installed will handle this for you. I have worked on hundreds of Wordpress sites for 10 years and can count the number of times I manually specified a canonical tag in Wordpress on one hand.
The duplicate content of two pages such as your knowledge center and press room should be mitigated by 1) reducing the number of posts that fall into both categories, 2) making sure there is some unique content (50-300 words) on the knowledge center and press room pages other than a list of blog posts, and 3) not stressing too hard about duplicate content on these pages.
Hope that helps, please feel free to respond with questions if I missed something.
-
Hi Lindsay,
I'd recommend limiting the number of categories you select for each post - generally my rule is 3 categories maximum. That being said, I've found that the most successful strategy is to create & categorize content in a way that easily satisfies your user's intent. Categories = broad topics/areas of interest your ideal buyer wants to read about (broad keyword search phrases). Articles/posts = focus on one specific question related to the broad category (longtail phrases).
For example: let's say you have a shoe company and you've created a style blog that discusses the latest trends. One option is to do what a lot of companies do, and choose generic blog categories like trends, inspiration, comfort, etc...
Let's say you research and decide write an article called: Best Shoes To Wear To Coachella (because it's a longtail keyword). How do you categorize it? It's definitely goes in trends, but it's also kind of inspirational, and you also have a section about comfortable shoes to wear to Coachella. You can't choose just one category, so you end up adding the post to 4-5 categories.
The biggest problem with this type of organization structure is not duplicate content - it's that users (a) can't easily find your content because they don't know what your categories mean and (b) they're confused about what content they've already read, because they see the same articles in multiple categories.
In my opinion, the better way to choose categories and article topics, as I sort of mentioned above, is to start with broad topics that people want to learn about.
Instead, you might choose categories based on popular search queries. For example: Festival Shoes, New This Season, Celebrity Favs, How To Wear It, etc. In this case, your article: Best Shoes To Wear To Coachella would go under the festival shoes category. You could also have articles about Stagecoach, SXSW, etc. This isn't the best example, but I hope this make sense!
Long story short: done correctly, this type strategy is helpful in a number of ways: (1) your user is able to easily understand where to find the information they're looking for, (2) you avoid duplicate content, because your articles are written to correspond with 1 (maybe 2) categories, and (3) your category pages will be hyper-optimized for lots of longtail keywords that are related to your main category keyword. This will make your category pages like mini-landing pages that have a higher probability of ranking broad/more competitive keywords.
I hope this helps!
-
Hi Lindsay, Good questions. My recommendation would be to place a cononical tag on your posts and consider setting your category page to noindex.
As for the disclaimer, you can wrap that in the following tags to tell Google that it should not index that specific content.
<code>This (X)HTML content will NOT be indexed by Google.</code>
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Two long established sites with similar audiences, what do we do?
Hi guys, We operate two long established and reasonably well ranking sites — our company website which was built on a keyword domain: market-stalls.co.uk (approx 15 years online) and our online store which was established several years later on a different domain: tradersupplies.co.uk (approx 9 years online). (At the bottom of this post I've attached real world traffic and turnover figures that demonstrate the issue we're facing) The problem is... The above sites target very similar audiences and keywords and both rank fairly well but I know are likely competing against eachother We're a small company (8-10 employees) and we (or rather, I) don't have the time or resources to blog, build back links, manage opseo and all the social channels etc for both sites. I'm struggling to cope with one. The question is... Do we abandon the original company site (market-stalls.co.uk) in favour of pooling all our resource in to improving rankings for our online store (tradersupplies.co.uk). All our social media presence relates to tradersupplies.co.uk. We don't have any social channels for market-stalls.co.uk. Ironically, the only blog we have is established on market-stalls.co.uk — set up a couple of years ago in the hope to pull ourselves back up the rankings — but it hasn't been updated in over a year due to time restraints. Or do we attempt to keep both sites operational, despite a lack of resource? That would likely include a fairly sizeable overhaul of market-stalls.co.uk to bring it up to date with modern design standards, establishing social media channels for market-stalls.co.uk, creating a blog on tradersupplies.co.uk, and regularly updating two blogs and two sets of social media channels with unique content. Sounds like a pretty huge job right!? Obviously, had we been setting up our business in 2017 and having read the many community posts on the subject of multiple websites, we wouldn't be splitting our time between two websites and would be focussing solely on building one highly ranking site. But unfortunately we're not in this position and we're in a quandary because we don't know whether or not we should let our original, highly ranking company site drop off the radar in favour of focussing on building traffic to our online store. This situation arose out of a decision to establish our online store on a different domain to our company website. Back in 2007 I rebuilt market-stalls.co.uk and spent a lot of time optimising it. The site blew up and we were ranking very well for all kinds of keywords related to market stalls In 2009 we opened our online store tradersupplies.co.uk which sells all of the products advertised on market-stalls.co.uk and then some By using "buy now" buttons on market-stalls.co.uk which redirected to tradersupplies.co.uk, our original site was driving a large amount of traffic and sales to tradersupplies.co.uk. At it's peak it was driving almost £6,000 GBP a month in sales. This has since dropped to around a third/quarter of this total. As the business grew we began to run short of time to maintain market-stalls.co.uk and it has inevitably slipped down the rankings This has also had a direct impact on the referral traffic and resulting sales on tradersupplies.co.uk. I've attached below the analytics which show the drop in referral traffic to tradersupplies.co.uk and the drop off in sales. I have a feeling I know the answer to this debacle but I'm keen to hear the opinions of those that may have found themselves in this position before! UPDATE: I've just had a call with our Magento developer halfway through writing this post ... he has suggested we transfer all content from market-stalls.co.uk over to CMS pages on our Magento powered online store, and create 301 redirects. Apparently this will carry the weight of market-stalls.co.uk over to tradersupplies.co.uk. Does anyone have any thoughts on this? turnover.jpg
Reporting & Analytics | | tinselworm0 -
Need advice on setting up primary domain and shopify site analytics to work best together
Hello, I have a client that I have been working on their primary site for the last year or so. In the last month they decided to have one of their internal employees setup a small shopify store. Now they are asking for the analytics tracking codes for it. My question for you is what would be the best way for me to set that up? variables: primary domain and shopify domain, google and bing analytics Have been looking at how cross domain tracking works (https://support.google.com/tagmanager/answer/6106951), and the instructions for setting up ecommerce in analytics for shopify (https://help.shopify.com/manual/reports-and-analytics/google-analytics/google-analytics-setup). But am still not 100% which route would be the best, any input would be greatly appreciated! thank you, Dustin
Reporting & Analytics | | pastedtoast1 -
Curious, anyone ever had over half of their indexed links drop on an e-commerce site?
In a year went from around 300k indexed pages to around >100k according to GWT. Could this be duplicate content issue, lost links, spam, aged links or all of the above? either way an audit is in order. Thanks! Chris
Reporting & Analytics | | Sundance_Kidd0 -
Rel=Canonical vs. No Index
Ok, this is a long winded one. We're going to spell out what we've seen, then give a few questions to answer below, so please bear with us! We have websites with products listed on them and are looking for guidance on whether to use rel=canonical or some version of No Index for our filtered product listing pages. We work with a couple different website providers and have seen both strategies used. Right now, one of our web providers uses No Index, No Follow tags and Moz alerted us to the high frequency of these tags. We want to make sure our internal linking structure is sound and we are worried that blocking these filtered pages is keeping our product pages from being as relevant as they could be. We've seen recommendations to use No Index, Follow tags instead, but our other web provider uses a different method altogether. Another vendor uses a rel=canonical strategy which we've also seen when researching Nike and Amazon's sites. Because these are industry leading sites, we're wondering if we should get rid of the No Index tags completely and switch to the canonical strategy for our internal links. On that same provider's sites, we've found rel=canonical tags used after the first page of our product listings, and we've seen recommendations to use rel=prev and rel=next instead. With all that being said, we have three questions: 1)Which strategy (rel=canonical vs. No Index) do you recommend as being optimal for website crawlers and boosting our site relevance? 2)If we should be using some version of No Index, should we use Follow or No Follow? 2)Depending on the product, we have multiple pages of products for each category. Should we use rel=prev & rel=next instead of rel=canonical among the pages after page one? Thanks in advance!
Reporting & Analytics | | Leithmarketing0 -
Is it possible to get demographic and interest information from DoubleClick cookies?
We use Google Analytics and we are currently extracting information from the Google Analytics cookies about our visitors. Is it possible to access DoubleClick cookies in a similiar fashion and get some demographic/interest information for each visitor to our website (if they have a DoubleClick cookie set)? If so, any information on how to retrieve it would be very appreciated.
Reporting & Analytics | | WebpageFX0 -
One big site for a loose theme or multiple sites for each specific theme?
Hi, Our company produces content related to one loose overall theme. Within that we write content and sell products on three specific sub-themes. Some of our customers cross over and have an interest in two or all three themes and others are only interested in one. At present we have one site covering reviews of products relating to all three sub-themes... ....and three other sites offering how-to guides and tutorials dedicated to each sub-theme. We do not have a lot of time to commit to SEO and so we are considering merging the content we have on all three subjects into the one site covering them all. Each of these sub-themes could have websites in their own right and my worry is that it will be harder to rank for these subject specific terms if the content is not on a site dedicated to that subject. But of course if they were all together then any links we build will be consolidated into one big site. Does anyone have any experience of this or have any advice on what the best thing to do would be? Thanks for your help!
Reporting & Analytics | | frantan0 -
Any harm and why the differences - multiple versions of same site in WMT
In Google Webmaster Tools we have set up: ourdomain.co.nz
Reporting & Analytics | | zingseo
ourdomain.co.uk
ourdomain.com
ourdomain.com.au
www.ourdomain.co.nz
www.ourdomain.co.uk
www.ourdomain.com
www.ourdomain.com.au
https://www.ourdomain.co.nz
https://www.ourdomain.co.uk
https://www.ourdomain.com
https://www.ourdomain.com.au As you can imagine, this gets confusing and hard to manage. We are wondering whether having all these domains set up in WMT could be doing any damage? Here http://support.google.com/webmasters/bin/answer.py?hl=en&answer=44231 it says: "If you see a message that your site is not indexed, it may be because it is indexed under a different domain. For example, if you receive a message that http://example.com is not indexed, make sure that you've also added http://www.example.com to your account (or vice versa), and check the data for that site." The above quote suggests that there is no harm in having several versions of a site set up in WMT, however the article then goes on to say: "Once you tell us your preferred domain name, we use that information for all future crawls of your site and indexing refreshes. For instance, if you specify your preferred domain as http://www.example.com and we find a link to your site that is formatted as http://example.com, we follow that link as http://www.example.com instead." This suggests that having multiple versions of the site loaded in WMT may cause Google to continue crawling multiple versions instead of only crawling the desired versions (https://www.ourdomain.com + .co.nz, .co.uk, .com.au). However, even if Google does crawl any URLs on the non https versions of the site (ie ourdomain.com or www.ourdomain.com), these 301 to https://www.ourdomain.com anyway... so shouldn't that mean that google effectively can not crawl any non https://www versions (if it tries to they redirect)? If that was the case, you'd expect that the ourdomain.com and www.ourdomain.com versions would show no pages indexed in WMT, however the oposite is true. The ourdomain.com and www.ourdomain.com versions have plenty of pages indexed but the https versions have no data under Index Status section of WMT, but rather have this message instead: Data for https://www.ourdomain.com/ is not available. Please try a site with http:// protocol: http://www.ourdomain.com/. This is a problem as it means that we can't delete these profiles from our WMT account. Any thoughts on the above would be welcome. As an aside, it seems like WMT is picking up on the 301 redirects from all ourdomain.com or www.ourdomain.com domains at least with links - No ourdomain.com or www.ourdomain.com URLs are registering any links in WMT, suggesting that Google is seeing all links pointing to URLs on these domains as 301ing to https://www.ourdomain.com ... which is good, but again means we now can't delete https://www.ourdomain.com either, so we are stuck with 12 profiles in WMT... what a pain.... Thanks for taking the time to read the above, quite complicated, sorry!! Would love any thoughts...0 -
Duplicate page content
I'm seeing duplicate page content for tagged URLs. For example:
Reporting & Analytics | | DolbySEO
http://www.dolby.com/us/en/about-us/careers/landing.html
http://www.dolby.com/us/en/about-us/careers/landing.html?onlnk=al-sc as well as PPC campaigns. We tag certain landing pages purposefully in order to understand that traffic comes from these pages, since we use Google Analytics and don't have the abiility to see clickpaths in the package we have. Is there a way to set parameters for crawling to exclude certain pages or tagged content, such as those set up for PPC campaigns?0