I have a WP site which uses categories to display the same content in several locations. Which items should get a canonical tag to avoid a ding for duplicate content?
-
So...I have a Knowledge Center and press room that pretty much use the same posts. So...technically the content looks like its on several pages because the post shows up on the Category listing page.
Do I add a Canonical tag to each individual post...so that it is the only one that is counted?
Also...I have a LONG disclaimer that goes at the bottom of most of the posts. would this count as duplicate content? Is there a way to markup a single paragraph to tell the spiders not to crawl it?
-
Hi Kane,
Thank you so much for your quick response even in the holiday season, I really appreciate that. I don't think that there is any unique content on tag or category pages. I will go ahead and noindex them right away.
Once again, thanks for your help.
Happy Holidays...
Jatin
-
Hi Jatin,
If you have any kind of unique content on a tag or category page, then it's reasonable to index it. For example, a category page with 1-2 paragraphs of intro text explaining what types of content are found in that category. In this case, any posts or post teasers showing up on that category/tag page would not be unique content.
If there is no unique content on that page, then I'd suggest noindex for category and tag pages.
-
Hello Kane,
I am also facing this issue and found some answers here in Q&A section. Most of the members are recommending to make tag & categories page non-index. But, you are not recommending this. Do you think that in the matter of duplicate content issue, it is better to non-index the tags and categories page?
If not, then should I worry about the duplicate content issues about these pages?
-
Let's resolve the boilerplate disclaimer text first: it is fine to have a section of content that is duplicated on lots of blog posts - the most important consideration is that each page has a decent amount of unique text from all other URLs on the website. Now there are limits to this - if every post has 200-400 words unique content (not really enough) and the boilerplate is 500 words, then your balance is off in that ratio. If you have 300-1000+ unique words on post and boilerplate text that is <100 words, then I would not worry about that at all. However, you can use the tag that Nick mentioned if you want to - it shouldn't hurt anything.
Now, regarding the canonicalization of posts and how it relates to category/archive pages:
Each post will have its own canonical tag, for example:
- http://www.domain.com/blog/best-post-ever/
- http://www.domain.com/blog/big-announcement-new-product
- http://www.domain.com/blog/annual-company-report
Then each category page or archive page would have it's own canonical tag:
When you are viewing a category or tag or archive page that lists a bunch of posts - there is only one canonical tag visible in the code for that page, and it's for the page itself - not for the posts listed on the page.
I'm guessing you're asking this question because you read that duplicate content on category and tag pages was common, and that is true. However - canonicals are not involved in fixing this at all. The thing with Wordpress is that unless you built a theme yourself - you shouldn't have to touch any of this. Default Wordpress with Yoast SEO plugin installed will handle this for you. I have worked on hundreds of Wordpress sites for 10 years and can count the number of times I manually specified a canonical tag in Wordpress on one hand.
The duplicate content of two pages such as your knowledge center and press room should be mitigated by 1) reducing the number of posts that fall into both categories, 2) making sure there is some unique content (50-300 words) on the knowledge center and press room pages other than a list of blog posts, and 3) not stressing too hard about duplicate content on these pages.
Hope that helps, please feel free to respond with questions if I missed something.
-
Hi Lindsay,
I'd recommend limiting the number of categories you select for each post - generally my rule is 3 categories maximum. That being said, I've found that the most successful strategy is to create & categorize content in a way that easily satisfies your user's intent. Categories = broad topics/areas of interest your ideal buyer wants to read about (broad keyword search phrases). Articles/posts = focus on one specific question related to the broad category (longtail phrases).
For example: let's say you have a shoe company and you've created a style blog that discusses the latest trends. One option is to do what a lot of companies do, and choose generic blog categories like trends, inspiration, comfort, etc...
Let's say you research and decide write an article called: Best Shoes To Wear To Coachella (because it's a longtail keyword). How do you categorize it? It's definitely goes in trends, but it's also kind of inspirational, and you also have a section about comfortable shoes to wear to Coachella. You can't choose just one category, so you end up adding the post to 4-5 categories.
The biggest problem with this type of organization structure is not duplicate content - it's that users (a) can't easily find your content because they don't know what your categories mean and (b) they're confused about what content they've already read, because they see the same articles in multiple categories.
In my opinion, the better way to choose categories and article topics, as I sort of mentioned above, is to start with broad topics that people want to learn about.
Instead, you might choose categories based on popular search queries. For example: Festival Shoes, New This Season, Celebrity Favs, How To Wear It, etc. In this case, your article: Best Shoes To Wear To Coachella would go under the festival shoes category. You could also have articles about Stagecoach, SXSW, etc. This isn't the best example, but I hope this make sense!
Long story short: done correctly, this type strategy is helpful in a number of ways: (1) your user is able to easily understand where to find the information they're looking for, (2) you avoid duplicate content, because your articles are written to correspond with 1 (maybe 2) categories, and (3) your category pages will be hyper-optimized for lots of longtail keywords that are related to your main category keyword. This will make your category pages like mini-landing pages that have a higher probability of ranking broad/more competitive keywords.
I hope this helps!
-
Hi Lindsay, Good questions. My recommendation would be to place a cononical tag on your posts and consider setting your category page to noindex.
As for the disclaimer, you can wrap that in the following tags to tell Google that it should not index that specific content.
<code>This (X)HTML content will NOT be indexed by Google.</code>
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
PDF best practices: to get them indexed or not? Do they pass SEO value to the site?
All PDFs have landing pages, and the pages are already indexed. If we allow the PDFs to get indexed, then they'd be downloadable directly from google's results page and we would not get GA events. The PDFs info would somewhat overlap with the landing pages info. Also, if we ever need to move content, we'd now have to redirects the links to the PDFs. What are best practices in this area? To index or not? What do you / your clients do and why? Would a PDF indexed by google and downloaded directly via a link in the SER page pass SEO juice to the domain? What if it's on a subdomain, like when hosted by Pardot? (www1.example.com)
Reporting & Analytics | | hlwebdev1 -
SEO Effect of inserting No indexed Contents in normal Pages (Nextgen Gallery)
Hello Dear Community, I'm running a photography website and have a question about the indexability of "No indexed Content" inserted on indexable pages. Background : I read everywhere that best practice is to "no index" all useless pages with few content, what I did with Yoast plugin : I no indexed all my nextgen galleries and "ngg_tags" since they create single pages for every photo, tags or slideshow. I did the same for all my porfolio-posts, price lists, testimonials and so on... Nevertheless, I inserted these galleries and portfolios on SEO optimized page for my target keywords. + Nextgen plugin automatically adds these images in the page sitemap. My idea is to have only my Seo optimized page showing in Google and not the others. Problem: I've been checking the results in Google Search Console, filtering by images : I discovered that most of the images featured in these Masonry galleries are not showing in google, and actually almost all the images indexed are the Wordpress from media gallery. I double checked with Screaming Frog, and the software doesn"t see images on these pages. My question is: Is the low indexablilty of these contents are related to the No indexation of the original contents ??? Does somebody has experienced the same issue that these contents doesn't show on Google ? in advance many thanks for your help
Reporting & Analytics | | TristanAventure0 -
Longterm wordpress blog not providing seo benefit to main site - help needed please
Hi I have a bigcommerce ecommerce store, with a Wordpress blog on a subdomain. The store and blog have been active for four years, the blog is regularly updated with original content, has many links to the store, is promoted regularly via my brand's social media channels and mailing list, and has the simplest SEO basics covered via a Yoast SEO plugin. But the store sees very little, if any, SEO benefit from the blog. My question is: based on this information, and the details below, is there an issue with the connection between the blog and main site in SEO terms? And if there is, how can I start fixing it? Further info: 1 In my Moz dashboard for the store site, the blog does not show at all as providing any inbound links or linking domains 2 Google Analytics also shows zero referral traffic to the store site from the blog since April 2015 3 Moz crawl issues is flagging ‘duplicate page content issues’ for pretty much every page of the blog, and the analysis provided suggests this may be related to tags but I have only basic SEO knowledge and am fast getting out of my depth here. 4 I have today altered the settings within the Yoast plugin on the blog to ‘noindex’ for Tags, Meta Robots, based on advice I have found in this section but am already well over my head and unsure even this is correct. An agency have been running SEO for the store since 2012 but since uncovering how little they have done in this time for the money paid, I am now taking matters back into my own hands. However I am on a very steep learning curve and this one is beyond me right now - please does anyone have any suggestions where I can start looking to uncover the root issue? Any guidance or advice would be greatly appreciated Thanks very much and hope to hear from someone!
Reporting & Analytics | | Warren_331 -
Site operator result anomaly
"Site:" search for site:http://www.mycity4kids.com/Bangalore/activity-based-approach is showing 76 results.I am using SERPS Redux to collect all the indexed pages, but when I re-checked indexed status of these pages using "site operator" google showed that these pages are not indexed. What is the possible explanation for this? Thanks
Reporting & Analytics | | prsntsnh0 -
Using Filters On Google Analytics
Hi Guys, I am In need of some technical help with regards to Google analytics. One of my clients has a site providing HR Software on SaaS basis. The users login via the site for the various HR Tools. Obviously this is skewing the analytics data and showing 10,000+ extra visitors that actually don't exist. Now I understand i can add filters and exclude certain pages for instance the login page, however, some users will visit the home page then click login. How can I exclude them just to show new visitors to the site rather than users that are just trying to login. Hope this makes sense Neil
Reporting & Analytics | | nezona0 -
Google Analytics Title tag vs landing page visitors numbers
Hi folks, Just wondering if anyone has any ideas as to why im getting different results in Google analytics. I'm using the Content Efficiency Analysis Report from http://www.kaushik.net which is absolutely awesome. When I search via my title tag I get 920 Unique Visitors over the month but when I search via the landing page URL with the same title tag I get 28. Any ideas to why their should be such a difference. I've also noticed that on that page i'm also getting a Rel Cononical TRUE using a site crawl. Any ideas are much appreciated
Reporting & Analytics | | acs1110 -
Is the link data from Open Site Explorer in real time or an average?
I just started using Open Site Explorer to track internal and external link data. Is this information given in real time or is it an average over a specified period of time?
Reporting & Analytics | | mequoda0 -
Should we "no-follow" archives or categories?
I'm reading some reports from my first crawl of 10K pages and I'm wondering if it's wise to mark the archives "no-follow." I have a WP tool that provides a tool that offers the no follow for categories or archives recommending to choose either one or the other but not both. What would be the best solution?
Reporting & Analytics | | JavaManOne0