Dupe Content: Canonicalize the Wordpress Tag or NoIndex?
-
Mozzers,
Here we go. I've read multiple posts for years on taxonomy dupe content. In fact, I've read 10 articles tonight on taxonomies and categories.
A little background: I am using Wordpress SEO with the Yoast plugin.
**Here is the scenario: We have 560 tags - some make sense - some do not. **
What do I do?
-
Do I not worry about it?
-
Matt Cutts said twice that I should not stress about it, because in the worse non-spammy case, Google may just ignore the duplicate content. Matt said in the video, “I wouldn’t stress about this unless the content that you have duplicated is spammy or keyword stuffing.” (Found Via Search Engine Land - http://searchengineland.com/googles-matt-cutts-duplicate-content-wont-hurt-you-unless-it-is-spammy-167459).
-
Do I NoIndex,Follow the Tags?
-
Yoast and a Moz post both say I should NoIndex and Follow the Tags. From the post: "Tag, author, and date archives will all look too similar to other content. So it does not make sense to have them indexed." BUT! **The tags have been indexed for YEARS! And both articles go onto say **"if your blog has already existed for some time, and you've been indexing tags all along for example, you shouldn't just go deindexing them" (http://moz.com/blog/setup-wordpress-for-seo-success).
-
So do I deindex tags that have been indexed for years? I checked the analytics, and in the past month, tags have brought in less than 1% of traffic, but they are bringing in traffic.
-
Do I canonicalize the tags?
-
Canonicalize the URL from "http://domain.com/blog/tag/addiction/" to "http://domain.com/blog/" ? And if I canonicalize, would you canonicalize to the /blog or to the base /tag?
Thanks for any and all help. I just want to clarify this issue. One of the reasons is because I received a Moz Report with a TON of dupe content warning from the tags and categories.
-
-
I sometimes put noindex on category pages too, because I have the manual excerpts also present on the blog index pages (the pages that list all blog posts in chronological order).
But is really a decision I make based on the type of website. There are websites where category pages present much interest then the chronological post listing. On these I don't use noindex, but I also do something to prevent duplicate content - I add some static text to the category pages so that they don't contain only the post excerpts.
-
Thanks Sorina,
I took your advice. Ill keep you posted on what happens. I ended up noindexing the tags, and should help the dupe content. I really love how you put it: "Does this page contain any unique content, content that can't be found anywhere else on the website."
Leaving the initial question just a bit, but staying within taxonomies, Sorina, what do you do about categories? Do you normally noindex them since they also do not contain unique content?
-
When deciding to index/noindex a page I always ask myself: "does this page contains any unique content, content that can't be found anywhere else on the website?"
For tags pages the answer is always NO. On a WordPress site Tag pages contain either automatic excerpts and this content is also available on the actual posts pages or in category pages, either manual excerpts and this content is also available in the category pages. So I decide to noindex tags.In your case, where the tags pages bring less then 1% search traffic I would noindex them without worrying.
-
No I always leave categories free to be indexed on eCommerce sites. (unless perhaps there is some keyword cannibalisation going on with another page). The first page any way, paginated pages get noindexed and sort pages get canonicaled/blocked.
The reason for this is that (usually) wordpress category/archive pages are just a list of articles with no other value. But with category pages on eCommerce sites I always add content on the page about the particular category and the products within it, making the page a useful page for visitors.
Also there is generally less duplication on eCommerce categories, as I only every use the product title and price, where as on a wordpress category there is going to be a snippet of a paragraph or two, or even the entire article that gets duplicated.
Categories pages can be very important on eCommerce sites, but i don't view wordpress categories as much so.
Again, people might argue about about whats right, and I don't think there is really a right or wrong answer; but i do know what works for me.
-
Thanks for that link! Been looking for the same info. Does this apply to eCommerce site setup as well???
-
Maximillian,
Thanks for the response. In fact, I just read that article. I enjoyed Harrison's analysis. I also made sure my pagination plugin was updated with rel=next and rel=prev are on the pages.
-
As you've pointed out, people argue until the cows coming home weather to noindex or not to noindex tags, categories, date and author archives; but using the canonical tag is definitely wrong in this case. The canonical tag should be used on pages that have the same content, perhaps just differing in order, but this might not be the case if you canonical a tag page to the blog start page, as there will be articles on that page that don't appear on the tag page.
So you are are left with just to noindex, or index these tag pages. Here is a great post on the subject:
No Indexing WordPress Taxonomies: Do or Don’t
To summarise, he found that if he noindexed the tag and category archives and installed a better pagination plugin, his traffic increased by 30% in two weeks.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Is it better to find a page without the desired content, or not find the page?
Are there any studies that show which is best? If you find my page but not the specific thing you want on it, you may still find something of value. But, if you don't you may associate my site with poor results, which can be worse than finding what you want at a competitor site. IOW maybe it is best to have pages that ONLY and ALWAYS have the content desired. What do the studies suggest? I'm asking because I have content that maybe 1/3 of the time exists and 2/3 of the time doesn't...think 'out of stock' products. So, I'm wondering if I should look into removing the page from being indexed during the 2/3 or should keep it. If I remove it then my concern is whether I lose the history/age factor that I've read Google finds important for credibility. Your thoughts?
Search Behavior | | friendoffood0 -
Correct approach to a business website with separate content for personal and business customers
I'm laying the groundwork for a fairly involved website. The website is for a telco that caters to both residential and B2B. I was browsing the websites of the likes of Verizon, AT&T, Sprint & T-Mobile. What I saw is that they compartmentalize almost everything - all their business pages are in a business subdoman, all their investor info is in an investor subdomain and so-on. So I'm going implement this strategy on this website update. I just want to make sure that my idea makes sense and isn't a complete cluster****. I've attached a link to the mind map. Everything with "(sub)" attached to it is a subdomain. Everything else is a page at the root level of the top domain. Most of the visitors we get to the website are residential, so instead of loading a portal at first and asking if they're there for person or business reasons, I'm considering forwarding all visitors to the top-level domain to the personal.example.com site. Is this okay or would it be better to just keep the content in the top-level rather than forwarding all traffic to a subdomain? Thank you! 1JY7DWw
Search Behavior | | CucumberGroup0 -
Content marketing where articles aren't high traffic
Hello, If no one is writing articles in your niche and articles are very scarce in the top 100 landing pages, what does that tell you about content and content marketing in your niche
Search Behavior | | BobGW0 -
Best way to remove worthless/thin content?
I have a Wordpress site with about 3,000 pages and 1,000 of those are no value/duplicate content and drive no traffic. They are blog posts each with a single image and permalinks like example.com/post1, example.com/post2 etc. I've started by deleting pages and 301 redirecting to relevant pages that actually have content. Is deleting and 301 redirecting the best route? Is 1,000 to many 301 redirects? Should I just delete the pages that aren't really relevant to anything else? Anything else I should know about deleting all of these pages? Any help would be great!
Search Behavior | | gfreeman230 -
Decline in engagement metrics, due to nav changes vs. content changes
With improvements in our rankings, we are seeing adverse changes in our measures of engagement. My gut reaction is to believe we are attracting more unqualified traffic, thus higher bounce rates, declines in pages/visit and time on site (approx 15%, 15%, 25%, respectively). While recent improvements in navigation might have contributed to these engagement declines, do you have any suggestions how best to determine whether these declines are due to nav changes vs. due to copy/content issues? There's been no change in copy content during this period. Thanks.
Search Behavior | | ahw0 -
Books about Content Marketing & Persona creation?
Hello SEOmoz, First question here on the forum, but a silent follower for years 🙂 I'm looking for a good book - you define what is "good" - about content marketing and or persona creation that you read and proved to be usable in real-life situations I've read "Accelerate" but found it too light-weight. Therefore your recommendations would come very in handy. Looking forward to your replies! Best regards, Nikolaas
Search Behavior | | TheReference1 -
Has anyone yet seen any Search or User benefits from implementing any Schema tags (as in Schema.org) ? Thanks
Schema tags have been around for a while, am really interested in knowing of any noticeable benefits seen by anyone who has had experience of implementing Schema tags.
Search Behavior | | SimonCullum0 -
Google Is Displaying An Alt image tag as my homepage's page title
Google is randomly displaying the alt image tag as the page title for my homepage. It happens when you search for the brand name, but the page title appears as "BrandName Logo" (obviously not "BrandName"). Has anybody seen this happen before?
Search Behavior | | MichaelWeisbaum0