Block /tag/ or not?
-
I've asked this question in another area but now i want to ask it as a bigger question. Do we block /tag/ with robots.txt or not. Here's why I ask:
My wordpress site does not block /tag/ and I have many /tag/ results in the top 10 results of Google. Have for months. The question is, does Google see /tag/ on WordPress as duplicate content? SEOMoz says it's duplicate content but it's a tag. It's not really content per say.
I'm all for optimizing my site but Google is not penalizing me for /tag/ results.
I don't want to block /tag/ if Google is not seeing it as duplicate content for only one reason and that's because I have many results in the top 10 on G.
So, can someone who knows more about this weigh in on the subject for I really would like a accurate answer.
Thanks in advance...
-
Thanks for all the info. Last question, does having a list of monthly archives on the bottom of my site hurt in terms of dup content? I just have at the bottom the month/year and when you click it, it shows all the posts in that month. Should I be removing this or does it matter?
-
It would be meta noindex. Yoast is my plugin of choice. Happen to have a little article right here if you need to see if its "safe" to remove them from a traffic standpoint.
-Dan
-
I use All in one SEO pack and have checked noindex for the tags and the categories and the archives. I suppose it doesn't make any difference if I do it there or in the robots.txt file. Either way their being blocked. Do you know if there's a penalty for having blocked them in WP and the robots file?
-
I'd say noindex, follow them - many SEO plugins can do this for you, Yoast SEO for example. That way Googs can still crawl them, which may assist with discovery, but won't index them.
-
Exactly what I was looking for. Thank you!
So, I suppose the best and proper way to block it is by robots.txt correct?
-
You mean "more about this" than me? I run 3 businesses on 3 Wordpress blogs. I've done the research. Many of my clients are Wordpress users. But here's what others think:
- Yoast thinks it's duplicate content: http://yoast.com/articles/wordpress-seo/#advancedseo
- David Fuller ranked for tags then didn't: http://www.seomoz.org/q/wordpress-tags-duplicate-content Same link Dan at Evolving thinks you should noindex tags as well.
- WPMU and Matt Cutts think it's duplicate content: http://wpmu.org/categories-tags-and-how-to-avoid-duplicate-content-on-wordpress/
- How to Tech thinks it's duplicate content: http://howtotechtips.com/remove-wordpress-duplicate-content-search-results-and-tags-from-google/
- As you said, SEOMoz thinks it's duplicate content.
- Many Warriors suggest noindexing tags for dupe content reasons: http://www.warriorforum.com/adsense-ppc-seo-discussion-forum/373744-wordpress-tags-death-me-duplicate-content-question.html
- 3 other pro SEOs say to noindex here: http://www.seomoz.org/q/solving-link-and-duplicate-content-errors-created-by-wordpress-blog-and-tags
Google search shows
_No results found for _"tags do not create duplicate content".
No results found for "tags are not duplicate content".
And 2.5 million results for tags "duplicate content"
The short term answer is that you're ranking for them now so leave them be.
The long term answer is it's duplicate content and you need to fix it.
Even if your tag pages don't show the entire post, multiple tag pages show the same excerpt. This is duplicate content. By itself - not even talking about the post.
**You said: **_SEOMoz says it's duplicate content but it's a tag. It's not really content per say. _
If you want to see with your own eyes the duplicate content, please post a URL.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Do you need a canonical tag for search and filter pages?
Hi Moz Community, We've been implementing new canonical tags for our category pages but I have a question about pages that are found via search and our filtering options. Would we still need a canonical tag for pages that show up in search + a filter option if it only lists one page of items? Example below. www.uncommongoods.com/search.html/find/?q=dog&exclusive=1 Thanks!
Technical SEO | | znotes0 -
Should I use canonical tag in these cases?
Should I use canonical tag in these cases? On the page itself (with the tag pointing to itself) On pages that doesn't have duplicate versions
Technical SEO | | GoMentor0 -
Redirecting root to /default.aspx
Hello, I have a client who's home page redirects to /default.aspx - what are the possible SEO impacts of this? As the home page redirects to /default.aspx and does not load under both there does not seem to be a duplicate content issue. Also the redirect should carry over most of the link juice from the home page to /default.aspx therefore are there any negative SEO "side effects" of this set-up? Thanks in advance!
Technical SEO | | RikkiD220 -
Help - we're blocking SEOmoz cawlers
We have a fairly stringent blacklist and by the looks of our crawl reports we've begin unintentionally blocking the SEOmoz crawler. can you guys let me know the useragent string and anything else I need to enable mak sure you're crawlers are whitelisted? Cheers!
Technical SEO | | linklater0 -
Trackback/Syndication
Using wordpress or any other blog to properly syndicate an article without duplication risk. Can I trackback by just leaving a link to the original within or at the bottom of a post or is there a specific code to add.. What is the best way to trackback?
Technical SEO | | SEODinosaur0 -
OK to block /js/ folder using robots.txt?
I know Matt Cutts suggestions we allow bots to crawl css and javascript folders (http://www.youtube.com/watch?v=PNEipHjsEPU) But what if you have lots and lots of JS and you dont want to waste precious crawl resources? Also, as we update and improve the javascript on our site, we iterate the version number ?v=1.1... 1.2... 1.3... etc. And the legacy versions show up in Google Webmaster Tools as 404s. For example: http://www.discoverafrica.com/js/global_functions.js?v=1.1
Technical SEO | | AndreVanKets
http://www.discoverafrica.com/js/jquery.cookie.js?v=1.1
http://www.discoverafrica.com/js/global.js?v=1.2
http://www.discoverafrica.com/js/jquery.validate.min.js?v=1.1
http://www.discoverafrica.com/js/json2.js?v=1.1 Wouldn't it just be easier to prevent Googlebot from crawling the js folder altogether? Isn't that what robots.txt was made for? Just to be clear - we are NOT doing any sneaky redirects or other dodgy javascript hacks. We're just trying to power our content and UX elegantly with javascript. What do you guys say: Obey Matt? Or run the javascript gauntlet?0 -
How to block/notify google that your domain has been added to sites with very low trustworthiness?
Hey Guys, I am writing to SEOmoz community because a problem occurred which I do not know how to solve: My domain (xyz.com) occured on very strange sites with very low trustworthiness (even blocked by google). Checking the site, I found out that all of the pictures were ALT=xyz.com. Could this hurt my position of my site on google rankings? How to prevent such actions, what should I do? Thanks for you help in advance!
Technical SEO | | Kajmany0 -
Duplicate content and tags
Hi, I have a blog on posterous that I'm trying to rank. SEOMoz tells me that I have duplicate content pretty much everywhere (4 articles written, 6 errors at the last crawl). The problem is that I tag my posts, and apparently SEOMoz thinks that it's duplicate content only because I don't have so many posts, so pages end up being very very similar. What can I do in these situations ?
Technical SEO | | ngw0