Duplicate Content & Tags
-
I've recently added tags to my blog posts so that related blog posts are suggested to visitors.
My understanding was that my robot.txt was handling duplicate content so thought it wouldn't be an issue but after Moz crawled by site this week is reported 56 issues of duplicate content in my blog.
I'm using Shopify, so I can edit the robot.txt file but is my understanding correct that if there are 2 or more tags then they will be ignored? I've searched the Shopify documents and forum and can't find a straight answer. My understanding of SEO is fairly limited.
Disallow: /blogs/+
Disallow: /blogs/%2B
Disallow: /blogs/%2b -
If the only option is to disallow via the robots.txt, then I would agree with your setup - disallow the slugs specific to the tags you don't want indexed. I've heard shopify is a little rough to work with sometimes because of the limitations, so whatever you can do I think is better than nothing. Remember that the robots exclusion is treated as a suggestion and not a command, so if it's possible to assign a no-index meta tag to those URL types that would be best case.
Looks like you're on the right track with the post below:
{ % if handle contains "tagged" % }
{ % endif % }
The one suggestion I would make is that you use noindex,follow so the content will still be crawled, but the duplicate tag won't get indexed. That would create multiple paths to the content on your site, but not create an index bloat issue with multiple tags.
-
Yoast is a WordPress plugin, not Shopify so that option isn't available with the current CMS. Just wanted to chime in to make sure others aren't looking for Yoast SEO in the Shopify app store.
-
I'm using Meta Tagger as the SEO plugin, I've not heard of Yoast SEO but will certainly check it out.
I understand that I need to exclude the tags from being crawled and think I might have worked it out but I'm not 100% sure, as I mentioned my understanding is fairly limited.
My URL which is being seen as duplicate content looks like this
http://www.tangled-yarn.co.uk/blogs/news/tagged/sock-knitting
If I exclude the handle 'tagged' from being index this should work. I think the code should be
{ % if handle contains "tagged" % }
{ % endif % }
Do you think this will work?
-
Do you use Yoast SEO, or another plugin? The key is to set tags to no index so that the crawler only goes through your category links. The issue is that your tag URLs are being indexed and you don't want that. The option is under XML site map.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate content
Dear community, We have 15 product specific landing pages. They all share a block called "Why invest in VanEck ETFs?", see e.g., https://www.vaneck.com/de/en/mining-etf https://www.vaneck.com/de/en/space-etf/ https://www.vaneck.com/de/en/esports-etf/
Content Development | | marketing-europe
Can this lead to SEO penalization because of duplicate content?0 -
Hi friends, I have a question. I want to know how Google detects the author of a content.
Hi friends, I have a question. I want to know how Google detects the author of a content.
Content Development | | Alopii0 -
Stolen Content and a Panda Penalty
Hey Folks Question for those folks that have spent some time helping people with the recent penalties and the like. I have a client who has a clear Panda Penalty, huge drop in traffic on the initial Panda date and a further drop on the second date. Much smaller incremental drops on subsequent recent updates as well. From digging in it seems fairly cut and dry - copyscape shows another 250 or so sites with content from this site and there are nearly 2000 external URLs with duplicate content across these sites. We are talking complete, shameless copies of all of the text, sometimes the images as well. The client claims the content is all 100% unique and is his content and that the other blogs must have stolen his content resulting in the penalty - which, if it is true, and I have no reason to suspect otherwise, kind of sucks. Now, many moons ago, way before Penguin or Panda (maybe around 2006) I had a client that had suddenly lost all traffic and their historical rankings. No funny business, it was a small company, had been online since around 2000 and they were pretty much the first of their kind and always did very well from organic search. As it turned out, the content from the site had not really changed since it was set up and as lots of companies had sprung up offering a similar service they had seen their content copied wholesale, across many sites, all over the world. We attempted to contact many of these sites and got some results but many were just old, abandoned copy cat sites on advert supported hosting that had ceased to trade so we maybe got rid of about 20%. Well, in the end we just decided to rewrite the content, we did this and sure enough, the site bounced back to it's previous standing and has been pretty much there ever since. Now that was kind of easy, the site had maybe 20 pages, and it needed a sprucing up but in this case the site has around 500 pages so doing a rewrite is not going to be so easy. Problem is, I don't see removal requests being particularly successful either. So, I see the options and steps as being. Contact all the sites and request the removal of the content use the Google content removal facility:
Content Development | | Marcus_Miller
https://www.google.com/webmasters/tools/removals File a DMCA takedown for anything remaining Report Scraped Pages to Google:
https://docs.google.com/spreadsheet/viewform?formkey=dGM4TXhIOFd3c1hZR2NHUDN1NmllU0E6MQ&ndplr=1 Submit a spam report for all sites involved ? Submit a reconsideration request to let Google know what we have been doing (unlikely In a nutshell, do everything we can to get this content removed and then documenting this to Google in the hope we catch hold of someone who hears our plight. Interestingly enough, this is a sensitive one, so no URL but I would welcome any thoughts or experiences any of you may have had with similar problems. There is a little extra info here from Matt Cutts + Barry Schwartz that kind of tallies with my approach above but would really like to hear any feedback. http://www.seroundtable.com/google-stolen-content-13243.html Cheers all Marcus0 -
Creating the best content in your industry
Im currently working with a new client and their goal is to create the absolute best content in their industry. I've seen alot of articles on WHY to create the best content but not a lot on HOW to create the best content. Can anyone recommend a article they recall which talked more about the HOW. I'm looking for a process on how to create awesome content, how to go about it. Any suggestions?
Content Development | | monster990 -
How Google judge about duplicate content?
With recent Search engines updates one thing is clear we cannot ignore content. Content marketing definitely going to be most important part of our SEO strategy. I have few doubts about content marketing (circulation of content over web) where I want suggestions of community members. There would be different thoughts so I would like to have as many as responses to know what majority thinks: When we are writing guest posts, does article needs to be unique with each and every blog we are writing or we can safely circulate one good piece of content to 10-15 blogs who are interested in our creative. We have written a good blog post for our own domain. Apart from social sharing should it be posted to other related blogs too or it should be unique to our domain only. Social sharing, mentions, like of blog matters in rankings?Seems yes they do but need to know what majority thinks. Finally what is the safe number to circulate your content over web.
Content Development | | EG0CENTRIX0 -
Duplicate content on forums?
I am creating a forum. I am concerned that when I create the forum, users will copy content from other places to post onto my forum. How negative/bad is this in terms of google eyes? I am concerned when people copy press releases and re-post it to the forum. Should I make a rule that all content must be typed and not copied? Or is a little copying okay?
Content Development | | sseibel0 -
Keeping web site fresh re content
We currently use a wp blog but we don not host on the web domain. What are the advanatges to moving the blog to the domain The only 1 I can think of is every time we update the blog this should help keep the web site fresh with new content .
Content Development | | NotThatFast0 -
Should I Have No Index, No Follow On Blog Category & Tag Pages?
At some point in the past I read or was told that No Index, No Follow tags on category and tag pages were a good thing on a standard WordPress blog in order to prevent duplicate content issues. Is this still true or was it ever true?
Content Development | | eTundra0