What is the best way to resolve a duplicate content issue?
-
Hi
I have a client whose site content has been scraped and reused on numerous other sites. This is hurting the site's rankings: for one term we want to rank for, we are nowhere to be found.
My question is this: what's the quickest way to resolve a duplicate content issue when other sites have stolen your content?
I understand that maybe I should first contact these site owners and 'appeal to their better nature'. This will take time and they may not even comply.
I've also considered rewriting our content. Again, this takes time.
Has anybody experienced this issue before? If so, how did you arrive at a solution?
Thanks in advance.
-
No worries Alex
I mean, contacting the webmasters would technically be simpler, but the chances that you're going to get a response, never mind a take-down of your content, are going to be pretty slim. Hence I suggested the rewriting.
It's a pain in the arse and requires you to do more work because of someone's laziness, which of course isn't right. But hopefully, with the fresh content and the tags in place, you'll be given the full credit.
In addition, if any of the content comes in the form of blog posts, or if you'd like to do this site-wide, implementing a rel=author tag and verifying Google authorship would again be a signal to Google that your content is original. Here are a couple of handy guides to help with the markup:
http://searchengineland.com/the-definitive-guide-to-google-authorship-markup-123218
http://www.vervesearch.com/blog/seo/how-to-implement-the-relauthor-tag-a-step-by-step-guide/
-
Hi Tom
That's a great help.
I just wanted to make sure there wasn't a simpler solution besides rewriting the content. I guess rewriting is the easiest option, and I will make sure the canonical tag solution is implemented too.
Thanks.
-
Hi Alex
I think the best solution here and the one that you can control the most is to rewrite the content and then ensure that your new content is seen as the originator.
Rewriting the content will take time, but obviously ensures that the content is unique, removing the duplicate content issue.
If I were you, I would then use a rel=canonical tag solution, so that every existing page (and every new page) has a canonical tag on it.
Among other things, this signals to Google that your site is the originator of this content, and that any other versions of it on your site or across the web are there purely for user experience and therefore should not be ranked over the original.
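To make the canonical tag idea concrete, here is a minimal sketch in Python of generating a self-referencing canonical tag for a page. The function name `canonical_tag` and the example.com URL are made up for illustration, and stripping the query string and fragment is an assumption that only suits sites where those parts do not change the page content.

```python
from html import escape
from urllib.parse import urlsplit, urlunsplit


def canonical_tag(page_url: str) -> str:
    """Return a self-referencing canonical <link> element for page_url.

    Hypothetical helper for illustration; it is not part of any CMS or SEO tool.
    """
    parts = urlsplit(page_url)
    if parts.scheme not in ("http", "https") or not parts.netloc:
        raise ValueError(f"canonical URL must be absolute: {page_url!r}")
    # Assumption: the query string and fragment (tracking parameters, anchors)
    # do not change the content, so they are dropped from the canonical URL.
    clean = urlunsplit(
        (parts.scheme, parts.netloc.lower(), parts.path or "/", "", "")
    )
    return f'<link rel="canonical" href="{escape(clean, quote=True)}">'


print(canonical_tag("https://www.example.com/services/?utm_source=twitter#top"))
# → <link rel="canonical" href="https://www.example.com/services/">
```

The resulting tag goes in the head of the page it describes. Using an absolute URL matters here, because a relative one would resolve against whatever domain the copy ends up on.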
As you will be publishing the content first, it should be crawled first by the search engines as well. To help ensure that it is, I would also share your pages on social media when they go live, as this can help the pages get indexed more quickly.
This way, the site scraping your content should (in theory) not be able to rank for the content - or at the very least will be seen by Google as the copier of the content, while you will be seen as the originator, due to being indexed first with the canonical tag.
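One way to check this in practice is to look at whether a copied page still carries a canonical tag pointing back at your URL. The sketch below uses Python's standard `html.parser`; the names `CanonicalFinder`, `find_canonical` and `points_back_to`, and the example.com URLs, are hypothetical. It assumes the scraper copied the page head along with the body, which many scrapers will not do.

```python
from html.parser import HTMLParser
from typing import Optional


class CanonicalFinder(HTMLParser):
    """Records the href of the first <link rel="canonical"> it sees."""

    def __init__(self) -> None:
        super().__init__()
        self.canonical: Optional[str] = None

    def handle_starttag(self, tag, attrs):
        if tag != "link" or self.canonical is not None:
            return
        attr = dict(attrs)
        # rel may hold several space-separated values and is case-insensitive.
        rel_values = (attr.get("rel") or "").lower().split()
        if "canonical" in rel_values:
            self.canonical = attr.get("href")


def find_canonical(html_text: str) -> Optional[str]:
    """Return the canonical URL declared in html_text, or None if absent."""
    finder = CanonicalFinder()
    finder.feed(html_text)
    return finder.canonical


def points_back_to(html_text: str, original_url: str) -> bool:
    """True if the page declares original_url as its canonical URL."""
    return find_canonical(html_text) == original_url


scraped_copy = (
    '<html><head><link rel="canonical" '
    'href="https://www.example.com/services/"></head><body>...</body></html>'
)
print(points_back_to(scraped_copy, "https://www.example.com/services/"))
# → True
```

The same `find_canonical` function can be run over your own pages to confirm that none of them is missing the tag.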
You can read more on canonicals with this handy Moz guide.
Hope this helps.