Stolen Content and a Panda Penalty
-
Hey Folks
Question for those folks that have spent some time helping people with the recent penalties and the like.
I have a client who has a clear Panda Penalty, huge drop in traffic on the initial Panda date and a further drop on the second date. Much smaller incremental drops on subsequent recent updates as well.
From digging in it seems fairly cut and dry - copyscape shows another 250 or so sites with content from this site and there are nearly 2000 external URLs with duplicate content across these sites.
We are talking complete, shameless copies of all of the text, sometimes the images as well.
The client claims the content is all 100% unique and is his content and that the other blogs must have stolen his content resulting in the penalty - which, if it is true, and I have no reason to suspect otherwise, kind of sucks.
Now, many moons ago, way before Penguin or Panda (maybe around 2006) I had a client that had suddenly lost all traffic and their historical rankings. No funny business, it was a small company, had been online since around 2000 and they were pretty much the first of their kind and always did very well from organic search.
As it turned out, the content from the site had not really changed since it was set up and as lots of companies had sprung up offering a similar service they had seen their content copied wholesale, across many sites, all over the world.
We attempted to contact many of these sites and got some results but many were just old, abandoned copy cat sites on advert supported hosting that had ceased to trade so we maybe got rid of about 20%.
Well, in the end we just decided to rewrite the content, we did this and sure enough, the site bounced back to it's previous standing and has been pretty much there ever since.
Now that was kind of easy, the site had maybe 20 pages, and it needed a sprucing up but in this case the site has around 500 pages so doing a rewrite is not going to be so easy.
Problem is, I don't see removal requests being particularly successful either.
So, I see the options and steps as being.
- Contact all the sites and request the removal of the content
- use the Google content removal facility:
https://www.google.com/webmasters/tools/removals - File a DMCA takedown for anything remaining
- Report Scraped Pages to Google:
https://docs.google.com/spreadsheet/viewform?formkey=dGM4TXhIOFd3c1hZR2NHUDN1NmllU0E6MQ&ndplr=1 - Submit a spam report for all sites involved ?
- Submit a reconsideration request to let Google know what we have been doing (unlikely
In a nutshell, do everything we can to get this content removed and then documenting this to Google in the hope we catch hold of someone who hears our plight.
Interestingly enough, this is a sensitive one, so no URL but I would welcome any thoughts or experiences any of you may have had with similar problems.
There is a little extra info here from Matt Cutts + Barry Schwartz that kind of tallies with my approach above but would really like to hear any feedback.
http://www.seroundtable.com/google-stolen-content-13243.html
Cheers all
Marcus
-
Hey, I used copyscape to locate all the content and have suggested copyscape sentry going forward. The problem is for this site the scale of the copying, it seems to go back several years and is pretty widespread.
Cheers!
Marcus
-
Hey Egol
Well, this was kind of my initial suggestion to the client. I simply don't think that the pain and suffering and ultimately waiting to get this resolved is worth the effort and a rewrite is likely the easiest (if still painful option).
I guess, I just want to give this guy all of the options, and my advice so if they want to have a shot at getting the content taken down then they can have at least have a go. This way, I can advise, show the various pathways and my experience but they can choose how to tackle the issue.
At times, this job is a lot like dealing with my kids, I give advice based on years of painful experience, they choose the difficult path, what can you do?
For some folks, the fact that this happens is too much for them to take on board and despite it being exactly what it is - I do understand the 'digging your heals in' approach to wanting to get other sites to take it all down - i also from painful experience know that sometimes you just have to take your punches and get on with a rewrite.
Thanks for the input!
Cheers!
Marcus -
Well DMCA will hurt more than help. (the time you lose) but what you can do is to grad a copyscape account, upgrade it at pro and keep track of all your unique content. If your sales are based on this factor is worth trying. Also have a look at the hosting providers, they may have the same rules.
Act as fast as copyspace announce you there is a problem. Now you may ask me how to act? Well all the steps are listed above. Google rules still apply for filling a DMCA request at chillingeffects they can remove content from google search results.
Take a look at this screenshot Is in romanian, but it says that Chillingeffects took actions are removed x pages/websites duo to DMCA.
-
Glad to hear. Please give us updates on how everything is going.
-
I sell a few specialty outdoor sport items that are branded by a US company but manufactured in China. Several years ago I wrote unique, detailed descriptions for these items that were much more detailed than the brand owner's.
My pages used to rank really well for the generic item names (similar to "rock climbing shoes"). Then at least 100 "made in China" websites grabbed my descriptions and posted them verbatim. My rankings tanked in Google. I didn't even get much long tail from google.
I felt that it was a waste of time to contact all of those websites and try to get them to stop using my content. They are outside of the USA and they would probably laugh at a DMCA.
So, I have a choice of rewriting that content or discontinuing sales.
-
Hey, great advice, many thanks. The hosting provider is a great idea, allows us to go in via the back door, I like it.
-
I may not be a guru, but if the story is true (uniquer content), there are several steps you can take to regain the traffic and lose the penality. I'm glad to see the steps already listed on your comment, but there are too many and I would recommend you to focus on this 3 steps +1 extra step.
-
Remove any website which stole the content from your client using the link you have provided.
-
File a DMCA - This is a must! Stolen content is stolen so you have all the rights to file a DMCA.
-
Report the scrapped content. Now here is the catch. Give clear informations. Google staff will not sit to check the original post dates, names of who made the content or anything else so besure to offer this informations, even screenshots if you/your client have. Also have a look on waybackmachine and if the pages are stored there be sure to give the link to it!
You need to do all this work to prove your client content is unique and it was scrapped/copied.
Extra Step: Try to find where are that websites hosted and file a DMCA to the hosting provider (fastes way). Let them know they are hosting website which have copied/scrapped content and that you are going to take all the actions against them if they do not take that website/page down. 95% of the hosting providers have a rule which says "no illegal content allowed".
This is how I normally act with this kind of situations.
I may not be to helpfull in your situation, but this is how I normally act.
-
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Why should I avoid publishing off-topic content on my website?
As a fun project, my team wanted to build a mini-food blog based off the lunches we make here at our office -- but we're a software company and, topically, our product has nothing to do with food. Therefore, I suggested that we not publish this content on our website + create a Medium publication instead (this would also help us avoid the headache of creating an entirely new section of our website / potential 404 issues from non-technical editors / etc.) However, I struggled to articulate _why _it's a best practice to only publish relevant content on your website. Is it to help search engines understand what your website is about as an entity? Spam signals?
Content Development | | AsanaOps0 -
Curated content on page one of google for medium competition keywords?
Has anyone here ranked curated content on page one of Google for medium competition keywords?
Content Development | | jtbaker19710 -
Repurpose/reuse blog content - email address to make available for this
Hi, From Rand's recent Whiteboard Friday, I learned this: "Have right on the blog page the email address to use if they want to repurpose/reuse the content. That way if someone wants to give us a backlink and quote/reference our blog, they have an easy way to get permission." My question is, what do I say with the email address when I list our contact email? Something like 1. Just list the email address 2. "To reuse/repurpose our content, please contact adress@email.com." or something else?
Content Development | | BobGW0 -
Need to know about content marketing strategy
Hi, Can anybody guide me to a document or a presentation that elaborates how the content for overall Internet marketing strategy should be developed? And how do people beyond marketing department contribute to its success? Regards
Content Development | | IM_Learner0 -
301 Redirect & Duplicate Content
We currently have 16465 audiobook products presented at our Web store. 5411 of them are out-of-publication (OOP). Here's an example: Harry Potter Audiobook 2 : Harry Potter and the Chamber of Secrets - J.K. Rowling - cassette audiobook Many of the 5411 OOP products are duplicates and triplicates of one title but were offered on a different medium (cassette, CD or MP3 CD) or were a different type (abridged, unabridged, dramatized). The description (story-line) is the same for all. Because we know once a page gets on the Internet, it can live there for years, we decided to keep OOP product pages at our Web store to: Let those who may have searched for the product and clicked on a link to an OOP product's page that it was no longer available. Invite them to explore our Web store. Let them know that although the product may not be available on cassette, CD or MP3 CD, that it might be available as a digital download. We know that Google does NOT like duplicate content from one site to another and even within the same site. If we redirect all the 5411 pages to one OOP page, will this eliminate this duplicate content issue? The OOP page would explain that the title they were looking for is no longer available but that it might be available as a digital download.
Content Development | | lbohen0 -
Best way to resolve duplicate content issue?
Not sure about what to do about this - I have a client who has a ton of pages (around 1200) which are all City specific pages, for long-tail search. These are all written with paragraphs in the format such as: Order to [City] today. So every page has essentially the same content. The site also only has 1562 pages, so with 1200 of them being City-specific same-content pages, that can't be good. However the problem is that these pages still rank very well (usually Position 1 or 2) for the terms they're targeting, and bring in enough traffic and revenue to justify their purpose. We also have Country specific pages, and these are all with unique content, rather than the scripted content on the City pages. So for example, for Italy we might have: Italy Page (Unique Content) Rome (Duplicate Content) Milan (Duplicate Content) Venice (Duplicate Content) etc. (Duplicate Content) For a low traffic country (Austria), we tried to 301 the City pages to the Country page, but that only resulted in us seeing a drop in search results for the city keywords, from (usually) Position 1 to more like Page 3 or 4, so quite a drop. So, without writing 1200 pages worth of unique content, what would your advice be?
Content Development | | TME_Digital0 -
Onsite Content - Word Count & KW Density
Does the word count of a webpage make a difference to search engines? Are longer word counts on pages indexed higher or given higher priority? For example,say you have 300 words of copy packed with 20 keywords, and say you also have 700 words of copy that have the same 20 keywords worked in, does Google have a preference over which one it ranks higher?
Content Development | | greentent0 -
Duplicate content
Hello Seomoz team, i'm french and so my english is not very good ;-). I work for a brand site and we publish content about our products. The problem is : as a brand site, many sites that sell our products, copy our content. And we have duplicate content. And since these sites have worked SEO, they put in place rel canonical tag. as a brand, how to avoid being accused by Google duplicate content? tanks for you answer. I hope it's clear. Take care Denis
Content Development | | android_lyon0