HELP! How do I stop scraper sites - is there any recourse?
-
Our site has lots of unique content and photos and it is constantly being scraped and posted on other websites. Most of these are no-name sites that pop up and exist for adwords revenue.
Aside from the fact that we don't want our content being copied, this is an SEO nightmare because they often link back to us from pages that are stuffed with keywords and have very low domain authority (it's a form of negative SEO).
My question is:
Does anyone have experience with fighting this phenonmenon?
What have you done that is effective?
Does anyone have experience with a service such as http://www.dmca.com/ProtectionPro.aspx ? Does it work/is it worth it?
Any input is appreciated!
-
Nice link Mark. News to me, really. But the fact that Schema.org and HTML5 both have author identification methods shows that it may be used by other search engines and/or services. And the followup article to your link there is "Google Authorship May Be Dead, But Author Rank Is Not." http://searchengineland.com/google-authorship-dead-author-rank-202254
But darn, man! All that time wasted getting authorship to work back then. Google's authorship verification process was indeed grueling.
-
I agree with everything besides for the authorship markup bit. Authorship markup is not being tracked by Google anymore - see http://searchengineland.com/goodbye-google-authorship-201975.
That said, the larger point about being the first content to go up is a good one. If we can all figure out where the original is from, assume that Google can too.
-
Kevin has a really good point here. You need to input markup that tells Google that the content is yours. I find that adding self-referential canonical tags can help with this. Just be careful to input them correctly.
-
Two schools on that one. They may not be hurting your business now, so you can forget about them. That's only until you can't. If they continue rip off your work, they may take from you in the future--ad revenue, traffic stats, e-commerce, news reports, whatever you're doing--that's all money. If I had time to fill out the form, I'd do it.
-
First thing to do is insert authorship markup and check that google recognizes you as an author of the site you're posting to. There is something to say for original content, and Google knows. If your content goes up first and is indexed first by Google, chance are you're going to rank better than the scrapper sites.
If these sites really bother you, you can submit a Copyright Removal form here https://www.google.com/webmasters/tools/dmca-notice, but a legal order to remove the content would be better (acted upon faster). Filing copyright infringement reports for eBay listings was very effective for me, but my experience with Google is limited. Let us know if you do file and how the process goes.
Generally speaking, it's actually pretty good that site are linking to your posts. If you are extremely uncomfortable with any particular site's backlink(s), you can use the GWT Disavow tool https://support.google.com/webmasters/answer/2648487/?hl=en&authuser=1
Good luck, and let us know what you do.
-
Yes agreed but if you are seeing that scrapers sites outrank your sites in SERPs in that case you should fill the form.
Thanks
-
Thanks for the reassuring response, Alick.
Based on what you're saying (and that post from Niel Patel) it's a waste of time to even fill out Google form (these sites are not outranking us). Agree?
-
Hi ,
First let Google know about this by using this form @ https://docs.google.com/forms/d/1Pw1KVOVRyr4a7ezj_6SHghnX1Y6bp1SOVmy60QjkF0Y/viewform
Second I would like to tell you that its myth that scrapers will hurt your Site. Scrapers don’t help or hurt you. Do you think that a little blog in Asia with no original writing and no visitors confuses Google? No. It just isn’t relevant.
To know more on this please visit below URL
https://blog.kissmetrics.com/myths-about-duplicate-content/
Thanks
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Language Tunnel - Help!
Hi, First post here. A few months back (before they were my client), my client updated their site to include a language tunnel. It looks like some other updates were made as well to "prettify" the site's URLs. Unfortunately, after this update, lots of well-ranking landing pages are now completely gone with no redirects in place. Normally, I would just give them a list of these old pages and say "301 Redirect" to X page. However, as part of this site update, they added country code into the mix. So now, instead of just 6 or 7 languages, we are looking at 30-40 permutations of language and country (with some countries having multiple languages). The functionality of the new site is fine, but all of the old 404s are not being kind to the search engine traffic. My question is: what's the best way to resolve this problem? These old pages usually specify a language code (but no country code). So, for example, I am thinking of redirecting all of the Spanish 404 urls to a Spanish "country tunnel". However, this is obviously not the same as what we had before, where the actual pages were indexed. Since my old pages no longer exist and I've got this country problem now (to stand in the way of a straightforward redirect), is there any way to appease the SEO gods on this?
Intermediate & Advanced SEO | | navdm0 -
Only the mobile version of the site is being indexed
We've got an interesting situation going on at the moment where a recently on-boarded clients site is being indexed and displayed, but it's on the mobile version of the site that is showing in serps. A quick rundown of the situation. Retail shopping center with approximately 200 URLS Mobile version of the site is www.mydomain.com/m/ XML sitemap submitted to Google with 202 URLs, 3 URLS indexed Doing site:www.mydomain.com in a Google search brings up the home page (desktop version) and then everything else is /m/ versions. There is no rel="canonical" on mobile site pages to their desktop counterpart (working on fixing that) We have limited CMS access, but developers are open to working with us on whatever is needed. Within desktop site source code, there are no "noindex, nofollow, etc" issues on the pages. No manual actions, link issues, etc Has anyone ever encoutnered this before? Any input or thoughts are appreciated. Thanks
Intermediate & Advanced SEO | | GregWalt0 -
Site redesign..have I done everything?
Hello, We have a site that was recently put through the redesign process-a couple of weeks ago. It was a tired site that was optimized well, but still struggled because it was so outdated. I went ahead and re-optimized, submitted a new sitemap, and did the fetch. Have I missed a step? Could someone offer insight into what they do when a site is redesigned and the steps taken to make sure that Google crawls and "appreciates" 🙂 the new site as soon as possible? Thanks in advance for any and all help!
Intermediate & Advanced SEO | | lfrazer0 -
Site Navigation
Hi Mozzers, I am an SEO at uncommongoods.com and looking for your opinion on our site nav. Currently our nav & URLs are structured in 3 levels. From the top level down, they are: 1. Category ex: http://www.uncommongoods.com/home-garden 2. Subcat ex: http://www.uncommongoods.com/home-garden/bed-bath 3. Family ex:http://www.uncommongoods.com/home-garden/bed-bath/bath-accessories Right now, all levels are accessible from our top nav but we are considering removing the family pages. If we did that, Google could still find & crawl links to the family pages, but they would have to drill down to the subcat pages to find them. Do you guys think this would help or hurt our SEO efforts? Thanks! -Zack
Intermediate & Advanced SEO | | znotes0 -
Seo flash site
Hey. Would hear whether it is possible to SEO a website which is flash site cms?
Intermediate & Advanced SEO | | Agger0 -
I'm pulling my hair out trying to figure out why google stopped crawling.. any help is appreciated
This is going to be kind of long, simply because there is a background to the domain name that is not typical to anybody in the world really and I'm not sure if its possible that it was penalized or ranked lower because of that or not. Because of that I'm going to include it with the hopes that giving the full picture some nice soul in the world who has more knowledge in this than me see's something or knows something and can point me in the right direction. Our site has been around for a few years, at one point the domain was seized by homeland security ICE, and then they had to give it back in Dec. which sparked a lot of the SOPA PIPA stuff and we became the poster child so to speak. The site had previously been up since 2008, but due to that whole mess the site was down for 13 months on the dreaded seized server with a scary warning graphic and site title which caused quite obviously a bunch of 404 errors and who knows what else damage to anything we'd had before that as far as page rank and incoming links. we had a lot of incoming links from high quality sites. We were advised upon getting the domain back to pretty much scrap all the old content that was on the site prior and just start fresh.. which we did. Googlebot started crawling slowly, but then as we started getting back into the swing of things people started linking to us,some with high page rank, we were getting indexed quite frequently and ranking high on search results in our niche.. Then something happened on March 4th, we had arguably our best day with google traffic, we'd been linked back by places like Huff Post etc for content in our niche.. and the next day literally it was a freefall. Darn near nothing. I've attached a screen shot from webmaster tools so you can see how drastic it was. I went crazy, trying to figure out what was wrong, searching obsessively through webmaster tools looking for any indication of a problem, searched the site on google site:dajaz1.com and what comes up is page 2 page 3 page 45 page 46. It's also taken to indexing our category and tag pages and even our search pages. I've now set those all to noindex follow but when I look at where the googlebots are at on the site, they're on the categories, pages, author pages, and tags. Some of our links are still getting indexed, but doing a search just of our site name and we're ranking below many of the media sites that have written about our legal issues, when a month ago we were at least top result for our own name. I've racked my brain trying to figure out the issue. I've disabled plugins, I'm on fetch as google bot all the time making sure our stuff is at least coming out as 200 (we had 2 days where we were getting 403 errors due to a super-cache issue, but once fixed googlebot returned like it never left) I've literally watched 1000 videos, read 100 forums, added in SEO plugins, tried to optimize the site to the point I'm worried I'm over doing it.. and still they've barely begun to crawl. As you can see there is some activity in the last 2-3 days, but even submitting a new site map once I changed the theme out of desperation it's only indexed 16. I've looked for errors all through webmaster tools and I can't find anything to tell me why that happened, how to fix it, and how to get googlebot to like us again. I'm pulling my hair out here. The links we have incoming are high quality links like huffington post , spin, complex, etc. Those haven't slowed down at all, we do outgoing links to sites we trust and are high quality as well. I've got interns working on how they're writing titles and such, I've gone through and attempted to fix duplicate pages and titles.. I've been going through and re-writing meta description tags What am I missing? I'm pulling my hair out trying to figure out what the issue is. Eternally grateful for any help provided. jnzb6.png
Intermediate & Advanced SEO | | malady0 -
Site comparison - what is wrong with me?
www.bcspeakers.com/ vs www.psbspeakers.com/ with the search term "speakers" why does BC speakers show up in around #50-60 and PSB is not in the top #1000? From all metrics on seomoz PSB kicks BC in every area by a large margine! can anyone see why BC is listed for that keyword and PSB is not?
Intermediate & Advanced SEO | | kevin48030 -
How do you prevent the mobile site becoming a duplicate of the full browser site?
We have a larger site with 100k+ pages, we need to create a mobile site which gets indexed in the mobile engines but I am afraid that google bot will consider these pages duplicates of the normal site pages. I know I can block it on the robots.txt but I still need it to be indexed for mobile search engines and I think google has a mobile crawler as well. Feel free to give me any other tips that I should follow while trying to optimize the mobile version. Any help would be appreciated 🙂
Intermediate & Advanced SEO | | pulseseo0