Website content has been scraped - recommended action
-
So whilst searching for link opportunities, I found a website that has scraped content from one of our websites. The website looks pretty low quality and doesn't link back. What would be the recommended course of action?
-
Email them and ask for a link back. I've got a feeling this might not be the best idea. The website does not have much authority (yet) and a link might look a bit dodgy considering the duplicate content
-
Ask them to remove the content. It is duplicate content and could hurt our website.
-
Do nothing. I don't think our website will get penalised for it since it was here first and is in the better quality website. Possibly report them to google for scraping?
What do you guys think?
-
-
It's good to be aware of the scrapers to see what they are trying to do with your content, and it can't hurt to ask them to remove it.
Don't ask for a link, you never want links for sites that rely on bad practices like that, it can hurt you.
This is most likely not effect you if left alone. If the scraper is grabbing from source code, then implementing a canonical tag in your content will help Google know where the content came from (but they probably already know).
-
Most of the time, contacting them is a waste of time. Being a weasel is their business model. Weasels usually have hidden domain registration data so finding their contact information is really hard.
If they have republished my content on blogspot, youtube, facebook or other community sites, I simply file a DMCA and the content is usually taken down quickly.
I don't want duplicates of my content on the web, especially not on powerful sites. Powerful sites are generally more responsible than Joe Schmoe working in his basement. Often just an email to them with "copyright infringement on yourdamndomain.com will get your content taken down. I've called people on the phone to tell them that they have my stuff on their site and that is faster than filling out forms. Be nice, not threatening and they usually comply if you get them on the phone.
I don't ask for links because I don't want weasels linking to me.
-
Was your site's scraped content already indexed in Google?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Moving content to a new domain
I need to move a lot of content with podcasts and show notes to a new domain. Instead of doing redirects, we want to keep some content on the current domain to retain the link value. There are business reason to keep content on both websites but the new website will primarily be used for SEO moving forward.If we keep the audio portion of the podcast on the old website and move the show notes and the audio portion of the podcast to the new website, is there any issues with duplicate content?Long-term, I presume Google will re-index the old and the new pages, thus no duplicate content, but I want to make sure I'm not missing anything. I was planning to fetch pages in Search Console as we migrate content.Thanks for your help!
Technical SEO | | JimmyFritz0 -
Duplicate content and rel canonicals?
Hi. I have a question relating to 2 sites that I manage with regards to duplicate content. These are 2 separate companies but the content is off a data base from the one(in other words the same). In terms of the rel canonical, how would we do this so that google does not penalise either site but can also have the content to crawl for both or is this just a dream?
Technical SEO | | ProsperoDigital0 -
Google not showing my website ?
The website is medicare.md. if you search for term "medicare doctors PG county maryland" it is #1 in bing and yahoo but not even showing on google.com first TEN pages, although not banned. Interestingly if you do that search on google.co.pk it is #4. Quite Puzzuling !! Would appreciate any help or advice . Sherif Hassan
Technical SEO | | sherohass0 -
Multi Company websites
Hello SEO community ! Hope you'll have some good advice for this project. 🙂 I'm working for a group of companies just starting its SEO experience. Nowadays they have 10 different websites with different names and pretty much the same objectives. So basicly, > Would it be better to gather all website under one adress with subdomains ? They want to display almost the same info, blogs and products.. It make dupplicate content a real pain and Social Media strategy a nightmare. More info: 10 websites for 8 subsidiaries, 1 holding, 1 online shop Each subisdiary has english + its proper language They want regular posts and info updates (blogs, newsletters) They don't have all the same name They all do the same activity Online shop is full a product keywords Ideas: Working on the holding website as mother ship - for branding (social media), actu (blogs), CM (videos, and more)- Displaying the online shop products in all websites (xml) Diplaying blog updates (no full message) via xml on all websites Linking all websites to the blog, shop and holding Tks a lot !
Technical SEO | | AymanH0 -
Crawling and indexing content
If a page element (div, e.g.) is initially hidden and shown only by a hover descriptor or Javascript call, will Google crawl and index it’s content?
Technical SEO | | Mont0 -
Duplicate Content
Hello All, my first web crawl has come back with a duplicate content warning for www.simodal.com and www.simodal.com/index.htm slightly mystified! thanks paul
Technical SEO | | simodal0 -
Solution for duplicate content not working
I'm getting a duplicate content error for: http://www.website.com http://www.website.com/default.htm I searched for the Q&A for the solution and found: Access the.htaccess file and add this line: redirect 301 /default.htm http://www.website.com I added the redirect to my .htaccess and then got the following error from Google when trying to access the http://www.website.com/default.htm page: "This webpage has a redirect loop
Technical SEO | | Joeuspe
The webpage at http://www.webpage.com/ has resulted in too many redirects. Clearing your cookies for this site or allowing third-party cookies may fix the problem. If not, it is possibly a server configuration issue and not a problem with your computer." "Error 310 (net::ERR_TOO_MANY_REDIRECTS): There were too many redirects." How can I correct this? Thanks0 -
Copying Content With Permission
Hi, we received an email about a guy who wants to copy and paste our content on his website, he says he will keep all the links we put there and give us full credit for it, so besides keeping all the links on the page, which is the best way for him to give us the credit? a link to the original article? an special meta tag? what? Thank you PS.Our site its much more authorative than his and we get indexed within 10min from the moment we publish a page, so I don't worry about him out raking us with our own content.
Technical SEO | | andresgmontero0