Possible scraper reusing content. Should I be concerned?
-
I've noticed a few overseas sites seem to be repurposing content from our blog. The process to report for DMCA seems lengthy. Should I be concerned enough to persue this or just write it off as something that happens?
Here's an original - http://www.martinsprocket.com/sprocket-sense/sprocket-sense/2015/12/11/free-sprocket-CAD-models
Here's an example - http://ptech.in/silica-crushing/free-martin-sprocket-autocad-drawing-download-martin.html
Thanks!
-
Thanks!
-
Thanks so much. I'll see what they can do!
-
Thanks for the response! I'll check these out.
-
This company has lots of similar sites with similar format. All use the chat system, with same operators. Some sites say they are in China, others say they are in India. They know exactly what they are doing. They have been doing it for years. They are flooding the web with your brand name and your products with the hope that it will bring traffic. They steal content to make their sites and are probably knocking-off your products or after your brand delivers visitors they try to sell them a knock-off of your competitor's product.
-
To be honest - best strategy in this case seems to try to contact the site owner.
It looks like a genuine site but if you do the site: command in Google you'll find plenty of strange pages (about minecraft, Ducati club, ...etc) all in the same strange layout as the page you mention. Probably the site got hacked and needs cleaning.
If contacting the owner doesn't help - you can always try file the Spam and/or DMCA report.
Dirk
-
You should be concerned IF scrapper rank higher than your own site.
Meanwhile send report to Google here:
https://docs.google.com/forms/d/14CP_1An9rWKjJ8ZXqxg1gwVt44qTDxHPnXEa_ZGbHBc/viewform?formkey=dGM4TXhIOFd3c1hZR2NHUDN1NmllU0E6MQ&ndplr=1
Sending report didn't guarantee that scrapper will be removed from SERP. If you have copyright infringement send reports too:
https://www.google.com/webmasters/tools/spamreport?hl=en&pli=1
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate content when working with makes and models?
Okay, so I am running a store on Shopify at the address https://www.rhinox-group.com. This store is reasonably new, so being updated constantly! The thing that is really annoying me at the moment though, is I am getting errors in the form of duplicate content. This seems to be because we work using the machine make and model, which is obviously imperative, but then we have various products for each machine make and model. Have we got any suggestions on how I can cut down on these errors, as the last thing I want is being penalised by Google for this! Thanks in advance, Josh
Technical SEO | | josh.sprakes1 -
Duplicate Content Issues with Pagination
Hi Moz Community, We're an eCommerce site so we have a lot of pagination issues but we were able to fix them using the rel=next and rel=prev tags. However, our pages have an option to view 60 items or 180 items at a time. This is now causing duplicate content problems when for example page 2 of the 180 item view is the same as page 4 of the 60 item view. (URL examples below) Wondering if we should just add a canonical tag going to the the main view all page to every page in the paginated series to get ride of this issue. https://www.example.com/gifts/for-the-couple?view=all&n=180&p=2 https://www.example.com/gifts/for-the-couple?view=all&n=60&p=4 Thoughts, ideas or suggestions are welcome. Thanks
Technical SEO | | znotes0 -
Migrate Old Archive Content?
Hi, Our team has recently acquired several newsletter titles from a competitor. We are currently deciding how to handle the archive content on their website which now belongs to us. We are thinking of leaving the content on their site (so as not to suddenly remove a chunk of their website and harm them) but also replicating it on ours with a canoncial link to say our website is the original source. The articles on their site go back as far as 2010. Do you think it would help or hinder our site to have a lot of old archive content added to it? I'm thinking of content freshness issues.Even though the content is old some of it will still be interesting or relevant. Or do you think the authority and extra traffic this content could bring in makes it worth migrating. Any help gratefully received on the old content issue or the idea of using canonical links in this way. Many Thanks
Technical SEO | | frantan0 -
Duplicate Footer Content
A client I just took over is having some duplicate content issues. At the top of each page he has about 200 words of unique content. Below this is are three big tables of text that talks about his services, history, etc. This table is pulled into the middle of every page using php. So, he has the exact same three big table of text across every page. What should I do to eliminate the dup content. I thought about removing the script then just rewriting the table of text on every page... Is there a better solution? Any ideas would be greatly appreciated. Thanks!
Technical SEO | | BigStereo0 -
Duplicat content affecting SEO Rankings
We have one main site called buypropertyanywhere, it is a database it holds all the data for all our property websites. One of our most popular sites is housesalesbulgaria, which takes the data from buypropertyanywhere in regards to bulgarian property and display it. The same with housesalesturkey it takes the data in regards to turkish property and display it. We think because buypropertyanywhere and housesalesbulgaria has the same data it has high duplicate content . We think this is affecting the SEO rankings for housesalesbulgaria. Google is looking at housesalesbulgaria as if a copy of buypropertyanywhere. So therefore should we SEO buypropertyanywhere soley and link it to housesalesbulgaria through the articles and content we put on the site. Thanks in advance for any advice.
Technical SEO | | Feily0 -
Worpress Tags Duplicate Content
I just fixed a tags duplicate content issue. I have noindexed the tags. Was wondering if anyone has ever fixed this issue and how long did it take you to recover from it? Just kind of want to know for a piece of mind.
Technical SEO | | deaddogdesign0 -
How to tell if PDF content is being indexed?
I've searched extensively for this, but could not find a definitive answer. We recently updated our website and it contains links to about 30 PDF data sheets. I want to determine if the text from these PDFs is being archived by search engines. When I do this search http://bit.ly/rRYJPe (google - site:www.gamma-sci.com and filetype:pdf) I can see that the PDF urls are getting indexed, but does that mean that their content is getting indexed? I have read in other posts/places that if you can copy text from a PDF and paste it that means Google can index the content. When I try this with PDFs from our site I cannot copy text, but I was told that these PDFs were all created from Word docs, so they should be indexable, correct? Since WordPress has you upload PDFs like they are an image could this be causing the problem? Would it make sense to take the time and extract all of the PDF content to html? Thanks for any assistance, this has been driving me crazy.
Technical SEO | | zazo0 -
Strange duplicate content issue
Hi there, SEOmoz crawler has identified a set of duplicate content that we are struggling to resolve. For example, the crawler picked up that this page www. creative - choices.co.uk/industry-insight/article/Advice-for-a-freelance-career is a duplicate of this page www. creative - choices.co.uk/develop-your-career/article/Advice-for-a-freelance-career. The latter page's content is the original and can be found in the CMS admin area whilst the former page is the duplicate and has no entry in the CMS. So we don't know where to begin if the "duplicate" page doesn't exist in the CMS. The crawler states that this page www. creative-choices.co.uk/industry-insight/inside/creative-writing is the referrer page. Looking at it, only the original page's link is showing on the referrer page, so how did the crawler get to the duplicate page?
Technical SEO | | CreativeChoices0