Scraper Advice
-
Hi all,
I know we all deal with scraped content issues. I have one I could use advice on. I found a site that is posting our blog content on their site verbatim, including the links I added to the posts (which is good) and mention our blog home page in a right sidebar beside the content (also good). However, they aren't linking to the specific posts from their copied versions anywhere and their pages canonical back to their versions, not mine.
It's not a very spammy site and has a decent domain authority (though significantly lower than our own). I did a long tail search related to one of posts after discovering it, however, and found their version was outranking the original. I know I can report this one via Webmaster Tools.
I wanted to get your opinion on whether asking them to add a link back to the original post on our site might be sufficient, or do I need to ask for that plus a canonical tag update? I know getting both is ideal, but the links and relationship could be valuable, so I want to leave this particular bridge in tact if I can.
Just trying to decide if I take an "either/or" approach to my request when I mention those two action items, or if I need be a little firmer and ask them to do both and potentially risk losing a potential outlet for future content?
Thanks,
Andrew
-
Don't second-guess on that myth that scrapers can't hurt you. These guys are outranking you right now with your own content. Proof enough to me that Kissmetrics needs to take notice and pull down false information. Also, this is another clear example of Google not knowing how poorly their systsem is working and they know not that they know not.
Google would not be getting millions of DMCAs per week if they were right about this. I've sent them hundreds.
-
Normally, I'd take that harder approach as well. If this was a spammy site that was doing nothing but scraping, I'd definitely be going that route. I still might. I'm trying to see the best way to walk a fine line.
I think #2 on Kissmetrics' 3 Myths About Duplicate Content has me second-guessing myself. If it wasn't a somewhat decent site that has potential to help in terms of referral traffic, it would be a no-brainer.
For the outranking issue, it's weird. For the main term we target, we are top 3 in the SERPs. Change it a little bit and they're ranking, which is the only instance of that I found when testing all posts (5 total).
Thanks for the feedback. I really appreciate it.
-
What you do on your first step will set the tone for how they treat you in the future. So, if you are too liberal now, it will be hard to reign them in and they could start grabbing everything that you own.
If this was my site being grabbed, I would be contacting them to take the content down and be prepared to follow up with an attorney who is already in place for this type of situation, being ready to submit DMCA to Google, Adsense, hosting and more.
If you feel that the relationship could be valuable and have a different philosophy than mine, then I would at minimum insist on the rel=canonical pointing back to the source of the content on my website - and I would require them to ask before they use anything in the future. The fact that they are outranking you with your own content should have you shaking in your boots over this potential relationship. You are making deals with Goliath.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Our clients Magento 2 site has lots of obsolete categories. Advice on SEO best practice for setting server level redirects so I can delete them?
Our client's Magento website has been running for at least a decade, so has a lot of old legacy categories for Brands they no longer carry. We're looking to trim down the amount of unnecessary URL Redirects in Magento, so my question is: Is there a way that is SEO efficient to setup permanent redirects at a server level (nginx) that Google will crawl to allow us at some point to delete the categories and Magento URL Redirects? If this is a good practice can you at some point then delete the server redirects as google has marked them as permanent?
Technical SEO | | Breemcc0 -
noindex, follow for thin content advice
Hello there We struggle with a number of none indexed pages. I want to ask your professional opinion. The robots tag is set up as follows, <meta name='robots' content='noindex, follow' /> those pages haven`t got any value but contain valuable pages.
Technical SEO | | Kingagogomarketing
Is setting up robots name="robots" content="noindex, nofollow" / would be a good solution? Here is the page https://www.lrbconsulting.co.uk/tag/enforcement/page/2/
with noindex robot tag. Please let me know what you think. #noindex, follow for thin content
#noindex, follow
#meta robots set up0 -
Deleting Tags Properly - Advice Needed
I have over 18,000 tags. Needless to say, most of them are relatively useless to the user and generate no traffic, while cluttering the site. (I use Wordpress.) My plan is to delete tags, but I want to do so safely as to not accumulate website errors. (Tags pages are noindexed.) What process should I take here? Here was my basic plan (any help is appreciated). 1. Find irrelevant tags that are connected with hardly any posts. 2. Go into the post, and remove said tag. 3. Now, with a tag having a 'count' of 0, I go into Tags, and delete it. Safe, right? But now it seems those tag pages just turned into 404s "Uh-oh...Page not found!" Where do I go from here? Create 410's? Thanks Mike
Technical SEO | | naturalsociety0 -
Redirects Advice Please
Hi All, I have been approached by someone to look at their website who has seen a rank drop over the last week of around 15 places. On a quick look at their website I have seen what I am imaging could be the culprit as I imagine it will be creating a re-direct loop. However, i am not 100% with these things so would like some others opinions.com They have a wordpress website. There home page lets say https://theirsite.com/ They have an internal page built for a search term https://www.theirsite.com/keyword In wordpress they have set that page in settings to be the homepage. However, I looked on their server and via htaccess they have a 301 redirect from https://www.theirsite.com/keyword to https://www.theirsite.com/ So the questions are: 1. Could this be creating a loop? 2. The redirect was placed around a week before the rank drop. Could this possibly be the cause of the drop? 3. I am assuming that removing the 301 from htaccess is recommended? Thanks in advance for any advice
Technical SEO | | DaleZon0 -
Possible scraper reusing content. Should I be concerned?
I've noticed a few overseas sites seem to be repurposing content from our blog. The process to report for DMCA seems lengthy. Should I be concerned enough to persue this or just write it off as something that happens? Here's an original - http://www.martinsprocket.com/sprocket-sense/sprocket-sense/2015/12/11/free-sprocket-CAD-models Here's an example - http://ptech.in/silica-crushing/free-martin-sprocket-autocad-drawing-download-martin.html Thanks! f9Wfk2h
Technical SEO | | sprockets0 -
Duplicate Content on 2 Sites - Advice
We have one client who has an established eCommerce Site and has created another site which has the exact same content which is about to be launched. We want both sites to be indexed but not be penalised for duplicate content. The sites have different domains The sites have the same host We want the current site to be priority, so the new site would not be ranking higher in SERPs. Any advice on setting up canonical, author tags, alternate link tag etc Thanks Rich
Technical SEO | | SEOLeaders0 -
Well, I need some help, advice, something.
Hey all, I'm new to the SEOmoz thing but I like it so far. I think I have my site listing so messed up that it's effecting my rank. I have 3 domains. 1.) rt112media.com 2.) route112media.com 3.) route112.net. Each domain was purchased through GoDaddy.com and still remain there. I have my own hosting account which I was registered as rt112media.com with route112media.com and route112.net listed as add on domains. Technically, I would like for my main site to be route112media.com for everything. However when I registered the site as rt112media.com I didn't know the issues I would have as far as different domains so I registered with rt112media.com as my main domain name. Anyways, as of now I have rt112media.com as my main domain through my cpanel hosting.I have both domains route112media.com and route112.net set for 301 wildcard redirects to rt112media.com on my hosting account and my GoDaddy account. When I started my WMT account I didn't really know which domain to use cause I figured I could link them all to one. So, I signed up as routet12media.com. After a little while I realized it was not recieving anything because everything was being redirected to rt112media.com Anyways both addresses have been crawled and indexed so they are showing as two. So, I requested to change the route112media.com address to rt112media.com in WMT. That was about 2 weeks ago and it is still pending request. I'm not having further problems with WMT because of the www.rt112media.com vs http://rt112media.com. I am the verified owner of both but I can not switch the www.rt112media account to show the non www. account as the main one because I have the other pending. My site is still being crawled as 2 versions rt112media.com and route112media.com. So what is my best option? And what would be the worst cause scenario if I wanted to start completely over using route112media.com as my main domain with hosting and all. Sorry this was so long I just wanted to explain my situation. I'm lost. Any advice would be appreciated! http:/rt112media.com
Technical SEO | | Route112Media0 -
Joomla 1.7 sef404 advice please
Hi not sure if anyone uses joomla on here but if they do i would love there advice. I have just got a joomla site that is joomla 1.7 but i cannot find a sh404sef component to make the url friendly and would like to know if there is a component out there or if you can use an older one. I would have thought by now that they would have installed one with joomla 1.7 If anyone uses joomla, then i would be glad to hear from you.
Technical SEO | | ClaireH-1848860