Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
Pages mirrored on unknown websites (not just content, all the HTML)... blackhat I've never seen before.
-
Someone more expert than me could help... I am not a pro, just doing research on a website... Google Search Console shows many backlinks in pages under unknown domains... this pages are mirroring the pages of the linked website... clicking on a link on the mirror page leads to a spam page with link spam... The homepage of these unknown domain appear just fine... looks like that the domain is partially hijacked... WTF?!
Have you ever seen something likes this?
Can it be an outcome of a previous blackhat activity?
-
The links actually are just in some forms
-
Thanks everyone, I'll do more research and update all
-
One more thing that could be a long shot but is worth keeping an eye on: If someone had zero ethics and wanted to steal your links, one thing they could do is reach out to webmasters that link to you and pretend to be you, and claim that the mirror site is the new version of the website and ask them to change their links. Once complete, they'd redirect the mirror site to their own commercial site.
So, keep an eye on your own link profile and theirs, just to make sure none of those links are changing.
-
We see scraped content all the time. Occasionally we see scraped HTML in full form but slightly edited.
In general I'd do the following:
- Disavow those domains in GSC. Also consider disavowing any domains pointing links at the mirror sites. It could be possible that they are building up junk links to a mirror site which canonicals your site. No idea why anyone would do that but worth covering your bases.
- Submit DMCA notices to Google, their webhost, etc.
- Take a look through server logs for anything screwy. If you see tons of weird traffic from countries that aren't relevant to your business, you might decide to block them with your .htaccess file.
- Certainly make sure nothing has been changed on your own site (canonicals, common Wordpress hacks, etc.), though nothing is suggesting to me that that has happened.
Other than that I would just keep an eye on it and keep an eye on Google Search Console.
-
Hi,
From what I read it looks similar to these cases - https://moz.com/community/q/getting-different-search-queries-in-google-webmaster & http://moz.com/community/q/chinese-site-ranking-for-our-brand-name-possible-hack
In these cases - sites where hacked using a vulnerability in a Wordpress slider - they copied sites they wanted to target on to some of the hacked domains and used other hacked domains to point links to them. At that point they were trying to redirect traffic from genuine e-commerce sites to bogus ones.
You could try to contact the site owners & tell them they have been hacked. At the same time file a complaint @Google
rgds,
Dirk
-
Some basic scraping maybe common, but they have gone out of there way to backlink to you...
Have you checked there metrics on open site explore...? Checked whois?
-
As far as I know website scraping is not uncommon, apart from the backlinks
-
I am honest I am still a little puzzled. But if what you are describing is what is happening. Something very strange is going on. I have not heard of it before.
A totally different domain, that you did not build is linking to your site with the exact copy of a page that it is linked to. If that is the case, your site is being specifically targeted. Someone has to have gone out of there way to copy your site and then re-build it on a different domain.
This is what I would do:-
I would contacting google asap.
Can you find who owns the site on whois?
Check the wayback machine to find out what the site was previously
User open site explorer to track the nature of the links on the other domain. .
And wow... that is unbelievable if it is correct...
-
Yes, sorry but I thought that was clear from the beginning.
If I click to the console links I am directed to the mirrored page in these domains. That's exactly how I saw them, otherwise I coudn't discover them
-
That console part provides clickable links. So that goes back to my original question. Can you click through to the domain and see a mimic copy of your site? Or is it simply showing you a link that clicks to nothing?
-
In the console there is a section called (translating into English) "Links pointing to your site"
Traffic has not changed, in fact these do not appear in GA referrals.
I can see all these domains, either the mirrorer paged, or the actual genuine pages pertaining to them originally. You cannot reach the mirrored pages via the menu in hp
-
Where in google search console are they reported?
Also to be clear, nothing has happened to your site. ie traffic has not changed. However you have found "strange" links in search console? Which do not click through to anything?
Or as I am still not clear - have you found another domain, that mimics your site that you can actually see?
-
Exactly, I am confused too.
Indeed, Goole Search Console reports do list them, but I can't see them in the html
Instance, I find this
<form <span="" class="html-tag" data-mce-mark="1">id="search_mini_form" action="http://mydomain.ext/it/catalogsearch/result/" method="get">
and also images hosted in the scraped websites's domain
Also, these pages apparently are under these domains that in turn appear to be real websites equally unaware of the scam
</form>
-
I am a little confused.
Are you saying that an exact copy of a unique page you have created on your website is on another domain name (which you know nothing about) and then that site is linked to your site, ie via a backlink?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Traffic exchange referral URL's
We have a client who once per month is being hit by easyihts4u.com and it is creating huge increases in their referrals. All the hits go to one page specifically. From the research we have done, this site and others like it, are not spam bots. We cannot understand how they choose sites to target and what good it does for them, or our client to have hits all on one days to one page? We created a filter in analytics to create what we think is a more accurate reflection of traffic. Should be block them at the server level as well?
White Hat / Black Hat SEO | | Teamzig0 -
How authentic is a dynamic footer from bots' perspective?
I have a very meta level question. Well, I was working on dynamic footer for the website: http://www.askme.com/, you can check the same in the footer. Now, if you refresh this page and check the content, you'll be able to see a different combination of the links in every section. I'm calling it a dynamic footer here, as the values are absolutely dynamic in this case. **Why are we doing this? **For every section in the footer, we have X number of links, but we can show only 25 links in each section. Here, the value of X can be greater than 25 as well (let's say X=50). So, I'm randomizing the list of entries I have for a section and then picking 25 elements from it i.e random 25 elements from the list of entries every time you're refreshing the page. Benefits from SEO perspective? This will help me exposing all the URLs to bots (in multiple crawls) and will add page freshness element as well. **What's the problem, if it is? **I'm wondering how bots will treat this as, at any time bot might see us showing different content to bots and something else to users. Will bot consider this as cloaking (a black hat technique)? Or, bots won't consider it as a black hat technique as I'm refreshing the data every single time, even if its bot who's hitting me consecutively twice to understand what I'm doing.
White Hat / Black Hat SEO | | _nitman0 -
Ever seen this tactic when trying to get rid of bad backlinks?
I'm trying to get rid of a Google penalty, but one of the URLS is particularly bizarre. Here's the penalized site: http://www.travelexinsurance.com. One of the external links Google cited as not being natural that links to the penalized site is: http://content.onlineagency.com/index.aspx?site=6599&tide=769006&last=3111516 In the backlink profile of the penalized site, there are about 100 different backlinks pointing to www.travelexinsurance.com from content.onlineagency.com/... So when I visit http://content.onlineagency.com/index.aspx?site=6599&tide=769006&last=3111516 it actually is displaying content from http://www.starmandstravel.com/787115_6599.htm, which you can see after clicking the "Home" button. That company is a legit travel agency who I assume knows nothing about content.onlineagency.com and is not involved in whatever is going on. And that's the case for every link from content.onlineagency.com. So I'm just wondering if someone can help me understand what sort of tactic content.onlineagency.com is using. One of my predecessors I fear used some black hat tactics. I'm wondering if this is a remnant of that effort.
White Hat / Black Hat SEO | | Patrick_G0 -
How do you change the 6 links under your website in Google?
Hello everyone, I have no idea how to ask this question, so I'm going to give it a shot and hopefully someone can help me!! My company is called Eteach, so when you type in Eteach into Google, we come in the top position (phew!) but there are 6 links that appear underneath it (I've added a picture to show what I mean). How do you change these links?? I don't even know what to call them, so if there is a particular name for these then please let me know! They seem to be an organic rank rather than PPC...but if I'm wrong then do correct me! Thanks! zorIsxH.jpg
White Hat / Black Hat SEO | | Eteach_Marketing0 -
Noindexing Thin Content Pages: Good or Bad?
If you have massive pages with super thin content (such as pagination pages) and you noindex them, once they are removed from googles index (and if these pages aren't viewable to the user and/or don't get any traffic) is it smart to completely remove them (404?) or is there any valid reason that they should be kept? If you noindex them, should you keep all URLs in the sitemap so that google will recrawl and notice the noindex tag? If you noindex them, and then remove the sitemap, can Google still recrawl and recognize the noindex tag on their own?
White Hat / Black Hat SEO | | WebServiceConsulting.com0 -
Pages linked with Spam been 301 redirected to 404\. Is it ok
Pl suggest, some pages having some spam links pointed to those pages are been redirected to 404 error page (through 301 redirect) - as removing them manually was not possible due to part of core component of cms and many other coding issue, the only way as advised by developer was making 301 redirect to 404 page. Does by redirecting these pages to 404 page using 301 redirect, will nullify all negative or spam links pointing to them and eventually will remove the resulting spam impact on the site too. Many Thanks
White Hat / Black Hat SEO | | Modi0 -
Rel author and duplicate content
I have a question if a page who a im the only author, my web will duplicate content with the blog posts and the author post as they are the same. ¿what is your suggestion in that case? thanks
White Hat / Black Hat SEO | | maestrosonrisas0 -
Will Google Penalize Content put in a Div with a Scrollbar?
I noticed Moosejaw was adding quite a bit of content to the bottom of category pages via a div tag that makes use of a scroll bar. Could a site be penalized by Google for this technique? Example: http://www.moosejaw.com/moosejaw/shop/search_Patagonia-Clothing____
White Hat / Black Hat SEO | | BrandLabs0