Pages mirrored on unknown websites (not just content, all the HTML)... blackhat I've never seen before.
-
Someone more expert than me could help... I am not a pro, just doing research on a website... Google Search Console shows many backlinks in pages under unknown domains... this pages are mirroring the pages of the linked website... clicking on a link on the mirror page leads to a spam page with link spam... The homepage of these unknown domain appear just fine... looks like that the domain is partially hijacked... WTF?!
Have you ever seen something likes this?
Can it be an outcome of a previous blackhat activity?
-
The links actually are just in some forms
-
Thanks everyone, I'll do more research and update all
-
One more thing that could be a long shot but is worth keeping an eye on: If someone had zero ethics and wanted to steal your links, one thing they could do is reach out to webmasters that link to you and pretend to be you, and claim that the mirror site is the new version of the website and ask them to change their links. Once complete, they'd redirect the mirror site to their own commercial site.
So, keep an eye on your own link profile and theirs, just to make sure none of those links are changing.
-
We see scraped content all the time. Occasionally we see scraped HTML in full form but slightly edited.
In general I'd do the following:
- Disavow those domains in GSC. Also consider disavowing any domains pointing links at the mirror sites. It could be possible that they are building up junk links to a mirror site which canonicals your site. No idea why anyone would do that but worth covering your bases.
- Submit DMCA notices to Google, their webhost, etc.
- Take a look through server logs for anything screwy. If you see tons of weird traffic from countries that aren't relevant to your business, you might decide to block them with your .htaccess file.
- Certainly make sure nothing has been changed on your own site (canonicals, common Wordpress hacks, etc.), though nothing is suggesting to me that that has happened.
Other than that I would just keep an eye on it and keep an eye on Google Search Console.
-
Hi,
From what I read it looks similar to these cases - https://moz.com/community/q/getting-different-search-queries-in-google-webmaster & http://moz.com/community/q/chinese-site-ranking-for-our-brand-name-possible-hack
In these cases - sites where hacked using a vulnerability in a Wordpress slider - they copied sites they wanted to target on to some of the hacked domains and used other hacked domains to point links to them. At that point they were trying to redirect traffic from genuine e-commerce sites to bogus ones.
You could try to contact the site owners & tell them they have been hacked. At the same time file a complaint @Google
rgds,
Dirk
-
Some basic scraping maybe common, but they have gone out of there way to backlink to you...
Have you checked there metrics on open site explore...? Checked whois?
-
As far as I know website scraping is not uncommon, apart from the backlinks
-
I am honest I am still a little puzzled. But if what you are describing is what is happening. Something very strange is going on. I have not heard of it before.
A totally different domain, that you did not build is linking to your site with the exact copy of a page that it is linked to. If that is the case, your site is being specifically targeted. Someone has to have gone out of there way to copy your site and then re-build it on a different domain.
This is what I would do:-
I would contacting google asap.
Can you find who owns the site on whois?
Check the wayback machine to find out what the site was previously
User open site explorer to track the nature of the links on the other domain. .
And wow... that is unbelievable if it is correct...
-
Yes, sorry but I thought that was clear from the beginning.
If I click to the console links I am directed to the mirrored page in these domains. That's exactly how I saw them, otherwise I coudn't discover them
-
That console part provides clickable links. So that goes back to my original question. Can you click through to the domain and see a mimic copy of your site? Or is it simply showing you a link that clicks to nothing?
-
In the console there is a section called (translating into English) "Links pointing to your site"
Traffic has not changed, in fact these do not appear in GA referrals.
I can see all these domains, either the mirrorer paged, or the actual genuine pages pertaining to them originally. You cannot reach the mirrored pages via the menu in hp
-
Where in google search console are they reported?
Also to be clear, nothing has happened to your site. ie traffic has not changed. However you have found "strange" links in search console? Which do not click through to anything?
Or as I am still not clear - have you found another domain, that mimics your site that you can actually see?
-
Exactly, I am confused too.
Indeed, Goole Search Console reports do list them, but I can't see them in the html
Instance, I find this
<form <span="" class="html-tag" data-mce-mark="1">id="search_mini_form" action="http://mydomain.ext/it/catalogsearch/result/" method="get">
and also images hosted in the scraped websites's domain
Also, these pages apparently are under these domains that in turn appear to be real websites equally unaware of the scam
</form>
-
I am a little confused.
Are you saying that an exact copy of a unique page you have created on your website is on another domain name (which you know nothing about) and then that site is linked to your site, ie via a backlink?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
It's possible a bounce-rate attack manipulate SEO?
My site has been visited by unusual users with one second session times. This leaves my analytics data confused.
White Hat / Black Hat SEO | | CompraBit0 -
Permanently Moving Few High Ranking Pages from One Domain to Another
We are planning to move few high ranking pages permanently (301 Permanent Redirection) to another domain, Currently these pages are getting good traffic from organic search and ranking on top positions in Google search engine result pages. We have few questions in our mind right now, It would be a great help if anyone can answer following questions; Is it possible to move few pages from one domain to another by using 301 Redirection in .htaccess file? Will it have any negative impact on our website's current search engine performance? Will it be considered as a legitimate SEO practice by Google Search Engine? Will Google understand that these pages have been moved permanently to another domain and start showing URL's from the new domain on the same positions where they were ranking before moving to new location?
White Hat / Black Hat SEO | | tigersohelll0 -
Website not moving?
We run a printing website www.fastprint.co.uk and have built a few decent tools such as http://www.fastprint.co.uk/adobe-shortcut-mapper/ and decent infographics such as http://www.fastprint.co.uk/blog/the-art-of-mixing-typefaces.html and had a fair few decent links from website over the course of the last 1 1/2 but we do not seem to be moving very far? If you take our site on sem rush (a decent percentage of our site traffic is through the above tools or decent blog posts so the number would be lower for E-commerce) http://www.semrush.com/uk/info/fastprint.co.uk+(by+organic)?sort=volume_desc in comparison to a few others http://www.semrush.com/uk/info/banana-print.co.uk+(by+organic)] http://www.semrush.com/uk/info/brunelone.com+(by+organic) Especially this site http://www.semrush.com/uk/info/instantprint.co.uk+(by+organic) I just don't get what we are doing wrong?
White Hat / Black Hat SEO | | BobAnderson0 -
Suggest me a best plan for linking building chart for small static website.
Hi Everyone, Can any one suggest me a clear idea for off page link building chart i.e) Our page is a 24 page website we like to plan for off page activity like bookmarking, classifieds, directory bla bla bla. So how many links we are supposed to post and in how much day time gap example: 15 Links in bookmarking, 10 links in classified, weekly one article submission, after one week the same cycle goes on.....
White Hat / Black Hat SEO | | dineshmap0 -
Whether letting an old category just 404 out is OK
Hello, We've got some hidden categories that are still indexed in the search engines. If there are no links to these hidden categories, can we just let them 404 out and be OK SEO wise? We can't 301 redirect them. It's about 50 categories.
White Hat / Black Hat SEO | | BobGW0 -
Blackhat Winners after Penguin 2.0
I know I'm not the only one that's seen this. After Penguin 2.0 some obvious blackhat SEOed sites flew up in the rankings. There's obviously a hole that hasn't been closed. I'm surprised it's been a month and that hole still hasn't been patched. I have no problem with other legit companies out ranking ours for various keywords. In that case I can feel alright knowing it's just something they were able to do that I wasn't but when I see complete blackhat sites ranking that's a whole different story. Estimated traffic before and after Penguin 2.0: http://goo.gl/gurXt What are they doing that's blackhat? Hidden text - compare the cached version vs. the live http://goo.gl/YYGDK 301ing lots of domains, many irrelevant. http://goo.gl/RjOJu Using a trade marked brand (steelers) - not SEO related but I'm sure the NFL wouldn't be happy. Linking between other domains they own. Notice how spammy these sites are. http://pittsburghwebdevelopment.org/2013/06/23/website-development-firm-website-design-pittsburgh/ http://seoinpgh.com/2013/06/23/website-designer-pittsburgh-affordable-web-design-in-pittsburgh-pa/ They were inflating their social presence. Wanted to show you but looks like twitter already took care of them https://twitter.com/seopittsburgh . Also making client sites link to them . http://pittsburghpaplumbing.com/2013/06/19/pittsburgh-plumbersplumbers-in-pittsburgh-paplumber-pittsburgh/ I've talked to other people and they've seen similar things. Thoughts, opinions? Can you find one good reason why this site would rank well for a competitive phrase?
White Hat / Black Hat SEO | | eyeflow0 -
Redirecting doesn't rank on google
We are redirecting our artist's official website to copenhagenbeta.dk. We have two artists (Nik & Jay and Burhan G) that top ranks on Google (first on page 1), but one of them (Lukas Graham) doesn't rank at all. We use the same procedure with all artists. http://copenhagenbeta.dk/index.php?option=com_artistdetail&task=biography&type=overview&id=49 Doesn't rank but the old artist page still does. Is it the old page that tricks Google to think that this is the active page for the artist?
White Hat / Black Hat SEO | | Morten_Hjort0 -
Herbal Viagra page same DA/PA as UC Berkeley??
Either there is some amazingly good SEO work going on here, or Google has an amazingly large hole in their metrics. http://nottowait.com/ http://www.ucdavis.edu/index.html The "nottowait" page has a PA of 85?! and a DA of 82?! The page is HORRIBLE. The page itself is an image of another page. The nav bar does not function, nor does any of the "click here" links. At the bottom there is a paragraph of keywords and broken english. This page is pure junk and should simply not have any value at all with respect to DA nor PA. It has a ton of incoming links from various sources which seem to be the source of all this value, which it passes on to other pages. This page really is an affront to the "content is king" concept. I suppose I should ask a question but all I can think of is, what is Matt Cutts' phone number? I want to ask him how this page has gotten away with being ranked so well for so long.
White Hat / Black Hat SEO | | RyanKent0