Site deindexed after HTTPS migration + possible penalty due to spammy links
-
Hi all, we've recently migrated a site from http to https and saw the majority of pages drop out of the index.
One of the most extreme deindexation problems I've ever seen, but there doesn't appear to be anything obvious on-page which is causing the issue. (Unless I've missed something - please tell me if I have!)
I had initially discounted any off-page issues due to the lack of a manual action in SC, however after looking into their link profile I spotted 100 spammy porn .xyz sites all linking (see example image).
Didn't appear to be any historic disavow files uploaded in the non https SC accounts.
Any on-page suggestions, or just play the waiting game with the new disavow file?
-
Thanks for answering all of my questions!
It's interesting that when I do a simple site:search in Google none of the main pages of your website are appearing. Most of the search results are either archives or comments. Typically, I've seen this kind of thing happen when something goes wrong in the redirects or a site is penalized.
It looks like the big dip in indexation didn't occur until about August. I would think that if you pulled the trigger in June, pages would start dropping out of the index much sooner.
In this case, your theory about a possible penalization might be right. I'd be interested to see what happens once Google considers the disavow file (unfortunately, that will take some time).
Does anyone else have any input or possible reasons why pages on this site have dropped out of the index so quickly?
-
Hi Serge,
Thanks for your input. I've answered your questions below.
- How long ago did you switch to https? - 21st June
- Have you submitted both non-www and www versions of the https site to Google Search Console (GSC)? - Yes
- Have you kept the http versions of your website in GSC? - Yes
- From the looks of it, your sitemap has been updated to reflect the https pages. Have you submitted the updated sitemap to GSC? - Yes - Submitted pages are not matching Indexed pages
- Are there any sitemap errors appearing in GSC? Any other errors? No Sitemap Errors. some 404ing pages.
- Could you attach a screenshot of the indexation rate on both https and http versions of the site from GSC?
- Could you confirm that all redirects were done 1-to-1 and properly redirected? (301s and not 302s) - Confirmed - all tools are reporting 200 status after hitting the 301.
We are still waiting to see some results from submitting the disavow file. So far, no positive movement.
Thanks for your help!
-
Hi there,
There could be a lot of reasons why certain pages of your website are dropping out of your index. Could you answer the following questions to help us narrow down the possible cause?
- How long ago did you switch to https?
- Have you submitted both non-www and www versions of the https site to Google Search Console (GSC)?
- Have you kept the http versions of your website in GSC?
- From the looks of it, your sitemap has been updated to reflect the https pages. Have you submitted the updated sitemap to GSC?
- Are there any sitemap errors appearing in GSC? Any other errors?
- Could you attach a screenshot of the indexation rate on both https and http versions of the site from GSC?
- Could you confirm that all redirects were done 1-to-1 and properly redirected? (301s and not 302s)
Some things that we could rule out:
- It looks like the site isn't using noindex tags in a way that would cause deindexing
- It looks like the robots.txt file isn't disallowing any important paths that would cause deindexation
- The http version of the www and non-www pages redirects to the www, https version of the site which is good
- Canonicals seem to be updated and pointing to the https version of the site
Sorry for all of the questions, I just want to make sure and rule out possible causes to focus in on what the issue could be.
Thanks, Serge
-
Hi!
what information do you seen in search console?
Assuming that you have already tested all of your old URL's and the redirection paths points correctly to the new URLs, does Google Search console indicates any problems with the number of URLs submitted to it?
canoncals? are they in use? pointing to the correct version of the site?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Moving site to new domain without access to redirect from old to new. How can I do this with as little loss to SERP results as possible?
I've been hired to build a new site for a customer. They were duped by some shady characters at goglupe.com (If you can reach them, tell them they are rats--phone is disconnected, address is a comedy club on Mission in SF). Glupe owns the domain name and would not transfer or give FTP access prior to dropping off the face of the earth. The customer doesn't want to chase after them with lawyers, so we are moving on. New domain, new site with much of the same content as previous site. All that I have access to is the old wordpress site. I plan to build the new site, then remove all pages/posts from the old site. Is there anything I can do to salvage the current page 1 ranking? Obviously, the new domain will take some time to get back there. Just hoping to avoid any pitfalls or penalties if I can. If I had complete access, I would follow all the standard guidelines. But I don't. Any thoughts? Thanks! Chris
Intermediate & Advanced SEO | | c_estep_tcbguy0 -
Google cache is for a 3rd parties site for HTTP version and correct for HTTPS
If I search Google for my cache I get the following: cache:http://www.saucydates.com -> Returns the cache of netball.org (HTTPS page with Plesk default page) cache:https://www.saucydates.com -> Displays the correct page Prior to this my http cache was the Central Bank of Afghanistan. For most searches at present my index page is not returned and when it is, it’s the Net Ball Plesk page. This is, of course hurting my search traffic considerably. ** I have tried many things, here is the current list:** If I fetch as Google in webmaster tools the HTTPS fetch and render is correct. If I fetch the HTTP version I get a redirect (which is correct as I have a 301 HTTP to HTTPS redirect). If I turn off HTTPS on my server and remove the redirect the fetch and render for HTTP version is correct. The 301 redirect is controlled with the 301 Safe redirect option in Plesk 12.x The SSL cert is valid and with COMODO I have ensured the IP address (which is shared with a few other domains that form my sites network / functions) has a default site I have placed a site on my PTR record and ensured the HTTPS version goes back to HTTP as it doesn’t need SSL I have checked my site in Waybackwhen for 1 year and there are no hacked redirects I have checked the Netball site in Waybackwhen for 1 year, mid last year there is an odd firewall alert page. If you check the cache for the https version of the netball site you get another sites default plesk page. This happened at the same time I implemented SSL Points 6 and 7 have been done to stop the server showing a Plesk Default page as I think this could be the issue (duplicate content) ** Ideas:** Is this a 302 redirect hi-jack? Is this a Google bug? Is this an issue with duplicate content as both servers can have a default Plesk page (like millions of others!) A network of 3 sites mixed up that have plesk could be a clue? Over to the experts at MOZ, can you help? Thanks, David
Intermediate & Advanced SEO | | dmcubed0 -
Need references to a company that can transition our 1000 page website from Http to Https without breaking our SEO backlinks and site structure
Hi Ya'll I'm looking for a company or independent who can transition our website from http to https. I want to make sure they know what they're doing with a Wordpress website. More importantly, i want to make sure they don't break any seo juice from external sources while internally nothing gets broken. Anyone have any good recommendations? You can reply back or DM me. Best, Shawn
Intermediate & Advanced SEO | | Shawn1240 -
Google Manual Penalty - Unnatural Links FROM My Site - Where?
Hi Mozzers, I've just received a manual penalty for one of my websites. The penalty is for 'unnatural links from my site which I find disturbing because I can't see that anything really wrong with it. The website is www.lighting-tips.co.uk - its a pretty new blog (only 6-7 posts) and whilst I've allowed guest posting I'm being very careful that the content is relevant and good quality. I'm only allowing 1 - 2 links and very few with proper anchor text so I'm wondering what has been done so wrong that I'm getting this manual penalty? Am I missing something here? Thanks in advance. Aaron
Intermediate & Advanced SEO | | AaronGro0 -
Site migration from non canonicalized site
Hi Mozzers - I'm working on a site migration from a non-canonicalized site - I am wondering about the best way to deal with that - should I ask them to canonicalize prior to migration? Many thanks.
Intermediate & Advanced SEO | | McTaggart0 -
Is the Penguin algorithmic penalty on a page basis or a site basis?
Just wondering if there has been any clarification of whether the Penguin algorithmic penalty is on a Page basis or a Site basis? In other words, is it all or nothing?
Intermediate & Advanced SEO | | darkgreenguy0 -
In mobile searches, does Google recognize HTML5 sites as mobile sites?
Does Google recognize HTML5 sites using responsive design as mobile sites? I know that for mobile searches, Google promotes results on mobile sites. I'm trying to determine if my site, created in HTML5 with responsive design falls into that category. Any insights on the topic would be very helpful.
Intermediate & Advanced SEO | | BostonWright0 -
Purchased new site with good SERP ranks, do I operate and build links or redirect the TLD?
I recently purchased a blog within my product category - it has many first page rankings for difficult keywords within my niche. I am wondering if it makes more sense for for me to continue to operate this blog and build links to my site and blog (blog is in wordpress) or to export the XML feed and upload the content to my blog (new site also in wordpress), at which point I would do a 301 at the Top-Level domain. Any thoughts, ideas, or personal experiences would be greatly appreciated. Thanks!
Intermediate & Advanced SEO | | NickEubanks0