Homepage not indexed - seems to defy explanation
-
Hey folks
Hoping to get some more eyes on a specific problem I am seeing with a clients site.
Site: http:www.ukjuicers.com
We have checked everything we can think of and the usual suspects here are not present:
- Canonical URL is in place
- Site is shown as indexed in search console
- No Crawl, DNS, Connectivity or server errors
- No robots.txt blocking - verified in search console
- No robots meta tags or directives
- Fetch as Google works
- Fetch & render works
- site command returns all other pages
- info command does not return the homepage
- homepage is cached and cache has been updated since this issue started: http://webcache.googleusercontent.com/search?q=cache:www.ukjuicers.com
- homepage is indexed in yahoo and Bing
- all variations redirect to the www.ukjuicers.com domain (.co.uk, .com, www, sans www etc)
The only issue I found after some extensive digging was some issues with the HTTP and HTTPS versions of the site both being available and both specifying the canonical version as themselves. So, http site used canonicals with http and https site used canonicals with https. So, a conflict there with the canonical exacerbating the problem it is there to solve.
The HTTPS site is not indexed though and we have set this up in webmaster tools and now the web developer has set redirects to ensure all versions even the https now 301 redirect to the http://www.ukjuicers.com page so these canonical issues have been ironed out.
But... it's still not indexing the homepage.
The practical implications of this are quite scary - the site used to be somewhere between 1st and 4th for keywords like 'juicers', 'juicer' etc. Now they are bottom of page 1 or top of page 2 with an internal page. They were jostling with the big boys (amazon, argos, john lewis etc) but now they are right at the bottom of the second page.
It's a strange one - i have seen all manor of technical problems over the years but this one seems to defy sensible explanation. The next step is to do a full technical SEO audit of the site but I am always of the opinion that with many eyes all bugs are shallow so if anyone has any input or experience with odd indexation problems like this would love to get your input.
Cheers
Marcus -
Glad you figured it out. I honestly didn't think it would have been the canonicals. I'm a little surprised that the bots didn't just choose not to respect the suggestion as opposed to blanking your site from the index. Didn't think that was even a possibility from incorrect canonicals. Good to know for the future though in case anything like this comes up with anyone else's site.
-
Yep - it's back. Looks like resolving the canonical issue fixed it. Seems it was a usual suspect after all.
-
Yep - bit of a weird one but in the end looks like the canonicals were the issue. Thanks for taking a look though man - super appreciated.
-
Hey Bernadette - thanks for the feedback. Site is back in the index now, looks like the canonicals were the culprit but the owners are keen for no future issues so I will dig in and take a look at these points. Cheers!
-
Hey folks
24 hours after we identified and fixed the canonical issue the site is now indexed again so it does look like it was indeed a canonical conundrum. Both the HTTP and HTTPS sites were claiming to be the canonical version so in some respects creating a conflict. We removed this conflict and it is now indexed.
Thanks for the extra eyes folks - appreciated and if anyone ever needs another pair of eyes to look a problem give me a shout.
Cheers
Marcus -
Hey Marcus. You just need some links from high authority website like moz:) People say you're indexed so case closed, job done:)
-
I just noticed that clicking on the entire slider, even out to the sides where it appears to be just white space, takes you to another page. At first I didn't realize what I was clicking that got me to the next page. When I do Crtl+A on the page, the full width of the slider images shows highlighted in blue, but to the side of those images outside of those bounds is linked. I'm wondering if Google sees this as cloaking and kicked out the homepage as a result.
*I did see that AGM pointed out it's indexed now, but that's not to say this wasn't the cause of original de-index.
-
As of this writing it looks like the page is indexed. By searching site:ukjuicers.com it comes up in the search results with about 861 other results. Not sure if there is anything you changed to get things working again but it seems to be in their index now.
-
I took a look at all of the usual suspects as well... which amounts to pretty much everything that everyone else mentioned but I was intrigued by this issue and thought maybe another set of eyes might notice something that was off. Nothing was wrong in the page source from what I saw, no issues crawling it myself and I didn't see any penalties. Normally I'd think that if your homepage wasn't appearing for branded organic searches then a penalty was levied against you but when that is the case the homepage is still normally find-able in a Site operator search. M__aybe it is related to all the backlinks that were lost/deleted in the past month but I'm not sure why that would be the case unless removing the homepage from the index was a Penguin response to link issues... but I was under the impression that peguin was devaluing the link source not the link recipient and deleting/removing links seems to be a preferred method of handling penguin-related issues. So if there is a relationship between penguin and your homepage being deindexed then I am not sure at all why nor am I certain how to fix it as I'm not seeing anything in particular that screams "linking issue" at me. (though I only did a fairly cursory inspection of things)
So I am stumped. Whenever the issue is figure out I would love to know how/why this came to be.
-
Marcus, I know this is frustrating. I've checked several things, and looked at many of the possibilities that you've already brought up. I don't have access to the Google Search Console, so I cannot comment about any of that data. I'm assuming that you don't have a manual action on the site or any other messages from Google.
What I've seen in the past is issues with schema markup, especially when it comes to reviews and how they're handled on sites. I'm not saying that this is the issue--but I've seen issues that Google has had with these (especially because there is the word "hidden" there in the code). So, you might look into that some more.
The issue could also be related to links--look at the links to the site's home page to see if there is an issue with low quality links pointing to that page or other unnatural links.
If someone has copied the page, added a canonical tag, and then added a "meta noindex tag" to their page, it's possible that they could have taken your page out of the index. This has happened before.
-
Unfortunately you're not amazon so maybe you must try harder;)
or force to index mainpage with some software or indexer website then wait a while.
I'm thinking about some negative seo made for your mainpage but so far can't see any symptoms.
-
This is a strange one then.... very strange.
Just performed a site: search and like you said it is not showing up as indexed. There is normally something technical to explain an issue like this, but I cannot see anything after looking at your site robots and source code.
-
Hey Krzysztof
Yeah, the page has little textual content but... neither does the amazon homepage. Ultimately the page is a jump in point for all the products and the content suits that. Certainly, I could understand Google not liking the page but would that not result in a reduced rank rather than a complete removal like this?
On the dodgy links front they have never done anything on that front - so anything there would be surprising (or just incidental cruft that is out there on scraper sites and the like).
Super odd.
-
Yep - super odd. 15 years or so in this game and never seen anything quite like this. Transient drops but usually it boiled down to some simple technical error or more often user error cough no index / robots.txt cough
-
Hey - the real issue here is the page is just not indexed. It's not there. Not that another page is a more suitable or preferential result. Ultimately that was the best page for a user to jump in at... The page is not even returned in a brand search so... can't see how any other page could be more suitable for that kind of search.
-
Hi Marcus
The only thing I think it can be the issue is the number of words on mainpage. Mostly I see images and words from menus, links and not main content. Digging deeper can help (seo audit).
This can be a penguin too but to know the answer, full link analysis is needed. After quick glance I see some unnatural links but not in larger number. Maybe they got footprints not visible at once (same ip, c class, content with link etc).
-
You're not kidding, this does defy explanation. When did it drop out of the index?
In all honesty, I don't have a solution, you've already checked everything I would have. I'm mostly commenting so I can keep up with this issue and see how it unfolds. Very curious to see if anyone can identify what's happening here.
-
Hmmm, is it a case of Google simply feels the homepage is not as engaging and relevant in terms of search to your users and they put more emphasis on product pages which it choose to feature instead.
I often find that for key terms our product pages almost always rank higher then the homepage unless a brand only search.
Secondly, is this a recent change? Could the most recent Penguin update have simply resulted in your competitors getting a boost where as before the previous algo was holding them back which has resulted in your position slide.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Regarding Mobile first Indexing
My Site name GiftaLove.com Desktop version - https://www.giftalove.com/
Technical SEO | | Packersmove
Mobile version - https://m.giftalove.com/ How to enable mobile first Indexing in Desktop and Mobile version sites. Not found any message from both sites desktop and mobile version. Please resolve my Issue.0 -
Google is indexing bad URLS
Hi All, The site I am working on is built on Wordpress. The plugin Revolution Slider was downloaded. While no longer utilized, it still remained on the site for some time. This plugin began creating hundreds of URLs containing nothing but code on the page. I noticed these URLs were being indexed by Google. The URLs follow the structure: www.mysite.com/wp-content/uploads/revslider/templates/this-part-changes/ I have done the following to prevent these URLs from being created & indexed: 1. Added a directive in my Htaccess to 404 all of these URLs 2. Blocked /wp-content/uploads/revslider/ in my robots.txt 3. Manually de-inedex each URL using the GSC tool 4. Deleted the plugin However, new URLs still appear in Google's index, despite being blocked by robots.txt and resolving to a 404. Can anyone suggest any next steps? I Thanks!
Technical SEO | | Tom3_150 -
Optimize homepage or category for main keyword?
Hello, we have an advertisements website. Usually in each country we have this one "main" keyword which has the largest amount of searches. Now we have done both ways in different markets. Sometimes optimize the homepage www.example.com for the keyword, sometimes optimize **www.example.com/thekeyword **the category page. I do not have solid data which works better, as Keyword difficulties are too big to compare. The good thing with category is that i would have internal links pointing to it and keyword in the url. Questions: 1. Which do you recommend better?
Technical SEO | | advertisingtech
2. Will there be any benefits in optimizing category vs homepage? Thanks!0 -
Is my page being indexed?
To put you all in context, here is the situation, I have pages that are only accessible via an intern search tool that shows the best results for the request. Let's say i want to see the result on page 2, the page 2 will have a request in the url like this: ?p=2&s=12&lang=1&seed=3688 The situation is that we've disallowed every URL's that contains a "?" in the robots.txt file which means that Google doesn't crawl the page 2,3,4 and so on. If a page is only accessible via page 2, do you think Google will be able to access it? The url of the page is included in the sitemap. Thank you in advance for the help!
Technical SEO | | alexrbrg0 -
Index problems, Part 2
Hi Guy's A few weeks ago i posted a question:
Technical SEO | | Happy-SEO
https://moz.com/community/q/index-problems After some good advice, we changed a few things: www.domain.com <<< NL version www.domain.com/fr/ <<<< French version (domain.com/nl/ 301 redirect to domain.com). So the SERPS for keyword ‘shutters’ went from #32 to #8...... for 2 day's.... and gone.... and not comming back anymore.... Did we missed something? Help is much appreciated, thanks 🙂3 -
Getting a Vanity (Clean) URL indexed
Hello, I have a vanity (clean looking) URL that 302 redirects to the ugly version. So in other words http://www.site.com/url 302 >>> http://www.site.com/directory/directory/url.aspx What I'm trying to do is get the clean version to show up in search. However, for some reason Google only indexes the ugly version. cache:http://www.site.com/directory/directory/url.aspx is showing the ugly URL as cached and cache:http://www.site.com/url is showing not cached at all. Is there some way to force Google to index the clean version? Fetch as Google for the clean URL only returns a redirect status and canonicalizing the ugly to the clean would seem to send a strange message because of the redirect back to the ugly. Any help would be appreciated. Thank you,
Technical SEO | | Digi12340 -
Index quickly a website? (Google,Bing..)
Hi, I would like to know what are the best practices in 2012 to index our website in less than 24 hours? (or less..) Thanks for your answer 😄
Technical SEO | | Probikeshop0 -
Existing Pages in Google Index and Changing URLs
Hi!! I am launching a newly recoded site this week and had a another noobie question. The URL structure has changed slightly and I have installed a 301 redirect to take care of that. I am wondering how Google will handle my "old" pages? Will they just fall out of the index? Or does the 301 redirect tell Google to rewrite the URLs in the index? I am just concerned I may see an "old" page and a "new" page with the same content in the index. Just want to make sure I have covered all my bases. Thanks!! Lynn
Technical SEO | | hiphound0