Homepage indexation issue
-
Hello all,
I've been scratching my head about this one for a while now... Let me explain the situation.
I'm working on a multi-lingual website. Visitors are redirected (301) when they visit the homepage to the correct domain.com/en/default.html, domain.com/nl/default.html, domain.com/fr/default.html or domain.com/de/default.html based on browser language. I have doubts about the impact on the ability for Google to index the website because of that, but that's a problem for another day.
The problem I'm having right now, is that domain.com/nl/default.html, domain.com/de/default.html and domain.com/fr/default.html are all indexed. When I search for the URL in Google I get the correct page on number one so I'm pretty sure those are indexed correctly. When I search for domain/en/default.html though, the homepage appears without /en/default.html extension. Does this mean Google assumes the domain.com page is the same as domain.com/en/default.html even though the redirect that's in place?
Would be great if someone could shed some light on this.
Thanks in advance!
-
Questions
- This type of behavior is considered a temporary redirect. Maybe it's better to think of the name as a conditional redirect. In which case, "Oh, your browser is in FR, with that condition let's send you here..." The 301 is supposed to be used as an unconditional redirect, telling crawlers that you're trying to migrate from URL A to B permanently, so get rid of URL A.
- Not necessarily. VS a 301, yes. but scrapability is mostly down to linking and sitemaps.
- Yup.
- Nope. You'd want to interlink directly to the other languages anyways though in case the 302 doesn't work for whatever reasons. Then the link is passing authority and the user has an option available to them if they'd like to get there on their own.
- Right. It'd be best to interlink with hreflang on each as you never know for certain how someone arrives at those pages. Best to give them and crawlers the guidance to where the other translations reside.
You're welcome! Hopefully that clears it all up for you.
-
Thanks a lot for your answer. The follow up question above also continues on your answer. Would be great to hear your thoughts.
-
Thanks a lot for this one. I have never worked on these kinds of automatic redirects so I thought to stay away from them till I got the indexation issue fixed. But I understand from your answer the two might be related.
Just to clarify and make sure I'm doing the right thing.
The situation at the moment: domain.com -> browser language = EN -> 301 redirect to domain.com/en/default.htm
domain.com -> browser language = NL -> 301 redirect to domain.com/nl/default.htm
domain.com -> browser language = DE -> 301 redirect to domain.com/de/default.htm
domain.com -> browser language = FR -> 301 redirect to domain.com/fr/default.htmIn this situation using the query "site:domain.com", the results include the /de/, /nl/, /fr/ and domain.com, but excludes /en/.
You advise to change this in to: domain.com -> browser language = EN -> 302 redirect to domain.com/en/default.htm
domain.com -> browser language = NL -> 302 redirect to domain.com/nl/default.htm
domain.com -> browser language = DE -> 302 redirect to domain.com/de/default.htm
domain.com -> browser language = FR -> 302 redirect to domain.com/fr/default.htmI need to include a hreflang=x-default on the domain.com page.
A few question that pop-up in my mind:
I always thought a 302 was only used for temporarily redirects?
Does using a 302 improves the scrapability of the website?
Would it possibily result in /en/default.htm be indexed again?
Does a 302 pass along authority?
I assume I need to implement href lang on the /LANGUAGE/default.htm pages as well right?Thanks a lot!
-
As others have mentioned, using a 301 redirect in this situation is not the most efficient use case, especially if you want all versions of your site to be indexed and available to users.
For displaying regional content (e.g. you have the same content, but it is translated to a specific language on different page versions) you would want to use hreflang to tell Google that you have multiple versions and the regions they satisfy. This will allow Google to serve up the regional specific content to users in international Google search engines and Google will index/know which versions are appropriate.
Reference link: https://support.google.com/webmasters/answer/189077
Additional reference: https://support.google.com/webmasters/answer/182192?hl=en#2
-
If you can, get that 301 redirection issue solved first as it's definitely not the type of one you want to use for this behavior. Google specifically recommends, "... to automatically serve the appropriate HTML content to your users depending on their location and language settings. You will either do that by using server-side 302 redirects or by dynamically serving the right HTML content." From here: http://googlewebmastercentral.blogspot.com/2014/05/creating-right-homepage-for-your.html. They go further into the hreflang tags here: https://support.google.com/webmasters/answer/189077?hl=en.
After getting the 301 cleaned up, for finding indexed pages it's better to use the "site:" search based operator in Google, Google Webmaster Tools, and Analytics. But really, get that 301 changed. Cheers!
-
If I understand your post correctly, domain.com/de/default.html has a 301 redirect to correctly direct traffic to the appropriate home page for their language?
If that is what you are saying than the chances are the the domain.com/de/default.html is no longer being indexed because of the 301 redirect. That redirect tells the engines that the page has moved, so they stop crawling it. I am not an expert on redirecting home pages, but maybe look into Ref lang tags and REL canonical tags in place of the 301 redirect.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Is my page being indexed?
To put you all in context, here is the situation, I have pages that are only accessible via an intern search tool that shows the best results for the request. Let's say i want to see the result on page 2, the page 2 will have a request in the url like this: ?p=2&s=12&lang=1&seed=3688 The situation is that we've disallowed every URL's that contains a "?" in the robots.txt file which means that Google doesn't crawl the page 2,3,4 and so on. If a page is only accessible via page 2, do you think Google will be able to access it? The url of the page is included in the sitemap. Thank you in advance for the help!
Technical SEO | | alexrbrg0 -
Why Are Some Pages On A New Domain Not Being Indexed?
Background: A company I am working with recently consolidated content from several existing domains into one new domain. Each of the old domains focused on a vertical and each had a number of product pages and a number of blog pages; these are now in directories on the new domain. For example, what was www.verticaldomainone.com/products/productname is now www.newdomain.com/verticalone/products/product name and the blog posts have moved from www.verticaldomaintwo.com/blog/blogpost to www.newdomain.com/verticaltwo/blog/blogpost. Many of those pages used to rank in the SERPs but they now do not. Investigation so far: Looking at Search Console's crawl stats most of the product pages and blog posts do not appear to be being indexed. This is confirmed by using the site: search modifier, which only returns a couple of products and a couple of blog posts in each vertical. Those pages are not the same as the pages with backlinks pointing directly at them. I've investigated the obvious points without success so far: There are a couple of issues with 301s that I am working with them to rectify but I have checked all pages on the old site and most redirects are in place and working There is currently no HTML or XML sitemap for the new site (this will be put in place soon) but I don't think this is an issue since a few products are being indexed and appearing in SERPs Search Console is returning no crawl errors, manual penalties, or anything else adverse Every product page is linked to from the /course page for the relevant vertical through a followed link. None of the pages have a noindex tag on them and the robots.txt allows all crawlers to access all pages One thing to note is that the site is build using react.js, so all content is within app.js. However this does not appear to affect pages higher up the navigation trees like the /vertical/products pages or the home page. So the question is: "Why might product and blog pages not be indexed on the new domain when they were previously and what can I do about it?"
Technical SEO | | BenjaminMorel0 -
Homepage no longer indexed in Google
Have been working on a site and the hompage has recently vanished from Google. I submit the site to Google webmaster tools a couple of days ago and checked today and the homepage has vanished. There are no no follow tags, and no robots.txt stopping the page from being crawled. It's a bit of a worry, the site is http://www.beyondthedeal.com
Technical SEO | | tonysandwich
Any insights would be massively appreciated! Thanks.0 -
Geotargeting issue
Hi, So ive just starting working on a travel website and noticed that the .com website outranks the com.au in Australian SERPS, even though the .au site has been geotargeted (In GWT) for Australia.I also geotargeted the .com website to Canada (the primary place of business). Is this advisable? Will this affect rankings?
Technical SEO | | Tourman0 -
What to do if my site was De-indexed?
Hello fellow SEOs, I have been doing SEO for about a year now, I'm not expert, but I know enough to get the job done. I'm learning everyday about better techniques. So enough about that... Tonight I noticed that my site has, I believe, been de-indexed. Its a fairly new site, as we just launched it a few days ago and I went in and did all the title tags and meta. I still have to go in to do the h1 and h2 tags...plus add some alt tags and anchor text. Well anyways, after a couple of days after the title tags were implemented. I was propagating all over the place. Using my keyword tool here...I was number on the first page in Google for 71 or the 88 keywords. My new site was just indexed yesterday and thats when i noticed all my keywords. Well today I noticed that I am no where to be found, even if i type in my company's name. PLEASE help me out...any advice would be appreciated. Thank you. p.s. could my competitors could have done something to my site? just wondering... The website is www.eggheadconsultants.com
Technical SEO | | Jegghead1 -
Yahoo and Bing do not index all pages
Only 20% of our pages are indexed by Bing and Yahoo although we have correctly submitted the sitemap to bing webmaster tools and other search engines index all our content. Do you have any suggestions?
Technical SEO | | AEM130 -
Duplicate Content Issue
Hello, We have many pages in our crawler report that are showing duplicate content. However, the content is not duplicateon the pages. It is somewhat close, but different. I am not sure how to fix the problem so it leaves our report. Here is an example. It is showing these as duplicate content to each other. www.soccerstop.com/c-119-womens.aspx www.soccerstop.com/c-120-youth.aspx www.soccerstop.com/c-124-adult.aspx Any help you could provide would be most appreciated. I am going through our crawler report and resolving issues, and this seems to be big one for us with lots in the report, but not sure what to do about it. Thanks
Technical SEO | | SoccerStop
James0 -
Google News not indexing .index.html pages
Hi all, we've been asked by a blog to help them better indexing and ranking on Google News (with the site being already included in Google News with poor results) The blog had a chronicle URL duplication problem with each post existing with 3 different URLs: #1) www.domain.com/post.html (currently in noindex for editorial choices as showing all the comments) #2) www.domain.com/post/index.html (currently indexed showing only top comments) #3) www.domain.com/post/ (very same as #2) We've chosen URL #2 (/index.html) as canonical URL, and included a rel=canonical tag on URL #3 (/) linking to URL #2.
Technical SEO | | H-FARM
Also we've submitted yesterday a Google News sitemap including consistently the list of URLs #2 from the last 48h . The sitemap has been properly "digested" by Google and shows that all URLs have been sent and indexed. However if we use the site:domain.com command on Google News we see something completely different: Google News has indexed actually only some news and more specifically only the URLs #3 type (ending with the trailing slash instead of /index.html). Why ? What's wrong ? a) Does Google News bot have problems indexing URLs ending with .index.html ? While figuring out what's wrong we've found out that http://news.google.it/news/search?aq=f&pz=1&cf=all&ned=us&hl=en&q=inurl%3Aindex.html gives no results...it seems that Google News index overall does not include any URLs ending with /index.html b) Does Google News bot recognise rel=canonical tag ? c) Is it just a matter of time and then Google News will pick up the right URLs (/index.html) and/or shall we communicate Google News team any changes ? d) Any suggestions ? OR Shall we do the other way around. meaning make URL #3 the canonical one ? While Google News is showing these problems, Google Web search has actually well received the changes, so we don't know what to do. Thanks for your help, Matteo0