Homepage indexation issue
-
Hello all,
I've been scratching my head about this one for a while now... Let me explain the situation.
I'm working on a multi-lingual website. Visitors are redirected (301) when they visit the homepage to the correct domain.com/en/default.html, domain.com/nl/default.html, domain.com/fr/default.html or domain.com/de/default.html based on browser language. I have doubts about the impact on the ability for Google to index the website because of that, but that's a problem for another day.
The problem I'm having right now, is that domain.com/nl/default.html, domain.com/de/default.html and domain.com/fr/default.html are all indexed. When I search for the URL in Google I get the correct page on number one so I'm pretty sure those are indexed correctly. When I search for domain/en/default.html though, the homepage appears without /en/default.html extension. Does this mean Google assumes the domain.com page is the same as domain.com/en/default.html even though the redirect that's in place?
Would be great if someone could shed some light on this.
Thanks in advance!
-
Questions
- This type of behavior is considered a temporary redirect. Maybe it's better to think of the name as a conditional redirect. In which case, "Oh, your browser is in FR, with that condition let's send you here..." The 301 is supposed to be used as an unconditional redirect, telling crawlers that you're trying to migrate from URL A to B permanently, so get rid of URL A.
- Not necessarily. VS a 301, yes. but scrapability is mostly down to linking and sitemaps.
- Yup.
- Nope. You'd want to interlink directly to the other languages anyways though in case the 302 doesn't work for whatever reasons. Then the link is passing authority and the user has an option available to them if they'd like to get there on their own.
- Right. It'd be best to interlink with hreflang on each as you never know for certain how someone arrives at those pages. Best to give them and crawlers the guidance to where the other translations reside.
You're welcome! Hopefully that clears it all up for you.
-
Thanks a lot for your answer. The follow up question above also continues on your answer. Would be great to hear your thoughts.
-
Thanks a lot for this one. I have never worked on these kinds of automatic redirects so I thought to stay away from them till I got the indexation issue fixed. But I understand from your answer the two might be related.
Just to clarify and make sure I'm doing the right thing.
The situation at the moment: domain.com -> browser language = EN -> 301 redirect to domain.com/en/default.htm
domain.com -> browser language = NL -> 301 redirect to domain.com/nl/default.htm
domain.com -> browser language = DE -> 301 redirect to domain.com/de/default.htm
domain.com -> browser language = FR -> 301 redirect to domain.com/fr/default.htmIn this situation using the query "site:domain.com", the results include the /de/, /nl/, /fr/ and domain.com, but excludes /en/.
You advise to change this in to: domain.com -> browser language = EN -> 302 redirect to domain.com/en/default.htm
domain.com -> browser language = NL -> 302 redirect to domain.com/nl/default.htm
domain.com -> browser language = DE -> 302 redirect to domain.com/de/default.htm
domain.com -> browser language = FR -> 302 redirect to domain.com/fr/default.htmI need to include a hreflang=x-default on the domain.com page.
A few question that pop-up in my mind:
I always thought a 302 was only used for temporarily redirects?
Does using a 302 improves the scrapability of the website?
Would it possibily result in /en/default.htm be indexed again?
Does a 302 pass along authority?
I assume I need to implement href lang on the /LANGUAGE/default.htm pages as well right?Thanks a lot!
-
As others have mentioned, using a 301 redirect in this situation is not the most efficient use case, especially if you want all versions of your site to be indexed and available to users.
For displaying regional content (e.g. you have the same content, but it is translated to a specific language on different page versions) you would want to use hreflang to tell Google that you have multiple versions and the regions they satisfy. This will allow Google to serve up the regional specific content to users in international Google search engines and Google will index/know which versions are appropriate.
Reference link: https://support.google.com/webmasters/answer/189077
Additional reference: https://support.google.com/webmasters/answer/182192?hl=en#2
-
If you can, get that 301 redirection issue solved first as it's definitely not the type of one you want to use for this behavior. Google specifically recommends, "... to automatically serve the appropriate HTML content to your users depending on their location and language settings. You will either do that by using server-side 302 redirects or by dynamically serving the right HTML content." From here: http://googlewebmastercentral.blogspot.com/2014/05/creating-right-homepage-for-your.html. They go further into the hreflang tags here: https://support.google.com/webmasters/answer/189077?hl=en.
After getting the 301 cleaned up, for finding indexed pages it's better to use the "site:" search based operator in Google, Google Webmaster Tools, and Analytics. But really, get that 301 changed. Cheers!
-
If I understand your post correctly, domain.com/de/default.html has a 301 redirect to correctly direct traffic to the appropriate home page for their language?
If that is what you are saying than the chances are the the domain.com/de/default.html is no longer being indexed because of the 301 redirect. That redirect tells the engines that the page has moved, so they stop crawling it. I am not an expert on redirecting home pages, but maybe look into Ref lang tags and REL canonical tags in place of the 301 redirect.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Recover google INdexing issue after fixing malware attack.
Dear My Niche site attacked by malware on 1 st march 2018. Hacker inject a php file on my blogpage. Injected link like: mydomain.com/blog/dmy4xa.php? Then I scan My site by wordfence. Identifying all malware code.Then manually clean whole site with database. My site is completely free from malware. and remove all malware link from webmaster tools. Even Block my blog page by robots.txt . But new malware link index every week. So i need to remove those link every week. So this issue I decided to rebuild my site. Finally I rebuild my site another server. Then I flash my current server and migrate my site from those server on 10th january 2019 . I wait 1 month to deindex malware link. But new link are indexing every week. I discourage site for over 1 week and even delete site from google webmaster tools with all properties as well as verification file from server. Over 1 week , Link are showing. I feel boar to delete malware link every week. I need permanent solution. Please give me a perfect solution for this malware link index. Google index about 100 url .After that I clean my site with some tools. My site was free from malware. But Ne
Technical SEO | | Gfound1230 -
Mobile site not getting indexed
My site is www.findyogi.com - a shopping comparison site The mobile site is hosted at m.findyogi.com I fixed my sitemap and attribution to mobile site in May last week. My mobile site pages are getting de-indexed since then. Website - www.findyogi.com/mobiles/motorola/motorola-moto-g-16gb-b95ef8/price - indexed Mobile - m.findyogi.com/mobiles/motorola/motorola-moto-g-16gb-b95ef8/price - _not indexed. _ Google is crawling my website and mobile site normally. What am I am doing wrong?
Technical SEO | | namansr0 -
Sitemap indexed pages dropping
About a month ago I noticed my pages indexed from my sitemap are dropping.There are 134 pages in my sitemap and only 11 are indexed. It used to be 117 pages and just died off quickly. I still seem to be getting consistant search traffic but I'm just not sure whats causing this. There are no warnings or manual actions required in GWT that I can find.
Technical SEO | | zenstorageunits0 -
Canonical needed after no index
Hi do you need to point canonical from a subpage to main page if you have already marked a no index on the subpage, like when google is not indexing it so do we need canonicals now as is it passing any juice?
Technical SEO | | razasaeed0 -
How to change noindex to index?
Hey, I've recently upgraded to a pro SEOmoz account and have realised i have 14574 issues to do with 'blocked by meta-robot' and that 'This page is being kept out of the search engine indexes by the meta tag , which may have a value of "noindex", keeping this page out of the index.' How can i change this so my pages get indexed? I read somewhere that i need to change my privacy settings but that thread was 3 years old and now the WP Dashboard has updated.. Please let me know Many thanks, Jamie P.s Im using WordPress 3.5 And i have the XML sitemap plugin And i have no idea where to look for this robots.txt file..
Technical SEO | | markgreggs0 -
Children in this Sitemap index Warnings
Hi, I have just submitted a sitmap for one website. But I am getting this warning: Number of children in this Sitemap index 3
Technical SEO | | knockmyheart
Sitemap contains urls which are blocked by robots.txt.Sitemap: www.zemtube.com/videoscategory-sitemap.xmlValue: http://www.zemtube.com/videoscategory/exclusive/www.zemtube.com/videoscategory-sitemap.xmlValue: http://www.zemtube.com/videoscategory/featured/www.zemtube.com/videoscategory-sitemap.xmlValue: http://www.zemtube.com/videoscategory/other/It is a wordpress website and the robots.txt file is:# Exclude Files From All Robots: User-agent: *
Disallow: /wp-admin/
Disallow: /wp-includes/
Disallow: /tag/ End robots.txt file#I have also tried adding this to the robots.txtSitemap: http://www.zemtube.com/sitemap_index.xmlWebmaster-Tools-Sitemaps-httpwww.zemtube.com_.pdf0 -
How do I eliminate indexed products?
Please help! We got clobbered by Penguin and are at risk of having to close down after 10 years. We have been trying to figure out why and believe now it might be because of duplicate content. We added 2" inserts in March (over 500): http://www.trophycentral.com/inserts1.html Even though each is a different products, SEOMOZ is saying they are considered duplicate content. Given the timing, we think this might be the cause, even though it is totally legitimate. Question - since these are now indexed and since we can't easily add content quickly, what is the best way to handle this situation? A no-index tag? Is there a way to let Google know that their algorithm is detroying legitimate businesses??
Technical SEO | | trophycentraltrophiesandawards0 -
Why googlebot indexing one page, not the other?
Why googlebot indexing one page, not the other in the same conditions? In html sitemap, for example. We have 6 new pages with unique content. Googlebot immediately indexes only 2 pages, and then after sometime the remaining 4 pages. On what parameters the crawler decides to scan or not scan this page?
Technical SEO | | ATCnik0