Multilingual website - Url problem (sitemap)
-
At this moment our website both uses the language in the url like "en" and localizes the url itself ("books" in english and "boeken" in dutch). Because of the history of making our website multilingual we have a system that takes the browser language for the localization if the url doesn't contain a language like "en".
This means:
- www.test.com/books = browser language
- www.test.com/en/books = english language
- www.test.com/boeken = browser language
- www.test.com/nl/boeken = dutch language
Now for the sitemap this makes it a little troublesome for me because which hreflang is used for which url?
1) The first thing I thought of was using x-default for all urls that get the language of the browser.
<code><url><loc>http://www.test.com/books</loc></url></code>
But as you can see we now got 2 times x-default.
2) Another solution I thought of was just use the localization of the url to determine the language like:
<code><url><loc>http://www.test.com/books</loc></url></code>
But now we got 2 of each language for the same page.
3) The last solution I thought of was removing links without a language in the url (except for the homepage, which will still have an x-default) like:
<code><url><loc>http://www.test.comen/books</loc></url></code>
But for this solution I need to put 301's at pages that are "deleted" and also need to change the system to 301 to the right page. Although the last point isn't really a problem I'm kind of worried that I will lose some of the "seo points" with a 301. (When we changed our domain in the past we had a bad experience with the 301 of our old domain)
What do you think would be the best solution for SEO? Or do you have any other suggestions or solutions I haven't thought of.
-
The best way to go about this is to keep the URLs with the language in the structure. Redirect (301) the ones that don't have it to the ones that do.
However, it sounds like that causes a problem. If the above isn't a possibility, use a canonical from the non-language URL to the one with it. Then do your HREFLANG in sitemaps, and only use the URLs with the language tag in the sitemaps. You can also do the coding on the page, just make sure the HREFLANG tags are not on the non-language pages.
- Example URL: http://www.test.com/boeken would have a canonical tag that points to http://www.test.com/nl/boeken
- Only http://www.test.com/nl/boeken is listed in the sitemaps
- OR Only http://www.test.com/nl/boeken has HREFLANG tags. http://www.test.com/boeken would only have the canonical.
That should solve your problem.
-
From their support page (https://support.google.com/webmasters/answer/189077?hl=en) when you're combining region plus language you'll want to do this:
For example, you may have specific URLs for English speakers in Ireland (en-ie), Canada (en-ca), and Australia (en-au), but want all other English speakers to see your generic English (en) page, and everyone else to see the homepage. In this case you should specify the generic English-language (en) page for searchers in, say, the UK. You can annotate this cluster of pages using a Sitemap file or using HTML link tags like this:
That way you can even have a situation like nl-en or vice versa. I'd use 302s for redirecting based on conditional things like browser settings. Cheers!
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Multistore Sitemap
I use Magento 2 Multistore and have 2 stores set up with identical products, one for the EU and one for the US. The best practice is to allow Google to crawl both sites, but what about the sitemap? Should I only include one store? The reason I ask is that Google has recently started ignoring canonicalized URLs, so even though the second store is canonicalized, it could affect my rank. My rank did drop with the last update when this was rolled out, I stopped some canonicalized URLs from generating and my rank went back up (albeit not as high as before).
International SEO | | moon-boots0 -
Correct Hreflang & Canonical Tags for Multi-Regional Website English Language Only having URL Parameters
Dear friends, We have a multi-regional website in English language only having the country selector on the top of each page and it adds countrycode parameters on each url. Website is built in Magento 1.8 and having 1 store with multiple store views. There is no default store set in Magento as I discussed with developer. Content is same for all the countries and only currency is changed. In navigation there are urls without url parameters but when we change store from any page it add parameters in the url for same page hence there are total 7 URLs. 6 URLs for each page (with country parameters) and 1 master url (without parameters) and making content duplicity. We have implemented hreflang tags on each page with url parameters but for canonical we have implemented master page url as per navigation without url parameters Example on this page. I think this is correct for master page but we should use URL parameters in canonical tags for each counry url too and there should be only 1 canonical tag on each country page url. Currently all the country urls are having master page canoncial tag as per the example. Please correct me if I am wrong and **in this case what has to be done for master page? **as google is indexing the pages without parameters too. We are also using GEOIP redirection for each store with country IP detection and for rest of the countries which are not listed on the website we are redirecting to USA store. Earlier it was 301 but we changed it to 302. Hreflang tags are showing errors in SEMRush due to redirection but in GWT it's OK for some pages it's showing no return tags only. Should I use **x-default tags for hreflang and country selector only on home page like this or should I remove the redirection? **However some of the website like this using redirection but header check tool doesn't show the redirection for this and for our website it shows 302 redirection. Sorry for the long post but looking for your support, please.
International SEO | | spjain810 -
GSC is not showing URLs in Regional Language Characters in Search Analytics
Hi, We have a regional language sub-domain, which has the URI in the regional language characters (Hindi). While looking at the Search Analytics data in GSC, we get all the URLs in ASCII format. To resolve this issue, we even encoded the characters in UTF-8, by adding the following in the of the page: However, we are still getting illegible URLs in search console. It would be really great if someone could help me out with this issue. Thanks!
International SEO | | Starcom_Search0 -
Sitemap for multilanguage website
Hello sorry silly question but prefer to be sure 🙂 I have an international website with different subfolders .com/es .com .com/fr etc All of them have independant sitemap, but i'd like to add in .com/robot.txt a sitemap with language. Do you know how I can do this ? Tks a lot !
International SEO | | AymanH0 -
Url for Turkish, Russian, Chinese, Arabic, Vietnamese and Arabic websites
Hello ! We gonna release our next website with new amazing languages. However I was wondering, is it better to keep the url in English or I can translate them in : Turkish (should be fine) Chinese Arabic Vietnamese Arabic Russian All websites are properly translated but I'm hesitating for the url. Tks a lot !
International SEO | | AymanH0 -
Backlinks to URLs with Language Parameters (for Chinese version of website) and SEO?
Hey all, We run a large eCommerce site in Australia and are preparing to launch to the Chinese market. Our site has been fully converted to Chinese and displays the version of the site detected as default in the user's browser unless they manually select otherwise. This is done by appending the parameter "?la=zh" onto the end of the URL, so for example the Chinese version would be: **www.example.com/australia?la=zh ** This then forces the product catalogue to display the relevant language version. My question is, for SEO purposes and back links in particular, since they aren't really a "true URL" (i.e: strictly speaking they aren't different "pages", just the same page being populated with different characters), would getting links from Chinese websites to the URL "www.example.com/australia?la=zh" really be viewed as any different from just "www.example.com/australia"? Do they pass the same amount of juice and is the difference detected by the engines (thinking mainly about Baidu in particular but of course Google as well)? Feedback from anyone with experience in SEO for multi-lingual sites would be much appreciated, thanks.
International SEO | | ExperienceOz0 -
Can I add more than one XML sitemap in my Robots.txt File?
I have 3 domains all hosted on the same Magento Enterprise Platform. Each domain targets a different county. .co.uk, .com.au, and .com. Can I add all three sitemaps in my robots.txt file? www.example.com/sitemap.xml www.example.com.au/sitemap.xml www.example.co.uk/sitemap.xml
International SEO | | hfranz0 -
Site structure for multi-lingual hotel website (subfolder names)
Hi there superMozers! I´ve read a quite a few questions about multi-lingual sites but none answered my doubt / idea, so here it is: I´m re-designing an old website for a hotel in 4 different languages which are all** hosted on the same .com domain** as follows: example.com/english/ for english example.com/espanol/ for **spanish ** example.com/francais/ for french example.com/portugues/ for portuguese While doing keyword search, I have noticed that many travel agencies separate geographical areas by folders, therefor an **agency pomoting beach hotels in South America **will have a structure as follows: travelagency.com/argentina-beach-hotels/ travelagency.com/peru-beach-hotels/ and they list hotels in each folder, therefor benefiting from those keywords to rank ahead of many independent hotels sites from those areas. What **I would like to **do -rather than just naming those folders with the traditional /en/ for english or /fr/ for french etc- is take advantage of this extra language subfolder to_´include´_ important keywords in the name of the subfolders in the following way (supposing the we have a beach hotel in Argentina): example.com/argentina-beach-hotel/ for english example.com/hotel-playa-argentina/ for **spanish ** example.com/hotel-plage-argentine/ for french example.com/hotel-praia-argentina/ for portuguese Note that the same keywords are used in the name of the folder, but translated into the language the subfolders are. In order to make things clear for the search engines I would specify the language in the html for each page. My doubt is whether google or other search engines may consider this as ´stuffing´ although most travel agencies do it in their site structure. Any Mozers have experience with this, any idea on how search engines may react, or if they could penalise the site? Thanks in advance!
International SEO | | underground0