Multilingual website - Url problem (sitemap)
-
At this moment our website both uses the language in the url like "en" and localizes the url itself ("books" in english and "boeken" in dutch). Because of the history of making our website multilingual we have a system that takes the browser language for the localization if the url doesn't contain a language like "en".
This means:
- www.test.com/books = browser language
- www.test.com/en/books = english language
- www.test.com/boeken = browser language
- www.test.com/nl/boeken = dutch language
Now for the sitemap this makes it a little troublesome for me because which hreflang is used for which url?
1) The first thing I thought of was using x-default for all urls that get the language of the browser.
<code><url><loc>http://www.test.com/books</loc></url></code>
But as you can see we now got 2 times x-default.
2) Another solution I thought of was just use the localization of the url to determine the language like:
<code><url><loc>http://www.test.com/books</loc></url></code>
But now we got 2 of each language for the same page.
3) The last solution I thought of was removing links without a language in the url (except for the homepage, which will still have an x-default) like:
<code><url><loc>http://www.test.comen/books</loc></url></code>
But for this solution I need to put 301's at pages that are "deleted" and also need to change the system to 301 to the right page. Although the last point isn't really a problem I'm kind of worried that I will lose some of the "seo points" with a 301. (When we changed our domain in the past we had a bad experience with the 301 of our old domain)
What do you think would be the best solution for SEO? Or do you have any other suggestions or solutions I haven't thought of.
-
The best way to go about this is to keep the URLs with the language in the structure. Redirect (301) the ones that don't have it to the ones that do.
However, it sounds like that causes a problem. If the above isn't a possibility, use a canonical from the non-language URL to the one with it. Then do your HREFLANG in sitemaps, and only use the URLs with the language tag in the sitemaps. You can also do the coding on the page, just make sure the HREFLANG tags are not on the non-language pages.
- Example URL: http://www.test.com/boeken would have a canonical tag that points to http://www.test.com/nl/boeken
- Only http://www.test.com/nl/boeken is listed in the sitemaps
- OR Only http://www.test.com/nl/boeken has HREFLANG tags. http://www.test.com/boeken would only have the canonical.
That should solve your problem.
-
From their support page (https://support.google.com/webmasters/answer/189077?hl=en) when you're combining region plus language you'll want to do this:
For example, you may have specific URLs for English speakers in Ireland (en-ie), Canada (en-ca), and Australia (en-au), but want all other English speakers to see your generic English (en) page, and everyone else to see the homepage. In this case you should specify the generic English-language (en) page for searchers in, say, the UK. You can annotate this cluster of pages using a Sitemap file or using HTML link tags like this:
That way you can even have a situation like nl-en or vice versa. I'd use 302s for redirecting based on conditional things like browser settings. Cheers!
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Best way to interlink 25 different language versions of a website?
I have a website which has 25 different language versions on 16 different domains. Hreflan are setup to point to different language versions. In the footer we have deeplinks to the 25 language versions. Site is not spammy but in small niche and many language versions have very few other external links. For some time this site had lost rankings for reasons that are unclear till now. I see that large international sites such as booking.com, tripadvisor, apple all use different approaches to interlink their language versions. Interestingly Tripadvisor is nowadays loading the links to their other language versions dynamically only upon click so that these links do not show up in source code, deviating from their former implementation of static deeplinks to all language versions. Matt Cutts mentioned back in 2013 “If you have 50 different sites, I wouldn’t link to all 50 sites down in the footer of your website, because that can start to look pretty spammy to users. Instead you might just link to no more than three or four or five down in the footer, that sort of thing, or have a link to a global page, and the global page can talk about all the different verions and country versions of your website.” But in their webmaster guidelines google recommends: "Consider cross-linking each language version of a page. That way, a French user who lands on the German version of your page can get to the right language version with a single click." I assume for SEO anyway these links have no value, but for user experience it would certainly be better to provide somewhere deeplinks to other language versions. Also the fact that language versions are on different domains and have few external backlinks may increase a bit the risk in our case. I guess in doubt I would prefer to be safe and load deeplinks only upon click same as tripadvisor. Any thoughts/suggestions on best interlinking in our specific case?
International SEO | | lcourse0 -
GSC is not showing URLs in Regional Language Characters in Search Analytics
Hi, We have a regional language sub-domain, which has the URI in the regional language characters (Hindi). While looking at the Search Analytics data in GSC, we get all the URLs in ASCII format. To resolve this issue, we even encoded the characters in UTF-8, by adding the following in the of the page: However, we are still getting illegible URLs in search console. It would be really great if someone could help me out with this issue. Thanks!
International SEO | | Starcom_Search0 -
Can we use similar images on multi regional websites?
I have couple of questions regarding multi regional websites. Right now, We're working on http://www.canvaschamp.com/ and planning to design new website for Australia region. I have read Google's official guidelines regarding duplicate content on following URL. https://support.google.com/webmasters/answer/182192?hl=en Now, I have biggest question regarding images and videos. We're going to use all photo gallery images on Australia website. http://www.canvaschamp.com/canvas-prints We may use similar name, title tag, meta description for all images. So, Will it work for us? I want to save my website from duplicate content penalty. I am looking forward for any additional help which may help me to perform better on different region with similar images.
International SEO | | CommercePundit0 -
Interlinking 60 ccTLD country versions of website
We will be launching 60 new ccTLD country versions of an established core website.
International SEO | | lcourse
Quality, human translated content in 35 languages.
We have a CSS dropdown menu with the 60 country flags and country/language name as anchor text. Formatted as deeplinks to the corresponding pages in the 60 country versions, that we planned to add on each page. How would you recommend to implement the interlinking between these 60 ccTLD? My concerns are: won't we dillute the link value of our links in the main page content too much with links to 60ccTLD on each page? may we trigger a google penalty as the new ccTLD have no other links yet and each page will have the same anchor text and links from exactly the same 60 domains. would you place the country dropdown rather at the bottom of the page (e.g. footer) to avoid that google will not crawl all links in main content page. Any thoughts or suggestions are appreciated1 -
Is it a bad idea to use characters with accents or graves within URLs?
Is there an issue using within the URL for a page words with accents or graves, for example including "Estándares"? Thanks Stuart
International SEO | | mcvicar0 -
Best URL for international domain
Hello SEomozers We have a site with this structure: -www.xyz.com US customers, limited content in12 languages, -global.xyz.com for International customers, full content in 12 languages. The global site is the hub for the rest of the world. We are creating a new site from scratch and want to consolidate the two domains in one by using subfolders, such as: www.xyz.com for USA, www.wyz.com/jp for US customers reading in japanese and www.xyz.com/int-jp for customers from Japan (international - will set geo location with webmastertool) What do you think would be the best naming convention for the international?
International SEO | | FXDD
www.xyz.com/int-jp or
www.xyz.com/eu-jp or
www.xyz.com/eu/jp or
www.xyz.com/ ...any other suggestions?0 -
French Canadian Website and French Language URLs
Hello, One of my clients has a question on a new Quebec, Canada version of their website. The website content and copy is in the French Canadian language, but the IT Director has asked if, for the purpose of SEO, should the URLs be in French as well? So, this questions has two parts... For SEO, should the URL's be in French or left in English, to avoid crawl errors? For visitor UX, is there any reason to have them in French versus English?
International SEO | | Aviatech0 -
Geo-targeting a sub-folder that's had url's rewritten from a sub-domain
I have a client that's setting up a section of his site in a different language, and we're planning to geo-target those pages to that country. I have suggested a sub-folder solution as it's the most cost effective solution, and it will allow domain authority to flow into those pages. His developer is indicating that they can only set this up as a sub-domain, for technical reasons, but they're suggesting they can rewrite the url's to appear as sub folder pages. I'm wondering how this will work in terms of geo-targeting in Google Webmaster Tools. Do I geo-target the sub domain or the sub folder i.e. does Google only see urls or does it physically see those pages on the sub-domain? It seems like it might be a messy solution. Would it be a better idea just to forget about the rewrites and live with the site being a sub domain? Thanks,
International SEO | | Leighm0