Multilingual Sitemap with some non-matching URLs
-
The website has two languages, English (.com/en-int/) and French (.com/fr-fr/).
Some pages exist only in French and some only in English, but many are a 1-to-1 match. So my question is: in the multilingual sitemap, should I include only the URLs that have alternates, and then create a 2nd sitemap for all non-matching URLs?
Or should I have 3 sitemaps: 1) a multilingual sitemap for all matching URLs (https://support.google.com/webmasters/answer/26208650), 2) an English sitemap for only the URLs not included in the multilingual sitemap, and 3) a French sitemap for only the URLs not included in the multilingual sitemap, and then create a sitemap index file that links to all 3 sitemaps?
-
Thanks for your response, Oleg. But if I separate matching URLs into different sitemaps, doesn't that go against Google's recommendation to include them in 1 multilingual sitemap?
-
You should have 1 sitemap index and 2 sitemaps: 1 for English and 1 for French, with the index linking to both.
If a page has an equivalent in the other language, include the rel="alternate" metadata for it. If not, don't include any alternate metadata (just the generic tags: loc, changefreq, priority).
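To make that layout concrete, here is a minimal Python sketch of one sitemap per language plus an index. Everything specific in it is an assumption for illustration: the domain www.example.com, the page paths, the sitemap file names, and the hreflang codes "en" and "fr" (the codes you actually use should match how you target each audience). Matched pages get xhtml:link alternates listing every language version, including themselves; unmatched pages get only a loc.

```python
import xml.etree.ElementTree as ET

SM_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
XHTML_NS = "http://www.w3.org/1999/xhtml"
ET.register_namespace("", SM_NS)
ET.register_namespace("xhtml", XHTML_NS)

BASE = "https://www.example.com"  # hypothetical domain

# Hypothetical page inventory: each entry maps an hreflang code to a path.
PAGES = [
    {"en": "/en-int/about/", "fr": "/fr-fr/a-propos/"},  # 1-to-1 match
    {"en": "/en-int/careers/"},                          # English only
    {"fr": "/fr-fr/mentions-legales/"},                  # French only
]


def build_sitemap(lang):
    """Return a <urlset> holding every page that exists in `lang`."""
    urlset = ET.Element(f"{{{SM_NS}}}urlset")
    for page in PAGES:
        if lang not in page:
            continue
        url = ET.SubElement(urlset, f"{{{SM_NS}}}url")
        ET.SubElement(url, f"{{{SM_NS}}}loc").text = BASE + page[lang]
        # Only pages with an equivalent get alternates; each matched page
        # lists all of its language versions, itself included.
        if len(page) > 1:
            for hreflang, path in page.items():
                ET.SubElement(
                    url,
                    f"{{{XHTML_NS}}}link",
                    {"rel": "alternate", "hreflang": hreflang, "href": BASE + path},
                )
    return urlset


def build_index(sitemap_urls):
    """Return a <sitemapindex> that links to each per-language sitemap."""
    index = ET.Element(f"{{{SM_NS}}}sitemapindex")
    for sitemap_url in sitemap_urls:
        entry = ET.SubElement(index, f"{{{SM_NS}}}sitemap")
        ET.SubElement(entry, f"{{{SM_NS}}}loc").text = sitemap_url
    return index


if __name__ == "__main__":
    # File names below are hypothetical.
    for lang in ("en", "fr"):
        print(ET.tostring(build_sitemap(lang), encoding="unicode"))
    index = build_index([f"{BASE}/sitemap-en.xml", f"{BASE}/sitemap-fr.xml"])
    print(ET.tostring(index, encoding="unicode"))
```

Because the alternates are declared per URL, a matched pair still carries its full annotation even though the English and French URLs sit in different files.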