Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
Multilingual Sitemaps
-
Hey there, I have a site with many languages. So here are my questions concerning the sitemaps. The correct way of creating a sitemap for a multilingual site is as followed ( by the official blog of Google )
<urlset xmlns="</span>http://www.sitemaps.org/schemas/sitemap/0.9" xmlns:xhtml="http://www.w3.org/1999/xhtml">
<xhtml:link rel="alternate" hreflang="en" href="</span>http://www.example.com/"/>
<xhtml:link rel="alternate" hreflang="de" href="</span>http://www.example.com/de"/>
<xhtml:link rel="alternate" hreflang="fr" href="</span>http://www.example.com/fr"/><a href=" http:="" www.example.com="" fr"="" target="_blank"></xhtml:link><a href=" http:="" www.example.com="" de"="" target="_blank"></xhtml:link><a href=" http:="" www.example.com="" "="" target="_blank"></xhtml:link><a href=" http:="" www.sitemaps.org="" schemas="" sitemap="" 0.9"="" rel="nofollow" target="_blank"></urlset>
**So here is my first question. My site has over 200.000 pages that all of them support around 5-6 languages. Am I suppose to do this example 200.000 times?****My second question is. My root domain is www.example.com but this one redirects with 301 to www.example.com/en should the sitemap be at ****www.example.com/sitemap.xmlorwww.example.com/en/sitemap.xml ???****My third question is as followed. On WMT do I submit my sitemap in all versions of my site? I have all my languages there.**Thanks in advance for taking the time to respond to this thread and by creating it I hope many people will solve their own questions.
-
Thank you so much for replying to me.
Sorry I’ve just realise I’ve made a mistake in my first comment. We are using .com for our main site and we plan to add subfolders for individual countries in the future. Currently, we only have /row for all the countries outside of the UK that we deliver to.
Thanks again for getting back!
-
Ah, yes! If you have multiple locations, but the same content in each location you would want to submit a sitemap per country-specific area. However, is your domain on .co.uk? If you are trying to target other countries that are not the UK with that ccTLD, you are going to have a hard time as that is specific to the UK.
However, if you are on a gTLD (general domain), and have country-specific folders, you should have a sitemap per country.
-
GHi Kate,
Does your advice on points 2 and 3 also apply to location targeted subfolders? We have a separate subfolder for all of the countries we deliver to outside of the UK at www.example.co.uk/row however the content is in the same language from our uk site.
We have claimed this as a separate property on Google Search Console.
I realise it’s years later but hopefully someone is able to answer this query
I did read your article https://moz.com/blog/multiple-xml-sitemaps-increased-indexation-and-traffic but wasn’t sure if it is the same when trying to target specific countries with a subfolder.
Thanks in advance.
-
Thanks for the shout out Ryan!
Hi Angelos.
1. The short answer is yes, you should do all of those entries. Annoying right?
The best way to go about this is going to be writing a script to do the heavy lifting, but I am not cool enough to tell you how to do that. The idea would be to make one sitemap in one language, and if the URLs are identical except for the language code, then changing up everything for the next language and moving on one at a time.
You should have one main sitemap per language if you can get all of your content into one sitemap. Then have one sitemap index hosted at domain.com/sitemap.xml.
If that's not possible due to the sheer number of pages, do a sitemap index per language that references multiple sitemaps to cover the content in that one language. Then have another sitemap index that references the other indices per language.
2. See above. The main sitemap index should be at domain.com/sitemap.xml, BUT you can have each language sitemap hosted in each subfolder. Example: English sitemap at domain.com/en/sitemap.xml and Spanish sitemap at domain.com/es/sitemap.xml. This requires listing many sitemaps in the main robots.txt file or having a robots file for each subfolder. It's a lot more work than working with sitemap indices.
3. If you have claimed all language subfolders as independent sites in WMT, you can submit the corresponding sitemap. You don't have to put the sitemaps in the subfolders to do this though, you can still use the indices. You also don't have to submit them all individually, but you can and I would as I would want to see the index information in each corresponding account. That's just me though.
Does that all help?
-
Per Google's recommendations here: https://support.google.com/webmasters/answer/2620865?hl=en, yes you want to have pages correctly tagged with their alternate language translations. Per the blog I cited earlier, you'll want to organize the sitemaps to break out the 200.000 pages in a structure that's more refined than just 'all', specifically in ways that will help you find if there are problems creeping up in one section or another. Good luck!
-
It is indeed a great article but certainly does not answer any of my questions
( or i havent read the article correct )
My first question is: Do i have to do this
<loc>http://www.example.com/</loc>
<xhtml:link rel="alternate" hreflang="en" href="<a href=" http:="" www.example.com="" "="" target="_blank">http://www.example.com/"/>
<xhtml:link rel="alternate" hreflang="de" href="<a href=" http:="" www.example.com="" de"="" target="_blank">http://www.example.com/de"/>
<xhtml:link rel="alternate" hreflang="fr" href="<a href=" http:="" www.example.com="" fr"="" target="_blank">http://www.example.com/fr"/></xhtml:link></xhtml:link></xhtml:link>
for all my 200.000 + sitemap pages?
Thanks Ryan for taking the time to answer
-
Kate Morris wrote a nice post on how to break up sitemaps for large sites a few years ago, but it still holds true today: http://moz.com/blog/multiple-xml-sitemaps-increased-indexation-and-traffic, so following the advice there should help on your first question.
Your 301 redirect to English should probably be a 302 and based on browser language settings. Is it possible for anyone to get to a file or folder at www.example.com/whatever...?
Third, see the blog mentioned above. She gets into the details of how to create an Index format for your soon to be many sitemaps. Cheers!
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Pending Sitemaps
Hi, all Wondering if someone could give me a pointer or two, please. I cannot seem to get Google or Bing to crawl my sitemap. If I submit the sitemap in WMT and test it I get a report saying 44,322urls found. However, if I then submit that same sitemap it either says Pending (in old WMT) or Couldn't fetch in the new version. This couldn't fetch is very puzzling as it had no issue fetching the map to test it. My other domains on the same server are fine, the problem is limited to this one site. I have tried several pages on the site using the Fetch as Google tool and they load without issue, however, try as I may, it will not fetch my sitemap. The sitemapindex.xml file won't even submit. I can confirm my sitemaps, although large, work fine, please see the following as an example (minus the spaces, of course, didn't want to submit and make it look like I was just trying to get a link) https:// digitalcatwalk .co.uk/sitemap.xml https:// digitalcatwalk .co.uk/sitemapindex.xml I would welcome any feedback anyone could offer on this, please. It's driving me mad trying to work out what is up. Many thanks, Jeff
Intermediate & Advanced SEO | | wonkydogadmin0 -
Google News Sitemap in Different Languages
Thought I'd ask this question to confirm what I already think. I'm curious that if we're publishing something in two language and both are verified by the publishing center if the group would recommend publishing two separate Google News Sitemaps (one in each language) or publishing one in each language.
Intermediate & Advanced SEO | | mattdinbrooklyn0 -
Sitemap generator which only includes canonical urls
Does anyone know of a 3rd party sitemap generator that will only include the canonical url's? Creating a sitemap with geo and sorting based parameters isn't the most ideal way to generate sitemaps. Please let me know if anyone has any ideas. Mind you we have hundreds of thousands of indexed url's and this can't be done with a simple text editor.
Intermediate & Advanced SEO | | recbrands0 -
Substantial difference between Number of Indexed Pages and Sitemap Pages
Hey there, I am doing a website audit at the moment. I've notices substantial differences in the number of pages indexed (search console), the number of pages in the sitemap and the number I am getting when I crawl the page with screamingfrog (see below). Would those discrepancies concern you? The website and its rankings seems fine otherwise. Total indexed: 2,360 (Search Consule)
Intermediate & Advanced SEO | | Online-Marketing-Guy
About 2,920 results (Google search "site:example.com")
Sitemap: 1,229 URLs
Screemingfrog Spider: 1,352 URLs Cheers,
Jochen0 -
Hreflang in vs. sitemap?
Hi all, I decided to identify alternate language pages of my site via sitemap to save our development team some time. I also like the idea of having leaner markup. However, my site has many alternate language and country page variations, so after creating a sitemap that includes mostly tier 1 and tier 2 level URLs, i now have a sitemap file that's 17mb. I did a couple google searches to see is sitemap file size can ever be an issue and found a discussion or two that suggested keeping the size small and a really old article that recommended keeping it < 10mb. Does the sitemap file size matter? GWT has verified the sitemap and appears to be indexing the URLs fine. Are there any particular benefits to specifying alternate versions of a URL in vs. sitemap? Thanks, -Eugene
Intermediate & Advanced SEO | | eugene_bgb0 -
404 Errors with my RSS Feed/sitemap
In my google webmasters I just started getting 404 errors that I'm not sure how to redirect. I'm getting quite a few that are ending in /feed/ for instance /nyc-accident-injury/feed/
Intermediate & Advanced SEO | | jsmythd
contact-us-thank-you/feed/ and then also a problem with my sitemap I guess? With /site-map/?postsort=tags The domain is pulversthompson.com0 -
Different domains for multilingual website
Hey guys, A site that I'm currently working on as different domains for each website language. So for example: word1word2.com for the english version word3word4.com for the french version word5word6.com for spanish version .... Is it better to move all of the different languages to the same domain and use subfolders for each language /fr/... Please note that the domains being used bring in organic traffic as well as they are EMDs. Thank You.
Intermediate & Advanced SEO | | BruLee0 -
Include Cross Domain Canonical URL's in Sitemap - Yes or No?
I have several sites that have cross domain canonical tags setup on similar pages. I am unsure if these pages that are canonicalized to a different domain should be included in the sitemap. My first thought is no, because I should only include pages in the sitemap that I want indexed. On the other hand, if I include ALL pages on my site in the sitemap, once Google gets to a page that has a cross domain canonical tag, I'm assuming it will just note that and determine if the canonicalized page is the better version. I have yet to see any errors in GWT about this. I have seen errors where I included a 301 redirect in my sitemap file. I suspect its ok, but to me, it seems that Google would rather not find these URL's in a sitemap, have to crawl them time and time again to determine if they are the best page, even though I'm indicating that this page has a similar page that I'd rather have indexed.
Intermediate & Advanced SEO | | WEB-IRS0