Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
Multilingual Sitemaps
-
Hey there, I have a site with many languages. So here are my questions concerning the sitemaps. The correct way of creating a sitemap for a multilingual site is as followed ( by the official blog of Google )
<urlset xmlns="</span>http://www.sitemaps.org/schemas/sitemap/0.9" xmlns:xhtml="http://www.w3.org/1999/xhtml">
<xhtml:link rel="alternate" hreflang="en" href="</span>http://www.example.com/"/>
<xhtml:link rel="alternate" hreflang="de" href="</span>http://www.example.com/de"/>
<xhtml:link rel="alternate" hreflang="fr" href="</span>http://www.example.com/fr"/><a href=" http:="" www.example.com="" fr"="" target="_blank"></xhtml:link><a href=" http:="" www.example.com="" de"="" target="_blank"></xhtml:link><a href=" http:="" www.example.com="" "="" target="_blank"></xhtml:link><a href=" http:="" www.sitemaps.org="" schemas="" sitemap="" 0.9"="" rel="nofollow" target="_blank"></urlset>
**So here is my first question. My site has over 200.000 pages that all of them support around 5-6 languages. Am I suppose to do this example 200.000 times?****My second question is. My root domain is www.example.com but this one redirects with 301 to www.example.com/en should the sitemap be at ****www.example.com/sitemap.xmlorwww.example.com/en/sitemap.xml ???****My third question is as followed. On WMT do I submit my sitemap in all versions of my site? I have all my languages there.**Thanks in advance for taking the time to respond to this thread and by creating it I hope many people will solve their own questions.
-
Thank you so much for replying to me.
Sorry I’ve just realise I’ve made a mistake in my first comment. We are using .com for our main site and we plan to add subfolders for individual countries in the future. Currently, we only have /row for all the countries outside of the UK that we deliver to.
Thanks again for getting back!
-
Ah, yes! If you have multiple locations, but the same content in each location you would want to submit a sitemap per country-specific area. However, is your domain on .co.uk? If you are trying to target other countries that are not the UK with that ccTLD, you are going to have a hard time as that is specific to the UK.
However, if you are on a gTLD (general domain), and have country-specific folders, you should have a sitemap per country.
-
GHi Kate,
Does your advice on points 2 and 3 also apply to location targeted subfolders? We have a separate subfolder for all of the countries we deliver to outside of the UK at www.example.co.uk/row however the content is in the same language from our uk site.
We have claimed this as a separate property on Google Search Console.
I realise it’s years later but hopefully someone is able to answer this query
I did read your article https://moz.com/blog/multiple-xml-sitemaps-increased-indexation-and-traffic but wasn’t sure if it is the same when trying to target specific countries with a subfolder.
Thanks in advance.
-
Thanks for the shout out Ryan!
Hi Angelos.
1. The short answer is yes, you should do all of those entries. Annoying right?
The best way to go about this is going to be writing a script to do the heavy lifting, but I am not cool enough to tell you how to do that. The idea would be to make one sitemap in one language, and if the URLs are identical except for the language code, then changing up everything for the next language and moving on one at a time.
You should have one main sitemap per language if you can get all of your content into one sitemap. Then have one sitemap index hosted at domain.com/sitemap.xml.
If that's not possible due to the sheer number of pages, do a sitemap index per language that references multiple sitemaps to cover the content in that one language. Then have another sitemap index that references the other indices per language.
2. See above. The main sitemap index should be at domain.com/sitemap.xml, BUT you can have each language sitemap hosted in each subfolder. Example: English sitemap at domain.com/en/sitemap.xml and Spanish sitemap at domain.com/es/sitemap.xml. This requires listing many sitemaps in the main robots.txt file or having a robots file for each subfolder. It's a lot more work than working with sitemap indices.
3. If you have claimed all language subfolders as independent sites in WMT, you can submit the corresponding sitemap. You don't have to put the sitemaps in the subfolders to do this though, you can still use the indices. You also don't have to submit them all individually, but you can and I would as I would want to see the index information in each corresponding account. That's just me though.
Does that all help?
-
Per Google's recommendations here: https://support.google.com/webmasters/answer/2620865?hl=en, yes you want to have pages correctly tagged with their alternate language translations. Per the blog I cited earlier, you'll want to organize the sitemaps to break out the 200.000 pages in a structure that's more refined than just 'all', specifically in ways that will help you find if there are problems creeping up in one section or another. Good luck!
-
It is indeed a great article but certainly does not answer any of my questions ( or i havent read the article correct )
My first question is: Do i have to do this
<loc>http://www.example.com/</loc>
<xhtml:link rel="alternate" hreflang="en" href="<a href=" http:="" www.example.com="" "="" target="_blank">http://www.example.com/"/>
<xhtml:link rel="alternate" hreflang="de" href="<a href=" http:="" www.example.com="" de"="" target="_blank">http://www.example.com/de"/>
<xhtml:link rel="alternate" hreflang="fr" href="<a href=" http:="" www.example.com="" fr"="" target="_blank">http://www.example.com/fr"/></xhtml:link></xhtml:link></xhtml:link>
for all my 200.000 + sitemap pages?
Thanks Ryan for taking the time to answer
-
Kate Morris wrote a nice post on how to break up sitemaps for large sites a few years ago, but it still holds true today: http://moz.com/blog/multiple-xml-sitemaps-increased-indexation-and-traffic, so following the advice there should help on your first question.
Your 301 redirect to English should probably be a 302 and based on browser language settings. Is it possible for anyone to get to a file or folder at www.example.com/whatever...?
Third, see the blog mentioned above. She gets into the details of how to create an Index format for your soon to be many sitemaps. Cheers!
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Are HTML Sitemaps Still Effective With "Noindex, Follow"?
A site we're working on has hundreds of thousands of inventory pages that are generally "orphaned" pages. To reach them, you need to do a lot of faceting on the search results page. They appear in our XML sitemaps as well, but I'd still consider these orphan pages. To assist with crawling and indexation, we'd like to create HTML sitemaps to link to these pages. Due to the nature (and categorization) of these products, this would mean we'll be creating thousands of individual HTML sitemap pages, which we're hesitant to put into the index. Would the sitemaps still be effective if we add a noindex, follow meta tag? Does this indicate lower quality content in some way, or will it make no difference in how search engines will handle the links therein?
Intermediate & Advanced SEO | | mothner0 -
Href lang in image or video XML sitemaps
Does anyone know if it is possible/recommended/not recommended to use href lang in image or video XML sitemaps? This had not crossed my mind until recently, but a client asked me this question and I couldn't find any information on this topic.
Intermediate & Advanced SEO | | ChrisKing0 -
301 v/s 302 Redirection on Homepage (Multilingual)
Hello, Our website: http://www.luxresorts.com currently has a default 302 redirection to http://www.luxresorts.com/en. We would like to do a 301 redirection instead of a 302 to http://www.luxresorts.com. Our concern is that the site is multilingual and we wonder what effect would the 301 redirection have on search engine crawlers and how would this appear on SERP. When a search is done on Google.com, the English version of our website appears and when on Google.FR, the French version appears. Would the 301 redirection change the way our website appear on Google? Grateful if you could help us out in understanding the pros and cons/best practices for our concern. Thanks in advance. Tej Luchmun.
Intermediate & Advanced SEO | | luxresorts0 -
Getting a Sitemap for a Subdomain into Webmaster Tools
We have a subdomain that is a Wordpress blog, and it takes days, sometimes weeks for most posts to be indexed. We are using the Yoast plugin for SEO, which creates the sitemap.xml file. The problem is that the sitemap.xml file is located at blog.gallerydirect.com/sitemap.xml, and Webmaster Tools will only allow the insertion of the sitemap as a directory under the gallerydirect.com account. Right now, we have the sitemap listed in the robots.txt file, but I really don't know if Google is finding and parsing the sitemap. As far as I can tell, I have three options, and I'd like to get thoughts on which of the three options is the best choice (that is, unless there's an option I haven't thought of): 1. Create a separate Webmaster Tools account for the blog 2. Copy the blog's sitemap.xml file from blog.gallerydirect.com/sitemap.xml to the main web server and list it as something like gallerydirect.com/blogsitemap.xml, then notify Webmaster Tools of the new sitemap on the galllerydirect.com account 3. Do an .htaccess redirect on the blog server, such as RewriteRule ^sitemap.xml http://gallerydirect.com/blogsitemap_index.xml Then notify Webmaster Tools of the new blog sitemap in the gallerydirect.com account. Suggestions on what would be the best approach to be sure that Google is finding and indexing the blog ASAP?
Intermediate & Advanced SEO | | sbaylor0 -
XML Sitemap index within a XML sitemaps index
We have a similar problem to http://www.seomoz.org/q/can-a-xml-sitemap-index-point-to-other-sitemaps-indexes Can a XML sitemap index point to other sitemaps indexes? According to the "Unique Doll Clothing" example on this link, it seems possible http://www.seomoz.org/blog/multiple-xml-sitemaps-increased-indexation-and-traffic Can someone share an XML Sitemap index within a XML sitemaps index example? We are looking for the format to implement the same on our website.
Intermediate & Advanced SEO | | Lakshdeep0 -
XML Sitemap Index Percentage (Large Sites)
Hi all I'm wanting to find out from those who have experience dealing with large sites (10s/100s of millions of pages). What's a typical (or highest) percentage of indexed pages vs. submitted pages you've seen? This information can be found in webmaster tools where Google shows you the pages submitted & indexed for each of your sitemap. I'm trying to figure out whether, The average index % out there There is a ceiling (i.e. will never reach 100%) It's possible to improve the indexing percentage further Just to give you some background, sitemap index files (according to schema.org) have been implemented to improve crawl efficiency and I'm wanting to find out other ways to improve this further. I've been thinking about looking at the URL parameters to exclude as there are hundreds (e-commerce site) to help Google improve crawl efficiency and utilise the daily crawl quote more effectively to discover pages that have not been discovered yet. However, I'm not sure yet whether this is the best path to take or I'm just flogging a dead horse if there is such a ceiling or if I'm already at the average ballpark for large sites. Any suggestions/insights would be appreciated. Thanks.
Intermediate & Advanced SEO | | danng0 -
Include Cross Domain Canonical URL's in Sitemap - Yes or No?
I have several sites that have cross domain canonical tags setup on similar pages. I am unsure if these pages that are canonicalized to a different domain should be included in the sitemap. My first thought is no, because I should only include pages in the sitemap that I want indexed. On the other hand, if I include ALL pages on my site in the sitemap, once Google gets to a page that has a cross domain canonical tag, I'm assuming it will just note that and determine if the canonicalized page is the better version. I have yet to see any errors in GWT about this. I have seen errors where I included a 301 redirect in my sitemap file. I suspect its ok, but to me, it seems that Google would rather not find these URL's in a sitemap, have to crawl them time and time again to determine if they are the best page, even though I'm indicating that this page has a similar page that I'd rather have indexed.
Intermediate & Advanced SEO | | WEB-IRS0 -
Old deleted sitemap still shown in webmaster tools
Hello I have redisgned a website inl new url structure in cms. Old sitemap was not set to 404 but changed with new sitemap files,also new sitemap was named different to old one.All redirections done properly Still 3 month after google still shows me duplicate titile and metas by comparing old and new urls I am lost in what to do now to eliminate the shown error. How can google show urls that are not shown in sitemap any more? Looking forward to any help Michelles
Intermediate & Advanced SEO | | Tit0