Multilingual Sitemaps
-
Hey there, I have a site with many languages. So here are my questions concerning the sitemaps. The correct way of creating a sitemap for a multilingual site is as followed ( by the official blog of Google )
<urlset xmlns="</span>http://www.sitemaps.org/schemas/sitemap/0.9" xmlns:xhtml="http://www.w3.org/1999/xhtml">
<xhtml:link rel="alternate" hreflang="en" href="</span>http://www.example.com/"/>
<xhtml:link rel="alternate" hreflang="de" href="</span>http://www.example.com/de"/>
<xhtml:link rel="alternate" hreflang="fr" href="</span>http://www.example.com/fr"/><a href=" http:="" www.example.com="" fr"="" target="_blank"></xhtml:link><a href=" http:="" www.example.com="" de"="" target="_blank"></xhtml:link><a href=" http:="" www.example.com="" "="" target="_blank"></xhtml:link><a href=" http:="" www.sitemaps.org="" schemas="" sitemap="" 0.9"="" rel="nofollow" target="_blank"></urlset>
**So here is my first question. My site has over 200.000 pages that all of them support around 5-6 languages. Am I suppose to do this example 200.000 times?****My second question is. My root domain is www.example.com but this one redirects with 301 to www.example.com/en should the sitemap be at ****www.example.com/sitemap.xmlorwww.example.com/en/sitemap.xml ???****My third question is as followed. On WMT do I submit my sitemap in all versions of my site? I have all my languages there.**Thanks in advance for taking the time to respond to this thread and by creating it I hope many people will solve their own questions.
-
Thank you so much for replying to me.
Sorry I’ve just realise I’ve made a mistake in my first comment. We are using .com for our main site and we plan to add subfolders for individual countries in the future. Currently, we only have /row for all the countries outside of the UK that we deliver to.
Thanks again for getting back!
-
Ah, yes! If you have multiple locations, but the same content in each location you would want to submit a sitemap per country-specific area. However, is your domain on .co.uk? If you are trying to target other countries that are not the UK with that ccTLD, you are going to have a hard time as that is specific to the UK.
However, if you are on a gTLD (general domain), and have country-specific folders, you should have a sitemap per country.
-
GHi Kate,
Does your advice on points 2 and 3 also apply to location targeted subfolders? We have a separate subfolder for all of the countries we deliver to outside of the UK at www.example.co.uk/row however the content is in the same language from our uk site.
We have claimed this as a separate property on Google Search Console.
I realise it’s years later but hopefully someone is able to answer this query
I did read your article https://moz.com/blog/multiple-xml-sitemaps-increased-indexation-and-traffic but wasn’t sure if it is the same when trying to target specific countries with a subfolder.
Thanks in advance.
-
Thanks for the shout out Ryan!
Hi Angelos.
1. The short answer is yes, you should do all of those entries. Annoying right?
The best way to go about this is going to be writing a script to do the heavy lifting, but I am not cool enough to tell you how to do that. The idea would be to make one sitemap in one language, and if the URLs are identical except for the language code, then changing up everything for the next language and moving on one at a time.
You should have one main sitemap per language if you can get all of your content into one sitemap. Then have one sitemap index hosted at domain.com/sitemap.xml.
If that's not possible due to the sheer number of pages, do a sitemap index per language that references multiple sitemaps to cover the content in that one language. Then have another sitemap index that references the other indices per language.
2. See above. The main sitemap index should be at domain.com/sitemap.xml, BUT you can have each language sitemap hosted in each subfolder. Example: English sitemap at domain.com/en/sitemap.xml and Spanish sitemap at domain.com/es/sitemap.xml. This requires listing many sitemaps in the main robots.txt file or having a robots file for each subfolder. It's a lot more work than working with sitemap indices.
3. If you have claimed all language subfolders as independent sites in WMT, you can submit the corresponding sitemap. You don't have to put the sitemaps in the subfolders to do this though, you can still use the indices. You also don't have to submit them all individually, but you can and I would as I would want to see the index information in each corresponding account. That's just me though.
Does that all help?
-
Per Google's recommendations here: https://support.google.com/webmasters/answer/2620865?hl=en, yes you want to have pages correctly tagged with their alternate language translations. Per the blog I cited earlier, you'll want to organize the sitemaps to break out the 200.000 pages in a structure that's more refined than just 'all', specifically in ways that will help you find if there are problems creeping up in one section or another. Good luck!
-
It is indeed a great article but certainly does not answer any of my questions
( or i havent read the article correct )
My first question is: Do i have to do this
<loc>http://www.example.com/</loc>
<xhtml:link rel="alternate" hreflang="en" href="<a href=" http:="" www.example.com="" "="" target="_blank">http://www.example.com/"/>
<xhtml:link rel="alternate" hreflang="de" href="<a href=" http:="" www.example.com="" de"="" target="_blank">http://www.example.com/de"/>
<xhtml:link rel="alternate" hreflang="fr" href="<a href=" http:="" www.example.com="" fr"="" target="_blank">http://www.example.com/fr"/></xhtml:link></xhtml:link></xhtml:link>
for all my 200.000 + sitemap pages?
Thanks Ryan for taking the time to answer
-
Kate Morris wrote a nice post on how to break up sitemaps for large sites a few years ago, but it still holds true today: http://moz.com/blog/multiple-xml-sitemaps-increased-indexation-and-traffic, so following the advice there should help on your first question.
Your 301 redirect to English should probably be a 302 and based on browser language settings. Is it possible for anyone to get to a file or folder at www.example.com/whatever...?
Third, see the blog mentioned above. She gets into the details of how to create an Index format for your soon to be many sitemaps. Cheers!
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Sitemap - What are the recommendations on the number of links
Hi, I have a sitemap(s) which is very large(.i.e. 60000) links, is it recommended to have so many links and how come when I do a site search(site:mydomain) the number of links are less than on my site map?
Intermediate & Advanced SEO | | FreddyKgapza0 -
Changing sitemaps in console
Hi there, Does anyone have any experience submitting a completely new sitemap structure - including URLs - to google console? We've changed our sitemap plug in, so rather than /sitemap-index.xml, our main sitemap home is /sitemap.xml (as an example). Is it better to 410 the old ones or 301 redirect them to the new sitemaps? If 301, what do we do about sitemaps that don't completely correlate - what was divided into item1.xml, item2.xml is now by date so items-from-2015.xml, items-from-2016.xml and so on. On a related note, am I right in thinking that there's no longer a "delete/ remove sitemap" option on console? In which case, what happens to the old ones which will now 404? Thanks anyone for any insight you may have 🙂
Intermediate & Advanced SEO | | Fubra0 -
What should my main sitemap URL be?
Hi Mozzers - regarding the URL of a website's main website: http://example.com/sitemap.xml is the normal way of doing it but would it matter if I varied this to: http://example.com/mainsitemapxml.xml or similar? I can't imagine it would matter but I have never moved away from the former before - and one of my clients doesn't want to format the URL in that way. What the client is doing is actually quite interesting - they have the main sitemap: http://example.com/sitemap.xml - that redirects to the sitemap file which is http://example.com/sitemap (with no xml extension) - might that redirect and missing xml extension the redirected to sitemap cause an issue? Never come across such a setup before. Thanks in advance for your feedback - Luke
Intermediate & Advanced SEO | | McTaggart0 -
Xml sitemap only shows up sometimes (magento)
Hi Moz community, I'm using Magento platform. I can generate a sitemap using their xml generator, but it will only pull up sometimes in web explorers, the rest of the time it will show a 404 page. GWT also tells me that I get a 404 error when testing the sitemap, but sometimes it will acknowledge that it's there. Anyone had this problem before or know how to help. sitemap= www.ice.com/sitemap.xml Let me know what other information I can provide to help. Thanks!
Intermediate & Advanced SEO | | IceIcebaby0 -
Sort term product pages and fast indexing - XML sitemaps be updated daily, weekly, etc?
Hi everyone, I am currently working on a website that the XML sitemap is set to update weekly. Our client has requested that this be changed to daily. The real issue is that the website creates short term product pages (10-20 days) and then the product page URL's go 404. So the real problem is quick indexing not daily vs weekly sitemap. I suspect that daily vs weekly sitemaps may help solve the indexing time but does not completely solve the problem. So my question for you is how can I improve indexing time on this project? The real problem is how to get the product pages indexed and ranking before the 404 page shows u?. . Here are some of my initial thoughts and background on the project. Product pages are only available for 10 to 20 days (Auction site).Once the auction on the product ends the URL goes 404. If the pages only exist for 10 to 20 days (404 shows up when the auction is over), this sucks for SEO for several reasons (BTW I was called onto the project as the SEO specialist after the project and site were completed). Reason 1 - It is highly unlikely that the product pages will rank (positions 1 -5) since the site has a very low Domain Authority) and by the time Google indexes the link the auction is over therefore the user sees a 404. Possible solution 1 - all products have authorship from a "trustworthy" author therefore the indexing time improves. Possible solution 2 - Incorporate G+ posts for each product to improve indexing time. There is still a ranking issue here since the site has a low DA. The product might appear but at the bottom of page 2 or 1..etc. Any other ideas? From what I understand, even though sitemaps are fed to Google on a weekly or daily basis this does not mean that Google indexes them right away (please confirm). Best case scenario - Google indexes the links every day (totally unrealistic in my opinion), URL shows up on page 1 or 2 of Google and slowly start to move up. By the time the product ranks in the first 5 positions the auction is over and therefore the user sees a 404. I do think that a sitemap updated daily is better for this project than weekly but I would like to hear the communities opinion. Thanks
Intermediate & Advanced SEO | | Carla_Dawson0 -
404 Errors with my RSS Feed/sitemap
In my google webmasters I just started getting 404 errors that I'm not sure how to redirect. I'm getting quite a few that are ending in /feed/ for instance /nyc-accident-injury/feed/
Intermediate & Advanced SEO | | jsmythd
contact-us-thank-you/feed/ and then also a problem with my sitemap I guess? With /site-map/?postsort=tags The domain is pulversthompson.com0 -
Can a XML sitemap index point to other sitemaps indexes?
We have a massive site that is having some issue being fully crawled due to some of our site architecture and linking. Is it possible to have a XML sitemap index point to other sitemap indexes rather than standalone XML sitemaps? Has anyone done this successfully? Based upon the description here: http://sitemaps.org/protocol.php#index it seems like it should be possible. Thanks in advance for your help!
Intermediate & Advanced SEO | | CareerBliss0 -
What is the best method for segmenting HTML sitemaps?
Sitemaps create a Table of Contents for web crawlers and users alike. Understanding how PageRank is passed, HTML sitemaps play a critical role in how Googlebot and other crawlers spider and catalog content. I get asked this question a lot and, in most cases, it's easy to categorize sitemaps and create 2-3 category-based maps that can be linked to from the global footer. However, what do you do when a client has 40 categories with 200+ pages of content under each category? How do you segment your HTML sitemap in a case like this?
Intermediate & Advanced SEO | | stevewiideman0