HREFLANG for multiple country/language combinations
-
We have a site setup with English, German, French, Spanish and Italian. We offer these languages for every European country (over 30). Thus, there are 150 + different URL combinations, as we use the /country/language/ subdirectory path.
Should I list out every combination in hreflang?Or should I simply choose the most applicable combinations (/de/de and fr/fr, etc.)? If we go the latter path, should I block google bot from crawling the atypical combinations?
Best,
Sam
-
Hi Sam,
Apologies for the slow response. Your question slipped through the net.
This is an interesting case!
In an ideal world, you'd specify the relationship between all of those pages, in each direction. That's 150+ tags per page, though, which is going to cause some headaches. Even if you shift the tagging to an XML sitemap, that's a _lot _of weight and processing.
Anecdotally, I know that hreflang tagging starts to break at those kinds of scales (even more so on large sites, at that kind of scale, when the resultant XML sitemaps can reach the size of many gigabytes, or when Google is crawling faster than it's processing the hreflang directives), and so tagging everything isn't going to be a viable approach.
I'd suggest picking out and implementing hreflang for _only _the primary combinations*, as you suggest, and reducing the site-wide mapping to the primary variant in each case.
- You might consider that there may be cases where the valuable/primary combinations aren't just the /xx/xx/ or _/yy/yy/ _versions and that there might be some examples of varying country/language combinations which are worth including.
For the atypical variants, I think that you have a few options:
-
Use meta robots (or x-robots) tags to set noindex attributes. This will keep them out of the index, but doesn't guarantee that you're effectively managing/consolidating value across near duplicates - you may be quietly harming performance without realising it, as those pages represent points of crawl and value wastage/leakage.
-
Use robots.txt to prevent Google from accessing the atypical variants. That won't necessarily stop them from showing up in search results, though, and isn't without problems - you risk you creating crawl dead-ends, writing off the value of any inbound links to those pages, and other issues.
-
You use canonical URLs on all of the atypical variations, referencing the nearest primary version, to attempt to consolidate value/relevance etc. However, that risks the wrong language/content showing up in the wrong country, as you're explicitly _un_optimising the location component.
I think that #1 is the best approach, as per your thinking. That removes the requirement to do anything clever or manipulative with hreflang tagging, and fits neatly with the idea that the atypical combinations aren't useful/valuable enough to warrant their own identities - Google should be smart enough to fall back to the nearest 'generic' equivalent.
I'd also take care to set up your Google Search Console country targeting for each country-level folder, to reduce the risk of people ending up in the wrong sections.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
What date tags are required/best to use for articles or other landing pages?
Does anyone have suggestions on which date tag(s) are most important to use and how to use them on the frontend? (i.e. dateModified, dateCreated, and datePublished). The Structured Data Testing Tool is coming up with errors for my article pages, but I'm a bit confused which ones should be in the code vs. showing on the frontend.
Algorithm Updates | | ElsaT0 -
What is your hypothesis why Panda/Penguin recoveries happen over months after an algorithm update rather than over night?
We have experienced many scenarios were ranking recoveries from clear Panda and Penguin penalties on our sites don't necessarily happen with the launch of a Panda/Penguin update but instead trickle back in over weeks and months after a confirmed algo update. A good example is shown in the image which shows a panda recovery for a high volume keyword. What is your theory why these ranking recoveries happen over weeks vs instantly? qCWliLF
Algorithm Updates | | italiansoc0 -
Use of http://schema-creator.org boost ranking
Hello all if we use http://schema-creator.org for structured html will it increase our ranking too. has it any benefit for SEO?
Algorithm Updates | | adnan11010 -
Do panda/penguin algorithm updates hit websites or just webpages ?
If I have a website that been affected by the panda/penguin update, do bad links affect the entire site or just the page the bad link(s) are linked to? If it is the latter and penguin/panda actually affect webpages, not websites (as is the common reference/conception), then wouldn't simply creating a new URL, targeting this new URL, shifting meta-tags and restarting link-building efforts again (this time using the right quality strategies) be a really common-sense approach instead of the tediousness of the disavow approach that so many go down?
Algorithm Updates | | Gavo0 -
Meta Description / Meta Keywords
So, I am using Wordpress and the seo by yoast..... I have heard that meta keywords and descriptions are no longer recognized or used by search engines with respect to page rankings in the serps. Is this true? If so, why do people still recommend using these? I thought content is king these days?
Algorithm Updates | | APICDA0 -
Should social widgets be the kind that shares/likes a page, or the kind that adds followers to a brand social page?
I'm wondering if the social widgets on my blog should create a share/like referencing the page or should the social widget create a follower to my brands page on a particular social network? Any ideas?
Algorithm Updates | | salesduke0 -
Best practice for someone wanting to repost / translate some of your blog posts?
I've been contacted by several sites (a few in other countries) who would like to repost some of our articles on their site. A few of these are in other countries and they would like to translate them in their language. (we have a site about raising a child with Down syndrome so they are wanting to use our info to help people...not "beat us" in rankings, or anything like that.) I didn't know what the best practice on this was. I don't want to get dinged for duplicate content or have someone rank higher than me for my own article, etc. Just curious what the best way to go about this was. I'm also assuming the articles that are translated wouldn't be an issue at all since the content will be in another language. Is this right? Thanks!
Algorithm Updates | | NoahsDad0 -
TripAdvisor multiple listings
Why do certain sites for example TripAdvisor rank for their multiple international sites e.g TripAdvisor.co.uk, TripAdvisor.com, TripAdvisor.fr, TripAdvisor.de etc for a English search phrase ? From my view point they are just spamming the index with the same content. These searches are performed on google.co.uk
Algorithm Updates | | NeilTompkins0