Multilingual site with untranslated content
-
We are developing a site that will have several languages.
There will be several thousand pages, the default language will be English. Several sections of the site will not be translated at first, so the main content will be in English but navigation/boilerplate will be translated.
We have hreflang alternate tags set up for each individual page pointing to each of the other languages, eg in the English version we have:
etc
In the spanish version, we would point to the french version and the english version etc.
My question is, is this sufficient to avoid a duplicate content penalty for google for the untranslated pages?
I am aware that from a user perspective, having untranslated content is bad, but in this case it is unavoidable at first.
-
Thanks for your comments Gianluca.
I think Google's guidelines are somewhat ambiguous. Here it does state that "if you're providing the same content to the same users on different URLs (for instance, if both example.de/ and example.com/de/ show German language content for users in Germany), you should pick a preferred version and redirect (or use the rel=canonical link element) appropriately."
https://support.google.com/webmasters/answer/182192?hl=en
I think you've explained it nicely though.
-
At first that would be fine.
Said that, this is a very specific case where you can use both hreflang and cross domain rel="canonical".
Remember that these two mark-up are totally independent one each other, though.
If you use them both, as I wrote replying to Yusuf, from one side you are telling Google that you want it to show a determined URL for a determined geo-targeted country/language, and from other side you are also telling Google that that geo-targeted URL is the exact copy of the canonical one.
What Google will do will be showing the geo-targeted URL in the SERPs, but with the Title and Meta Description of the canonical one.
One more thing, and this a strong reason for urging a complete translation in a short period of time:
if the content of the URL of the French site, for instance, is in English, you cannot put "fr-FR" in the hreflang, but "en-FR". This is a consequence: that the URL will tend to be shown only for English queries done in Google.fr, not for French queries... and that mean loosing a lot of traffic opportunities.
-
Yusuf,
I'm sorry but I've to correct you.
If two pages are in the same language, but they are targeting different countries (i.e.: USA and UK), even if the content is the same or substantially the same, then you not only can use the hreflang, but also you should use it in order to tell Google that one URL must be shown to US people and the other to UK ones.
Obviously, if you want you can always decide to use the cross domain rel="canonical" instead.
Remember, though, that in that case - if you are using the hreflang - that Google will show the snippets' components (title and meta description) of the canonical URL, even it will show the geotargeted URL. Instead, if you opted to not use the hreflang, people will see the canonical URL snippet (web address included).
-
Have you taken a look through the following :
https://support.google.com/webmasters/answer/182192?hl=en#1
https://sites.google.com/site/webmasterhelpforum/en/faq-internationalisation
"
Duplicate content and international sites
Websites that provide content for different regions and in different languages sometimes create content that is the same or similar but available on different URLs. This is generally not a problem as long as the content is for different users in different countries. While we strongly recommend that you provide unique content for each different group of users, we understand that this may not always be possible. There is generally no need to "hide" the duplicates by disallowing crawling in a robots.txt file or by using a "noindex" robots meta tag. However, if you're providing the same content to the same users on different URLs (for instance, if both
example.de/
andexample.com/de/
show German language content for users in Germany), you should pick a preferred version and redirect (or use the rel=canonical link element) appropriately. In addition, you should follow the guidelines on rel-alternate-hreflang to make sure that the correct language or regional URL is served to searchers." -
Hi Jorge
The rel="alternate" hreflang="x" tag is not suitable for pages that are in the same language as these are essentially duplicates rather than alternative language versions.
I'd use the rel="canonical" tag to point to the main page until the translations of those pages are available.
Webmaster Tools should allow you to see any issues.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How do you implement an SEO site structure with content that falls under two silos?
We primarily produce two different types of content: concise fact sheets on topics and video briefings + transcripts of topics. Often these two content types cover the same topic area and since we're currently siloing by content type, these pages end up competing against each other for rankings. Advice on a site structure that'd avoid these issues?
On-Page Optimization | | jay_elsie1 -
When does updating site content affect SERPS?
Can any of you shine a light as to when updating content on a website had any effect on its 'general' placement in the search engines? I appreciate that for an online newspaper, it must be important, but has anyone noticed from theirs or a client's site that by not uploading i.e. product descriptions or articles in general, that their site has taken a hit?
On-Page Optimization | | SDavis110 -
What constitutes duplicate content on a page?
I am working on SEO for a Shopify store. Their products are very similar, hence the pages are so similar that Moz shows them as duplicate content. The only difference in the product pages is the title and model number. I am going to "go for the gold" and try re-writing all the product descriptions. It's incredibly difficult due to the products being nearly identical with just a minor variation. I know I could go down the road of just creating variants --- but the customer is not down for that. Here's my question: what constitutes duplicate content? 80% of the content, 90%???? If I can going to re-write the descriptions, what should I aim for? Thank you!
On-Page Optimization | | steve_linn1 -
Content in Tabs
I speed read an article recently and forgot to save it regarding Contents on a page in tabs. Is it correct that now Google is rendering the entire page it's better not to have content in tabs hidden by Javascript? As it stands at the moment, we've got the tabs set-up so that the main part of the page containing the keyword rich text is in a tab and not the first thing presented to the user
On-Page Optimization | | Ham19790 -
Duplicate content on events site
I have an event website and for every day the event occurs the event has a page. For example: The Oktoberfest in Germany the event takes 16 days. My site would have 16 (almost)identical pages about the Oktoberfest(same text, adres, photos, contact info). The only difference between the pages is the date mentioned on the page. I use rich snippets. How does google treat my pages and what is the best practice.
On-Page Optimization | | dragonflo0 -
I have one page on my site... but still get duplicate name and content errors.
i have only the index.html page. my domain has a permanent 301 to the root. why am i getting duplicate problems? i only have one page the index .html???
On-Page Optimization | | one4u2see0 -
Site Structure
I'm confused about the best way for seo to set up the site structure . i understand the examples of the pyramid diagrams and how link juice flows, however does this mean that global navigation is not good? It appears the pyramid structure leads to the designated number of category pages (we'll use five) and they lead to the 5 content pages etc and some "superman pages" can be linked to from the home page but is this is global navigation or anchor text navigation and is gloval navigation acdeptable for content pages? Any input would be greatly appreciated. Thank you.
On-Page Optimization | | JulB0 -
Number of Pages a Site Contains
Hi, How does the number of pages indexed from a single website affect that websites ability to compete in the SERPS? Cheers
On-Page Optimization | | Jurnii0