Multilingual -> ahref lang, canonical and duplicated title content
-
Hi all!
We have our site eurasmus.com where we are implementing the multilingual.
We have already available english and spanish and we use basically href lang to control different areas.First question:
When a page is not translated but still is visible in both langauges under /en and /es is it enough with the hreflang or should we
add a canonical as well? Nowadays we are apply href lang and only canonicals to the one which are duplicated
in the same language.Second question:
When some pages are not translated, like http://eurasmus.com/en/info/find-intern-placement-austria and http://eurasmus.com/es/info/find-intern-placement-austria,
we are setting up the href lang but still moz detects title and meta duplicated (not duplicate page content).
What do you suggest we should do?Let me know and thank you before hand for your help!
-
What I know is that since almost one year Google is able to deal with duplicated content in a multilingual or multicountry environment if the hreflang is well implemented.
Moreover... if you were using the rel="canonical", you were practically quitting to your Spanish home page (in this specific case) any possibility to even being present in the index, because you would be telling Google:
"Don't consider this URL, but just the canonical one".
This is one of the reasons why Google quit all mention of the rel="canonical" in the hreflang help pages.
-
I am not so sure about using canonical, even if this case is multilingual and not multicountry.
Maybe this is due to the well-known inability Google has to communicate correctly, but in this case it is quite clear with its example:
Some example scenarios where rel="alternate" hreflang="x" is recommended:
You keep the main content in a single language and translate only the template, such as the navigation and footer. Pages that feature user-generated content like a forums typically do this.
This scenario is the one described in this Q&A, so I personally would not suggest canonicalization but yes using hreflang, and - obviously - my main priority would be telling to localize all the content of the page, also because without a complete translation the opportunities to rank in Google.es are substantially zero.
-
I confirm that the moz crawler does not detect or consider the hreflang (in fact no tabs or advice in the moz analytics is dedicated to it).
The only tools that consider it by default (and that I know) are deepcrawl and onpage.org
-
They are not great at writing their own explanations for international. What they meant above is if you have geo-targeted correctly, you would not have to use a canonical between two pages that are the same. That they will figure it out on their own.
You aren't geo-targeting, so I still think the canonical would be needed.
-
Hi there Kate!
Thanks for your time. That is what logic tells me.
But "God" google says, confusing me:
Specifying language and location
We've expanded our support of the rel="alternate" hreflang link element to handle content that is translated or provided for multiple geographic regions. The hreflang attribute can specify the language, optionally the country, and URLs of equivalent content. By specifying these alternate URLs, our goal is to be able to consolidate signals for these pages, and to serve the appropriate URL to users in search. Alternative URLs can be on the same site or on another domain.
Annotating pages as substantially similar content
Optionally, for pages that have substantially the same content in the same language and are targeted at multiple countries, you may use the rel="canonical" link element to specify your preferred version. We’ll use that signal to focus on that version in search, while showing the local URLs to users where appropriate. For example, you could use this if you have the same product page in German, but want to target it separately to users searching on the Google properties for Germany, Austria, and Switzerland.
Update: to simplify implementation, we no longer recommend using rel=canonical.So I guess canonical is no longer needed?
-
HREFLANG is all you need to note the change in language between two pages. However, if the page has not been translated and is available under both language subfolders, make sure there isn't an HREFLANG and has a canonical. When the pages are identical and have 2 URLs, us a canonical and NOT HREFLANG.
I am not sure if Moz detects HREFLANG. If you know it's set up correctly, just ignore the warnings in Moz. And if you can, translate the title and description as well. That'll help get rid of the warnings.
-
Geo-tagging is not necessary if the content is just translated.
-
Did you assign the geography in webmastertools? This is advised and should already prevent some of the problems might they arise ( i think it should be OK)
Using a canonical is always a good way of harnessing the link value to one specific version.
You could test if a problem is there by running your englisch keywords against the local version of Google.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate Content
I am trying to get a handle on how to fix and control a large amount of duplicate content I keep getting on my Moz Reports. The main area where this comes up is for duplicate page content and duplicate title tags ... thousands of them. I partially understand the source of the problem. My site mixes free content with content that requires a login. I think if I were to change my crawl settings to eliminate the login and index the paid content it would lower the quantity of duplicate pages and help me identify the true duplicate pages because a large number of duplicates occur at the site login. Unfortunately, it's not simple in my case because last year I encountered a problem when migrating my archives into a new CMS. The app in the CMS that migrated the data caused a large amount of data truncation Which means that I am piecing together my archives of approximately 5,000 articles. It also means that much of the piecing together process requires me to keep the former app that manages the articles to find where certain articles were truncated and to copy the text that followed the truncation and complete the articles. So far, I have restored about half of the archives which is time-consuming tedious work. My question is if anyone knows a more efficient way of identifying and editing duplicate pages and title tags?
Technical SEO | | Prop650 -
Is there an percentage of duplicate content required before you should use a canonical tag?
Is there a percentage (approximate or exact) of duplicate content you should have before you use a canonical tag? Similarly how does Google handle canonical tags if the pages aren’t 100% duplicate? I've added some background and an example below; Nike Trainer model 1 – has an overview page that also links to a sub-page about cushioning, one about Gore-Tex and one about breathability. Nike Trainer model 2,3,4,5 – have an overview page that also links to sub-pages page about cushioning , Gore-Tex and breathability. In each of the sub-pages the URL is a child of the parent so a distinct page from each other e.g. /nike-trainer/model-1/gore-tex /nike-trainer/model-2/gore-tex. There is some differences in material composition, some different images and of course the product name is referred multiple times. This makes the page in the region of 80% unique.
Technical SEO | | punchseo0 -
Duplicate Content
Crawl Diagnostics has returned several issues that I'm unsure how to fix. I'm guessing it's a canonical link issue but not entirely sure... Duplicate Page Content/Titles On a website (http://www.smselectronics.co.uk/market-sectors) with 6 market sectors but each pull the same 3 pages as child pages - certifications, equipment & case studies. On each products section where the page only shows X amount of items but there are several pages to fit all the products this creates multiple pages. There is also a similar pagination problem with the Blogs (auto generated date titles & user created SEO titles) & News listings. Blog Tags also seem to generate duplicate pages with the same content/titles as the parent page. Are these particularly important for SEO or is it more important to remove the duplication by deleting them? Any help would be greatly appreciated. Thanks
Technical SEO | | BBDCreative0 -
How to avoid duplicate content
Hi, I have a website which is ranking on page 1: www.oldname.com/landing-page But because of legal reason i had to change the name.
Technical SEO | | mikehenze
So i moved the landing page to a different domain.
And 301'ed this landing page to the new domain (and removed all products). www.newname.com/landing-page All the meta data, titles, products are still the same. www.oldname.com/landing-page is still on the same position
And www.newname.com/landing-page was on page 1 for 1 day and is now on page 4. What did i do wrong and how can I fix this?
Maybe remove www.oldname.com/landing-page from Google with Google Webmaster Central or not allow crawling of this page with .htaccess ?0 -
Tired of finding solution for duplicate contents.
Just my site was scanned by seomoz and seen lots of duplicate content and titles found. Well I am tired of finding solutions of duplicate content for a shopping site product category page. You can see the screenshot below. http://i.imgur.com/TXPretv.png You can see below in every link its showing "items_per_page=64, 128 etc.". This happened in every category in which I was created. I am already using Canonical add-on to avoid this problem but still it's there. You can check my domain here - http://www.plugnbuy.com/computer-software/pc-security/antivirus-internet-security/ and see if the add-on working correct. I recently submitted my sitemap to GWT, so that's why it's not showing me any report regarding duplicate issues. Please help ME
Technical SEO | | chandubaba0 -
Duplicate Content
The crawl shows a lot of duplicate content on my site. Most of the urls its showing are categories and tags (wordpress). so what does this mean exactly? categories is too much like other categories? And how do i go about fixing this the best way. thanks
Technical SEO | | vansy0 -
How can something be duplicate content of itself?
Just got the new crawl report, and I have a recurring issue that comes back around every month or so, which is that a bunch of pages are reported as duplicate content for themselves. Literally the same URL: http://awesomewidgetworld.com/promotions.shtml is reporting that http://awesomewidgetworld.com/promotions.shtml is both a duplicate title, and duplicate content. Well, I would hope so! It's the same URL! Is this a crawl error? Is it a site error? Has anyone seen this before? Do I need to give more information? P.S. awesomewidgetworld is not the actual site name.
Technical SEO | | BetAmerica0 -
How do I eliminate duplicate page titles?
Almost...I repeat almost all of my duplicate page titles show up as such because the page is being seen twice in the crawl. How do I prevent this? <colgroup><col width="336"> <col width="438"></colgroup>
Technical SEO | | ENSO
| www.ensoplastics.com/ContactUs/ContactUs.html | Contact ENSO Plastics |
| ensoplastics.com/ContactUs/ContactUs.html | Contact ENSO Plastics | This is what is from the CSV...there are many more just like this. How do I cut out all of these duplicate urls?0