Duplicate Content for Spanish & English Product
-
Hi There,
Our company provides training courses and I am looking to provide the Spanish version of a course that we already provide in English. As it is an e-commerce site, our landing page for the English version gives the full description of the course and all related details. Once the course is purchased, a flash based course launches within a player window and the student begins the course.
For the Spanish version of the course, my target customers are English speaking supervisors purchasing the course for their Spanish speaking workers. So the landing page will still be in English (just like the English version of the course) with the same basic description, with the only content differences on that page being the inclusion of the fact that this course is in Spanish and a few details around that.
The majority of the content on these two separate landing pages will be exactly the same, as the description for the overall course is the same, just that it's presented in a different language, so it needs to be 2 separate products.
My fear is that Google will read this as duplicate content and I will be penalized for it. Is this a possibility or will Google know why I set it up this way and not penalize me? If that is a possibility, how should I go about doing this correctly?
Thanks!
-
Thank you for this information, Optimize. Not having a very technical background in this area, it seems quite confusing to try to implement this correctly.
-
Hola Julio,
even though here in SEOmoz we are happy that Mozzers find occasions for collaborating with other people, we think that it would be better (and even safer for your inbox) to use the private message function.
-
Niall,
What is the theme of your course? We are in Mexico searching for new training modules to sell in Latin Market. Maybe we can talk about it...
We have some good websites very well ranked.
Email me!!!
Thanks...
Julio
[email removed by staff]
-
You are going to have a problem with this.....Unfortunately, the combination of duplicate looking content and a directory/subdirectory structure causes sites to be stuck in Googles Panda filter. Google pulled out a "large roll of duct tape" to fix the problem with multiple language version websites, writing “hreflang” on one strip and writing“canonical” on the other strip.
Basically, Google is telling us that we should use a regional subtag in our head tag on each URL to help Google’s spider figure out what kind of content is on each page and where it is intended. Once this is done, Google will consider that the content is intended for that region. Here are the rules for hreflang and canonical....make sure you are sitting down......
Hreflang
The hreflang attribute (hreflang: rel="alternate" hreflang="x") rules in a nutshell:
- Applies to any users from different parts of the world, with content translated in the native language to target that region.
- Used for multilingual websites using substantially the same content on all web pages (e.g., English pages for Australia, Canada, and the U.S.)
- Can specify the language, country, and URLs of content translated for multiple countries.
- Used when:
- You translate only the template of your page (navigation and footer) and main content is still in a single language.
- Pages have broadly similar content within a single language, but are targeted at different regions (e.g., English-language content targeted in U.S., UK, and Australia).
- Content on the web page is fully translated (e.g., have Spanish, French, and English versions of each page).
- How to use rel="alternate" hreflang ="x"
- If there are multiple language versions of the website, each language must use rel="alternate" hreflang="x" (e.g., a page in Spanish must have a rel="alternate" hreflang="x" link to the English and French version and the English and French version must include a link pointing to the Spanish site.
(For more information: http://support.google.com/webmasters/bin/answer.py?hl=en&answer=189077)
Canonical
The multilingual canonical tag (rel="canonical") tells Google that x URL is the preferred location and the most important translated version of the content of the URL.
Multilingual canonical is:
- Used in conjunction with hreflang.
- Can be used when web pages have the same content in the same language targeting multiple countries.
- Sometimes users are directed to the wrong language.
- The canonical designates the version of content that gets indexed and returned to users.
- Use rel="canonical" tag on other versions of the webpage.
- When users enter content into search results, users will likely see the URL that corresponds to their language preference.
Putting hreflang and canonical together:
Spanish site is the canonical and contains the following tags:
link rel="alternate" hreflang="en" href="http://en.example.com/" /English site contains the following tags:
link rel="canonical" href="http://es.example.com/" /French site contains the following tags:
link rel="canonical" href="http://es.example.com/" /(**CAN ONLY BE USED WHEN SPANISH IS THE MAIN LANGUAGE AND ONLY THE TEMPLATE IS TRANSLATED TO ENLISH AND FRENCH)
Hope this is helpful......All of this information can be found in the original author at this link:
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Consolidating a Large Site with Duplicate Content
I will be restructuring a large website for an OEM. They provide products & services for multiple industries, and the product/service offering is identical across all industries. I was looking at the site structure and ran a crawl test, and learned they have a LOT of duplicate content out there because of the way they set up their website. They have a page in the navigation for “solution”, aka what industry you are in. Once that is selected, you are taken to a landing page, and from there, given many options to explore products, read blogs, learn about the business, and contact them. The main navigation is removed. The URL structure is set up with folders, so no matter what you select after you go to your industry, the URL will be “domain.com/industry/next-page”. The product offerings, blogs available, and contact us pages do not vary by industry, so the content that can be found on “domain.com/industry-1/product-1” is identical to the content found on “domain.com/industry-2/product-1” and so-on and so-forth. This is a large site with a fair amount of traffic because it’s a pretty substantial OEM. Most of their content, however, is competing with itself because most of the pages on their website have duplicate content. I won’t begin my work until I can dive in to their GA and have more in-depth conversations with them about what kind of activity they’re tracking and why they set up the website this way. However, I don’t know how strategic they were in this set up and I don’t think they were aware that they had duplicate content. My first thought would be to work towards consolidating the way their site is set up, so we don’t spread the link-equity of “product-1” content, and direct all industries to one page, and track conversion paths a different way. However, I’ve never dealt with a site structure of this magnitude and don’t want to risk messing up their domain authority, missing redirect or URL mapping opportunities, or ruin the fact that their site is still performing well, even though multiple pages have the same content (most of which have high page authority and search visibility). I was curious if anyone has dealt with this before and if they have any recommendations for tackling something like this?
On-Page Optimization | | cassy_rich0 -
Content on product category pages - does Google care?
Hi All, I've always been unsure about the importance of content on product category pages. Nobody reads it. If you search for "living room chairs", you're just going to want to see a big list of living room chairs - not read content about living room chairs, how to choose one, etc. On virtually any ecommerce site, category pages have a paragraph or two of total bla-bla. Does this have any impact on search rankings? More specifically, will Googlebot see content on how to choose a living room chair and say "Yes! This is really helpful content"? Or, will it realize that the searcher intent on this keyword is really just to see a list of chairs, and ignore this content - or at least downplay its importance? WDTY?
On-Page Optimization | | BarryBuckman0 -
Do we need to worry about internal duplicate content?
Hi, I have a question about internal duplicate content. We have a catalogue of around 4000 products. Most of these do have individual descriptions but for most of the products they contain a generic summary that includes a sentence to begin with that includes each product name. We're currently working on descriptions for each product, but as you can imagine it's quite a chore. I was wondering if there are actually any penalties for this or whether we can ignore the crawl errors from the moz report? Thanks in Advance!
On-Page Optimization | | 10dales0 -
Does hreflang restrain my site from being penalized for duplicated content?
I am curently setting up a travel agency website. This site is going to be targeting both american and mexican costumers. I will be working with an /es subdirectory. Would hreflang, besides showing the matching language version in the SERP´s, restrain my site translated content (wich is pretty much the same) from being penalized fro duplicated content? Do I have to implement relcannonical? Thank ypu in advanced for any help you can provide.
On-Page Optimization | | kpi3600 -
Is This A Reason To Move Content?
Dear All, I am questioning my initial decisions when I planned a site due to reading lots of info on moz. Although what I have read has made me question what I have already done, I can't find anything that is specific to my exact case, so here goes. I recently built a shopping cart in OpenCart. I want the site to have lots of information on the products it sells. I have populated each category with at least 1000 words of content that is specific to the products in that category, also I have some information pages that have no products in them at all, just copy. So the shopping site actually has a few pages that look like a static website and a few that look like a normal shopping cart. My thought behind this was I wanted the pages with lots of info to rank and become authoritative, in some way elevating the whole site. I have recently put a blog on the site, and a combination of that, and reading Moz has lead me think that I should move all the content from the category pages to the blog, and deep link each blog post to it's relevant products and category. From what I have read it would be easier to get the blog ranking and acknowledged as an authority rather than 30 category pages. Also each 1500+ word category page will make at least 3-4 nice blog posts, and each post can be focused on a single keyword rather than a large category page that has maybe 3-4 keywords it's trying to rank for. Also the blog is much better optimised than a standard OC category page (even using extensions with them). The only negative I can see is moving the content, but the site is less that 2 months old, and the amount of link juice it has is negligible. Does google cut new sites a bit of slack in these situations of moving content around, or will I be seen as 'up to something' by google? I guess my question is, am I barking up the right tree? Or is the old adage 'a little information is dangerous' true in this case, and I just about to make a load of work for the sake of it with no real benefit. However, if I am to make such a dramatic change to the sites architecture I think the time is now, before things start gaining juice & rank. I hope I have explained my situation clearly and I thank anyone who can offer me any advice. Great forum, Thank you, Ian
On-Page Optimization | | cookie7770 -
Duplicate content
crawler shows following links as duplicate http://www.mysite.com http://mysite.com http://www.mysite.com/ http://mysite.com. http://mysite.com/index.html How can i solve this issue?
On-Page Optimization | | bhanu22170 -
Duplicate Page Title
Wordpress Category pagination causes duplicate page title errors (ie. when there are so many posts in the category, it paginates them), is this a problem? Your tool is reporting it as a problem... but ProPhoto (my Wordpress provider say it is not a problem). Here are the 2 URL's with the same page title: http://www.lisagillphotography.co.uk/category/child-photography/ http://www.lisagillphotography.co.uk/category/child-photography/page/2/
On-Page Optimization | | LisaGill0 -
Optimise duplicate products or canonical link
We exist in a niche market with a good % of products that sell well at specific times of the year. Lets say for example a red cup can be sold as a christmas red cup and a valentine red cup or just a red cup. Would we be best to optimize each specific product specifically for those seasons/events on different pages or keep google pointed to just one page using a canonical link.
On-Page Optimization | | LadyApollo0