Duplicate Content for Spanish & English Product
-
Hi There,
Our company provides training courses and I am looking to provide the Spanish version of a course that we already provide in English. As it is an e-commerce site, our landing page for the English version gives the full description of the course and all related details. Once the course is purchased, a flash based course launches within a player window and the student begins the course.
For the Spanish version of the course, my target customers are English speaking supervisors purchasing the course for their Spanish speaking workers. So the landing page will still be in English (just like the English version of the course) with the same basic description, with the only content differences on that page being the inclusion of the fact that this course is in Spanish and a few details around that.
The majority of the content on these two separate landing pages will be exactly the same, as the description for the overall course is the same, just that it's presented in a different language, so it needs to be 2 separate products.
My fear is that Google will read this as duplicate content and I will be penalized for it. Is this a possibility or will Google know why I set it up this way and not penalize me? If that is a possibility, how should I go about doing this correctly?
Thanks!
-
Thank you for this information, Optimize. Not having a very technical background in this area, it seems quite confusing to try to implement this correctly.
-
Hola Julio,
even though here in SEOmoz we are happy that Mozzers find occasions for collaborating with other people, we think that it would be better (and even safer for your inbox) to use the private message function.
-
Niall,
What is the theme of your course? We are in Mexico searching for new training modules to sell in Latin Market. Maybe we can talk about it...
We have some good websites very well ranked.
Email me!!!
Thanks...
Julio
[email removed by staff]
-
You are going to have a problem with this.....Unfortunately, the combination of duplicate looking content and a directory/subdirectory structure causes sites to be stuck in Googles Panda filter. Google pulled out a "large roll of duct tape" to fix the problem with multiple language version websites, writing “hreflang” on one strip and writing“canonical” on the other strip.
Basically, Google is telling us that we should use a regional subtag in our head tag on each URL to help Google’s spider figure out what kind of content is on each page and where it is intended. Once this is done, Google will consider that the content is intended for that region. Here are the rules for hreflang and canonical....make sure you are sitting down......
Hreflang
The hreflang attribute (hreflang: rel="alternate" hreflang="x") rules in a nutshell:
- Applies to any users from different parts of the world, with content translated in the native language to target that region.
- Used for multilingual websites using substantially the same content on all web pages (e.g., English pages for Australia, Canada, and the U.S.)
- Can specify the language, country, and URLs of content translated for multiple countries.
- Used when:
- You translate only the template of your page (navigation and footer) and main content is still in a single language.
- Pages have broadly similar content within a single language, but are targeted at different regions (e.g., English-language content targeted in U.S., UK, and Australia).
- Content on the web page is fully translated (e.g., have Spanish, French, and English versions of each page).
- How to use rel="alternate" hreflang ="x"
- If there are multiple language versions of the website, each language must use rel="alternate" hreflang="x" (e.g., a page in Spanish must have a rel="alternate" hreflang="x" link to the English and French version and the English and French version must include a link pointing to the Spanish site.
(For more information: http://support.google.com/webmasters/bin/answer.py?hl=en&answer=189077)
Canonical
The multilingual canonical tag (rel="canonical") tells Google that x URL is the preferred location and the most important translated version of the content of the URL.
Multilingual canonical is:
- Used in conjunction with hreflang.
- Can be used when web pages have the same content in the same language targeting multiple countries.
- Sometimes users are directed to the wrong language.
- The canonical designates the version of content that gets indexed and returned to users.
- Use rel="canonical" tag on other versions of the webpage.
- When users enter content into search results, users will likely see the URL that corresponds to their language preference.
Putting hreflang and canonical together:
Spanish site is the canonical and contains the following tags:
link rel="alternate" hreflang="en" href="http://en.example.com/" /English site contains the following tags:
link rel="canonical" href="http://es.example.com/" /French site contains the following tags:
link rel="canonical" href="http://es.example.com/" /(**CAN ONLY BE USED WHEN SPANISH IS THE MAIN LANGUAGE AND ONLY THE TEMPLATE IS TRANSLATED TO ENLISH AND FRENCH)
Hope this is helpful......All of this information can be found in the original author at this link:
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Product content length & links within product description
Hello, I have questions regarding content length and links within descriptions. With our ecommerce site, we have thousands of products, each with a unique description. In the product description, I have links to the parent category and grandparent category (if it has one) in the main product text which is generally about 175 words. Then I have a last paragraph that's about 75 words that includes links to our main homepage and our main product catalogue page. Is the content length long enough? I used to use text that was 500 words, and shortening it I still rank when launching new products, so I don't think an increase in text length will have any additional benefit. I do see conflicting information when I do searches, with some people recommending a minimum of 300 words and some saying to try and go a 1000 for category pages. In regards to the links, I noticed a competitor has stopped following this format, so I'm unsure if I should keep going too. Is it too many links to have each of the products link back to the main catalogue and homepage? Is it good to have links with anchor text to the categories a product is in? There are breadcrumbs on the page with these links already. There are already have heaps of links on our pages (footer, and a right sidebar with image links to relevant categories), so my pages do get flagged for too many links. Thanks!
On-Page Optimization | | JustinBSLW0 -
Unique Pages with Thin Content vs. One Page with Lots of Content
Is there anyone who can give me a definitive answer on which of the following situations is preferable from an SEO standpoint for the services section of a website? 1. Many unique and targeted service pages with the primary keyword in the URL, Title tag and H1 - but with the tradeoff of having thin content on the page (i.e. 100 words of content or less). 2. One large service page listing all services in the content. Primary keyword for URL, title tag and H1 would be something like "(company name) services" and each service would be in the H2 title. In this case, there is lots of content on the page. Yes, the ideal situation would be to beef up content for each unique pages, but we have found that this isn't always an option based on the amount of time a client has dedicated to a project.
On-Page Optimization | | RCDesign741 -
Duplicate content errors
I have multiple duplicate content errors in my crawl diagnostics. The problem is though that i already took care of these problems with the canonical tag but MOZ keeps saying there is a problem. For example this page http://www.letspump.dk/produkter/56-aminosyre/ has a canonical tag, but moz still says it has an error. Why is that?
On-Page Optimization | | toejklemme0 -
E-Commerce Site - Duplicate Content
We run an e-commerce site with about 250,000 SKUs. Certain items, such as a micro USB car charger, will be applicable to several different phones. Example: http://www.wirelessemporium.com/p-165787-samsung-galaxy-proclaim-illusion-sch-i110-heavy-duty-car-charger.asp http://www.wirelessemporium.com/p-165856-sony-xperia-ion-4g-lte-att-heavy-duty-car-charger.asp As one can imagine with so many items, unique content for each item description page can be a challenge. What would be the best way to address this on a large scale?
On-Page Optimization | | eugeneku0 -
Is my blog simply duplicate content of my authors' profiles?
www.example.com/blog is the full list of blog posts by various writers. The list contains the title of each article and the first paragraph from the article. In addition to /blog being indexed, each author's contribution list is being indexed separately. It's not a profile, really, just a list of articles in the same title & paragraph format of the /blog page. So if /blog a list of 10 articles written by two writers, I have three pages: /blog/author1 is a list of 4 articles /blog/author2 is a list of 6 different articles /blog is a list of 10 articles (the 4+6 from the two writers) Is this going to be considered duplicate content?
On-Page Optimization | | Brocberry0 -
Has a product portfolio bad (direct) influence on rankings because of less relevant content?
Hey SEO's, I'm wondering if a bad product portfolio of a e-commerce website has an influence (direct) on the rankings because of less relevant content. To show you what I'm thinking about, look at these two situations: 1. E-Commerce website with 3.000 products from different brands (all top brands) but the products itself are not quite up to date 2. E-Comerce website with 3.000 products from differend brands (all top brands) but these products contain allways the top seller items. Is it possible, that Google has an index for "fresh content" or "highly demanded content" or something like this which has direct influence on the ranking? I ask this question besides the fact, that its of course allways better to have the top sellers in a portfolio than not have. But the big thing is: What to do if you don't know which products are best, or you just don't get them from your retailers or some other reason... For Google & Co. its harmfull (I guess) to put a e-commerce website without top-seller products into the top-rankings with high competition!? Any ideas on that?
On-Page Optimization | | videriconcept0 -
Duplicate content because of content scrapping - please help
We manage brands websites in a very competitive industry that have thousands of affiliate links We see that more and more websites (mainly affiliates websites) are scrapping our brand websites content and it generate many duplicate content (but most of them link to us back with an affiliate link). Our brand websites still rank for any sentence in brackets you search in Google, Will this duplicate content hurt our brand websites ? If yes, should we take some preventive actions ? We are not able to add ongoing UGC or additional text to all our duplicate content and trying to stop those websites of stealing our content is like playing cat and mouse... Thanks for your advices
On-Page Optimization | | Tit0 -
What is the best way to manage industry required duplicate Important Safety Information (ISI) content on every page of a site?
Hello SEOmozzer! I have recently joined a large pharmaceutical marketing company as our head SEO guru, and I've encountered a duplicate content related issue here that I'd like some help on. Because there is so much red tape in the pharmaceutical industry, there are A LOT of limitations on website content, medication and drug claims, etc. Because of this, it is required to have Important Safety Information (ISI) clearly stated on every page of the client's website (including the homepage). The information is generally pretty lengthy, and in some cases is longer than the non-ISI content on each page. Here is an example: http://www.xifaxan.com/ All content under the ISI header is required on each page. My questions are: How will this duplicated content on each page affect our on-page optimization scores in the eyes of search engines? Is Google seeing this simply as duplicated content on every page, or are they "smart" enough to understand that because it is a drug website, this is industry standard (and required)? Aside from creating more meaty, non-ISI content for the site, are there any other suggestions you have for handling this potentially harmful SEO situation? And in case you were going to suggest it, we cannot simply have an image of the content, as it may not be visible by all internet users. We've already looked into that 😉 Thanks in advance! Dylan
On-Page Optimization | | MedThinkCommunications0