Duplicated Content with joomla multi language website
-
Dear Seomoz Community
I am running a multi language joomla website (www.siam2nite.com) with 2 active languages.
The first and primary language is english. the second language is thai. Most of the content (articles, event descriptions ...) is in english only.
What we did is a thai translation for the navigation bars, headers, titles etc (translation of all joomla language files) those texts are static and only help the user navigate / understand our site in their thai language.
Now I facing a problem with duplicated content. Lets take our Q&A component as example.
the url structure looks like this:
english - www.siam2nite.com/en/questions/
thai - www.siam2nite.com/th/questions/
Every question asked will create two URL, one for each language. The content itself (user questions & answers) is identical on both URL's. Only the GUI language is different. If you take a look at this question you will understand what i mean:
ENGLISH VERSION:
http://www.siam2nite.com/en/questions/where-to-celebrate-halloween-in-bangkok
THAI VERSION:
http://www.siam2nite.com/th/questions/where-to-celebrate-halloween-in-bangkok
As you can see each page has a unique title (H1) and introduction text in the correct language (same for menu, buttons, etc.) but the questions and answers are only available in one language.
Now my question
I guess Google will see this pages as duplicated content. How should I proceed with this problem:
- put all thai links /th/questions/ in the robots.txt and block them
or
- make a canonical tag for the english versions?
Not sure if I set a canonical tag google will still index the thai title and introduction texts (they have important thai keywords in them)
Would really appreciate your help on this
Regards,
Menelik
-
Hi John
Sorry for my late response ;-(
Thank you very much for your help. I added a rel=alternate for the Thai version as well. So far it looks good - no duplicated content.
Regards,
Menelik
-
The Google Webmaster set up sounds right to me!
You should set the rel alternate on all pages that go back and forth, not just the English pages. That way if Google wants to return a Thai page to an English searcher, it'll know to reference the English page. This is the set up Google recommends in their help documentation.
Don't worry about a new sitemap for the /th/ pages. Your current set up should be fine.
-
Hi John
Thank you very much for your answer. I did not know about the rel=alternate tag until today
Following your advise I modified the joomla header and now on every english page /en/... their is a rel=alternate link to the thai version.
for example:
http://www.siam2nite.com/en/magazine now has the following tag:
<link href="http://www.siam2nite.com/th/magazine" hreflang="th" rel="alternate">
Regarding the webmaster help (link you mentioned) I do not need to set a tag on the thai pages targeting the english ones correct? Just one rel=alternate on the english pages should make it right?
I tried to follow your advise with Google webmaster as well. My current configuration looks like this:
My old already existing site:
1 Site: www.siam2nite.com (no geo-targeting)
Today I created a new one
2. Site: www.siam2nite.com/th/ (geo-targeting: Thailand)
Is this the setup you meant in your answer?
I did not submit a sitemap for the 2nd site as all links (thai and english) are already included in the sitemap I use on the 1 site. Should I split my old sitemap and submit one for each site containing only the correct language links?
Thank you very much for your kind support - really appreciate it
-
The proper way to handle this is with rel=alternate hreflang tags. This will tell Google the content is the same, but in different languages. See http://support.google.com/webmasters/bin/answer.py?hl=en&answer=189077 for more info. You can place meta tags on each page, or do it in your sitemap.
Other things you can do to help search engines get it right is to set up a profile in Google Webmaster Tools for each of the directories (or at least for the Thai one), and set the geotargeting. For Bing, they prefer you set the country and language on each page (see here).
If you block the pages with robots.txt or use canonical tags, you're telling Google not to include those pages in SERPs. It sounds like you want the Thai pages to appear in Thai results, and the English pages in English SERPs, so I wouldn't do that.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Content above the Fold, or Below
Hi, I have an ecommerce site with several categories that I consider good landing pages. In order to get better search results I add content to these pages, usually above the fold, then after the content products are listed. Example:https://www.carburetor-parts.com/Carburetor-Kits_c_568.html I worry that customers get to the page and since they don't see the products above the fold, they move on. Should I be putting content in the footer instead of the header and if so how does that effect SEO? This has been bugging me for a long time. Thanks
On-Page Optimization | | MikeCarbs
Mike0 -
Writing cornerstone content for a shop (eCommerce) website
Hi there I am trying to optimise my site to the best that it can be. Since the most recent Google updates, everything that I reading is saying cornerstone content with lots of valuable content is a really good strategy as it tells Google what is the most important content on your site. Writing articles that are well structured and have give the user a detailed overview of that subject. Lots of top SEO's are saying 3000 words plus on these pages. My question is, how do I go about this with and eCommerce site? Obviously that majority of the keywords that I want to target are product related and these are the pages that I want to come up in the search. How do I go about creating cornerstone content for these pages? I am thinking that one of my cornerstone pieces of content would be "The Ultimate Guide to [my main product category]". But that product has numerous products related to it, all of which have their own keywords, so how would this help the products to rank? The site had two main product categories, with numerous products under each of those categories. The two main categories are targeting my best performing keywords, but currently the landing page for these is the main product category pages. I am really struggling to work out the best strategy here. The content that I have on my actual products pages is comprehensive and covers a lot of detail about that particular product and has started to rank for product keywords, but I am guessing Google wouldn't consider that to be cornerstone content. I hope this make sense. Any advice anyone can give would be really useful. Many thanks in advance
On-Page Optimization | | Clojobobo1 -
Duplicate Content - Pricing Plan tables
Hey guys, We're faced with a problem that we want to solve. We're working on the designs for a few pages for a drag & drop email builder we're currently working on, and we will be having the same pricing table on several pages (much like Moz does). We're worried that Google will take this as duplicate content and not be very fond of it. Any ideas about how we could integrate the same flow without potentially harming ranking efforts? And NO, re-writing the content for each table is not an option. It would do nothing but confuse the heck out of our clients. 😄 Thanks everybody!
On-Page Optimization | | andy.bigbangthemes0 -
Duplicate content on partner site
I have a trade partner who will be using some of our content on their site. What's the best way to prevent any duplicate content issues? Their plan is to attribute the content to us using rel=author tagging. Would this be sufficient or should I request that they do something else too? Thanks
On-Page Optimization | | ShearingsGroup0 -
Content by Country
Currently we have a news website aimed at several countries. We want to filter the content of some url (home, category pages, ...) using the country of origin of the visitor. For example in the home we've heard of global character, and a column with news of the country of origin of the visitor. This may affect the position or cause a Google penalty? thank you very much
On-Page Optimization | | promonet0 -
Suggestions to avoid duplicate content
Hi, we have about 6500 products, almost all with descriptions. SEOMOZ is showing about 2500 of them with duplicate content. The reason for this is that only one or two words are different for each product. For example, we have 500 award certificates. All are the same size and have the same description. But one is swimming, one baseball, one reading, etc, etc. Apparently the 1 word difference is not enough to differentiate. We have the same issue with our trophies - they are identical, except for figures. Does anyone have any good tips on how to change the content to avoid this issue and to avoid making up content for 2500 items? Thanks! Neil trophycentral.com
On-Page Optimization | | trophycentraltrophiesandawards0 -
Duplicate Content - Site Wide or Internet Wide?
Hello... I am creating a new website and i was wondering how you guys would define duplicate content? If my new site had the same page titles and descriptions as my existing site, would that be duplicate content? Or does duplicate content mean same titles and descriptions in the same site? I'm wondering if i can upload the same database (with page titles and descriptions and alt tags) to my new site or if that would be looked at as duplicate... Thanks
On-Page Optimization | | Prime850 -
Duplicate Page Titles?
I'm running a campaign report within SEOmoz & am getting 9 pages that appear on this report. They all happen to be our author pages www.example.com//author/admin We have multiple authors. Is there a proper way that I should take care of this? Also as a side note, I'm using Yoast Wordpress SEO plugin, is there a setting on their I should change that will fix this issue? Or is it an issue at all? Thanks, BJ
On-Page Optimization | | seointern0