Duplicate content on a multi-language Joomla website
-
Dear SEOmoz Community,
I am running a multi-language Joomla website (www.siam2nite.com) with 2 active languages.
The first and primary language is English; the second is Thai. Most of the content (articles, event descriptions ...) is in English only.
What we did is a Thai translation of the navigation bars, headers, titles, etc. (a translation of all Joomla language files). Those texts are static and only help users navigate and understand our site in Thai.
Now I am facing a problem with duplicate content. Let's take our Q&A component as an example.
The URL structure looks like this:
English - www.siam2nite.com/en/questions/
Thai - www.siam2nite.com/th/questions/
Every question asked creates two URLs, one for each language. The content itself (user questions & answers) is identical on both URLs. Only the GUI language is different. If you take a look at this question you will understand what I mean:
ENGLISH VERSION:
http://www.siam2nite.com/en/questions/where-to-celebrate-halloween-in-bangkok
THAI VERSION:
http://www.siam2nite.com/th/questions/where-to-celebrate-halloween-in-bangkok
As you can see, each page has a unique title (H1) and introduction text in the correct language (same for menu, buttons, etc.), but the questions and answers are only available in one language.
Now my question:
I guess Google will see these pages as duplicate content. How should I proceed with this problem:
- block all Thai links under /th/questions/ in robots.txt
or
- set a canonical tag pointing to the English versions?
I'm not sure whether Google will still index the Thai titles and introduction texts if I set a canonical tag (they contain important Thai keywords).
Would really appreciate your help on this.
Regards,
Menelik
-
Hi John
Sorry for my late response ;-(
Thank you very much for your help. I added a rel=alternate for the Thai version as well. So far it looks good - no duplicate content.
Regards,
Menelik
-
The Google Webmaster Tools setup sounds right to me!
You should set rel=alternate on all pages, in both directions, not just on the English pages. That way, if Google is about to return a Thai page to an English searcher, it will know to reference the English page instead. This is the setup Google recommends in its help documentation.
Don't worry about a new sitemap for the /th/ pages. Your current setup should be fine.
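To make the "both directions" point concrete, here is a minimal Python sketch, not Joomla code. The helper name `alternate_links` and the `LANGUAGES` tuple are made up for illustration; the /en/ and /th/ path prefixes are taken from the URLs in this thread. It emits the same set of link tags for the English and the Thai version of a page, including a tag for the page itself, which is how I understand Google's documentation.

```python
from urllib.parse import urlsplit, urlunsplit

# Assumption: every page lives under a /<lang>/ prefix, as in this thread.
LANGUAGES = ("en", "th")


def alternate_links(page_url: str) -> list[str]:
    """Return one rel=alternate link tag per language version of page_url."""
    parts = urlsplit(page_url)
    # A path such as /en/magazine splits into ["", "en", "magazine"].
    _, current_lang, *rest = parts.path.split("/")
    if current_lang not in LANGUAGES:
        raise ValueError(f"no known language prefix in {page_url!r}")
    tags = []
    for lang in LANGUAGES:
        # Swap only the language prefix; keep the rest of the path.
        path = "/".join(["", lang, *rest])
        href = urlunsplit((parts.scheme, parts.netloc, path, parts.query, ""))
        tags.append(f'<link rel="alternate" hreflang="{lang}" href="{href}">')
    return tags


if __name__ == "__main__":
    for tag in alternate_links("http://www.siam2nite.com/th/magazine"):
        print(tag)
```

Because the output depends only on the part of the path after the language prefix, the English and Thai pages end up with identical tag sets, so the references always go back and forth.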
-
Hi John
Thank you very much for your answer. I did not know about the rel=alternate tag until today.
Following your advice, I modified the Joomla header, and now on every English page /en/... there is a rel=alternate link to the Thai version.
For example:
http://www.siam2nite.com/en/magazine now has the following tag:
<link href="http://www.siam2nite.com/th/magazine" hreflang="th" rel="alternate">
According to the webmaster help page (the link you mentioned), I do not need to set a tag on the Thai pages targeting the English ones, correct? Just one rel=alternate on the English pages should be enough?
I tried to follow your advice on Google Webmaster Tools as well. My current configuration looks like this:
My old, already existing site:
1. Site: www.siam2nite.com (no geo-targeting)
Today I created a new one:
2. Site: www.siam2nite.com/th/ (geo-targeting: Thailand)
Is this the setup you meant in your answer?
I did not submit a sitemap for the 2nd site, as all links (Thai and English) are already included in the sitemap I use for the 1st site. Should I split my old sitemap and submit one for each site, containing only the links for that language?
Thank you very much for your kind support - really appreciate it.
-
The proper way to handle this is with rel=alternate hreflang tags. This will tell Google the content is the same, but in different languages. See http://support.google.com/webmasters/bin/answer.py?hl=en&answer=189077 for more info. You can place the link tags in the head of each page, or declare the alternates in your sitemap.
Another thing you can do to help search engines get it right is to set up a profile in Google Webmaster Tools for each of the directories (or at least for the Thai one) and set the geotargeting. Bing prefers that you set the country and language on each page.
If you block the pages with robots.txt or use canonical tags, you're telling Google not to include those pages in SERPs. It sounds like you want the Thai pages to appear in Thai results, and the English pages in English SERPs, so I wouldn't do that.
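For the sitemap option, a rough Python sketch follows. It is only an illustration under assumptions: the function name `build_sitemap` and its parameters are invented here, and the layout (one `url` entry per language version, each carrying an `xhtml:link` for every version including itself) is my reading of Google's hreflang sitemap format, so check it against the help page linked above before relying on it.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
XHTML_NS = "http://www.w3.org/1999/xhtml"


def build_sitemap(base: str, paths: list[str], languages=("en", "th")) -> str:
    """Build a sitemap string with hreflang alternates for each path.

    base is the site root without a trailing slash; each path is the part
    of the URL after the language prefix (for example "magazine").
    """
    # Use the sitemap namespace as default and "xhtml" as the link prefix.
    ET.register_namespace("", SITEMAP_NS)
    ET.register_namespace("xhtml", XHTML_NS)
    urlset = ET.Element(f"{{{SITEMAP_NS}}}urlset")
    for path in paths:
        versions = {lang: f"{base}/{lang}/{path}" for lang in languages}
        # One <url> entry per language version of the page.
        for loc in versions.values():
            url = ET.SubElement(urlset, f"{{{SITEMAP_NS}}}url")
            ET.SubElement(url, f"{{{SITEMAP_NS}}}loc").text = loc
            # Every entry lists all versions, including itself.
            for lang, href in versions.items():
                ET.SubElement(
                    url,
                    f"{{{XHTML_NS}}}link",
                    {"rel": "alternate", "hreflang": lang, "href": href},
                )
    return ET.tostring(urlset, encoding="unicode")


if __name__ == "__main__":
    print(build_sitemap("http://www.siam2nite.com", ["magazine"]))
```

With this approach the alternates live in one file, so the page templates do not need to change. Use either the sitemap or the on-page link tags, whichever is easier to keep in sync.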