Duplicate content with a Joomla multi-language website
-
Dear Seomoz Community
I am running a multi-language Joomla website (www.siam2nite.com) with two active languages.
The first and primary language is English; the second is Thai. Most of the content (articles, event descriptions ...) is in English only.
What we did is translate the navigation bars, headers, titles, etc. into Thai (a translation of all Joomla language files). Those texts are static and only help users navigate and understand our site in Thai.
Now I am facing a problem with duplicated content. Let's take our Q&A component as an example.
The URL structure looks like this:
English - www.siam2nite.com/en/questions/
Thai - www.siam2nite.com/th/questions/
Every question asked creates two URLs, one for each language. The content itself (user questions and answers) is identical on both URLs. Only the GUI language is different. If you take a look at this question, you will understand what I mean:
ENGLISH VERSION:
http://www.siam2nite.com/en/questions/where-to-celebrate-halloween-in-bangkok
THAI VERSION:
http://www.siam2nite.com/th/questions/where-to-celebrate-halloween-in-bangkok
As you can see, each page has a unique title (H1) and introduction text in the correct language (same for the menu, buttons, etc.), but the questions and answers are only available in one language.
Now my question:
I guess Google will see these pages as duplicate content. How should I proceed with this problem?
- Block all Thai links under /th/questions/ in robots.txt
or
- Set a canonical tag pointing to the English versions?
I am not sure whether Google will still index the Thai titles and introduction texts if I set a canonical tag (they contain important Thai keywords).
Would really appreciate your help on this.
Regards,
Menelik
-
Hi John
Sorry for my late response ;-(
Thank you very much for your help. I added a rel=alternate for the Thai version as well. So far it looks good - no duplicated content.
Regards,
Menelik
-
The Google Webmaster Tools setup sounds right to me!
You should set the rel=alternate on all pages that go back and forth, not just the English pages. That way, if Google wants to return a Thai page to an English searcher, it'll know to reference the English page. This is the setup Google recommends in their help documentation.
Don't worry about a new sitemap for the /th/ pages. Your current setup should be fine.
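As a rough illustration of the reciprocal setup described above, here is a minimal Python sketch that builds the same set of rel=alternate tags for every language version of a page, so the English page points at the Thai one and the Thai page points back. BASE and LANGS are taken from the URLs in this thread; the hreflang_links helper is hypothetical and not part of Joomla. The sketch also emits a self-referencing tag for each language, which is an assumption about the recommended practice rather than something stated in this thread.

```python
from html import escape

BASE = "http://www.siam2nite.com"  # site from this thread
LANGS = ("en", "th")               # the two active languages


def hreflang_links(path):
    """Return one <link rel="alternate"> tag per language for a page path.

    `path` is the language-neutral part of the URL, e.g. "/magazine".
    The same list of tags would go into the <head> of every language version.
    """
    path = path.strip("/")
    tags = []
    for lang in LANGS:
        url = f"{BASE}/{lang}/{path}"
        tags.append(
            f'<link rel="alternate" hreflang="{lang}" '
            f'href="{escape(url, quote=True)}">'
        )
    return tags


if __name__ == "__main__":
    for tag in hreflang_links("/magazine"):
        print(tag)
```

Emitting the identical tag list on both /en/... and /th/... pages keeps the two versions in sync, which is the point of the "back and forth" advice.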
-
Hi John
Thank you very much for your answer. I did not know about the rel=alternate tag until today.
Following your advice, I modified the Joomla header, and now every English page /en/... has a rel=alternate link to the Thai version.
For example:
http://www.siam2nite.com/en/magazine now has the following tag:
<link href="http://www.siam2nite.com/th/magazine" hreflang="th" rel="alternate">
Regarding the webmaster help page (the link you mentioned): I do not need to set a tag on the Thai pages targeting the English ones, correct? Just one rel=alternate on the English pages should be enough?
I tried to follow your advice on Google Webmaster Tools as well. My current configuration looks like this:
My old, already existing site:
1. Site: www.siam2nite.com (no geo-targeting)
Today I created a new one:
2. Site: www.siam2nite.com/th/ (geo-targeting: Thailand)
Is this the setup you meant in your answer?
I did not submit a sitemap for the second site, as all links (Thai and English) are already included in the sitemap I use for the first site. Should I split my old sitemap and submit one for each site, containing only the links for the correct language?
Thank you very much for your kind support - really appreciate it.
-
The proper way to handle this is with rel=alternate hreflang tags. These tell Google that the pages are versions of the same content for different languages. See http://support.google.com/webmasters/bin/answer.py?hl=en&answer=189077 for more info. You can place link tags in the head of each page, or do it in your sitemap.
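For the sitemap route, a minimal sketch in Python might look like the following. It assumes the standard sitemap namespace plus xhtml:link entries for the alternates; the build_sitemap helper and the list of paths are hypothetical, and the base URL and language codes are taken from this thread.

```python
import xml.etree.ElementTree as ET

SM_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
XHTML_NS = "http://www.w3.org/1999/xhtml"
BASE = "http://www.siam2nite.com"  # site from this thread
LANGS = ("en", "th")


def build_sitemap(paths):
    """Build a sitemap string with one <url> per language version.

    Each <url> lists every language version (including itself) as an
    xhtml:link alternate.
    """
    ET.register_namespace("", SM_NS)
    ET.register_namespace("xhtml", XHTML_NS)
    urlset = ET.Element(f"{{{SM_NS}}}urlset")
    for path in paths:
        path = path.strip("/")
        for lang in LANGS:
            url = ET.SubElement(urlset, f"{{{SM_NS}}}url")
            loc = ET.SubElement(url, f"{{{SM_NS}}}loc")
            loc.text = f"{BASE}/{lang}/{path}"
            for alt in LANGS:
                ET.SubElement(
                    url,
                    f"{{{XHTML_NS}}}link",
                    {
                        "rel": "alternate",
                        "hreflang": alt,
                        "href": f"{BASE}/{alt}/{path}",
                    },
                )
    return ET.tostring(urlset, encoding="unicode")


if __name__ == "__main__":
    print(build_sitemap(["/magazine", "/questions"]))
```

With this approach the pages themselves need no extra tags, since the alternates live entirely in the sitemap.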
Another thing you can do to help search engines get it right is to set up a profile in Google Webmaster Tools for each of the directories (or at least for the Thai one) and set the geotargeting. Bing prefers that you set the country and language on each page (see here).
If you block the pages with robots.txt or use canonical tags, you're telling Google not to include those pages in SERPs. It sounds like you want the Thai pages to appear in Thai results, and the English pages in English SERPs, so I wouldn't do that.
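To see what the robots.txt option would actually block, here is a small sketch using Python's standard urllib.robotparser. The Disallow rule is a hypothetical version of the one proposed in the question, not something taken from the live site, and the check only shows which URLs a compliant crawler would be allowed to fetch.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt matching the option proposed in the question.
RULES = """\
User-agent: *
Disallow: /th/questions/
"""

parser = RobotFileParser()
parser.parse(RULES.splitlines())


def crawlable(url, agent="Googlebot"):
    """Return True if the given user agent may fetch `url` under RULES."""
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    en = "http://www.siam2nite.com/en/questions/where-to-celebrate-halloween-in-bangkok"
    th = "http://www.siam2nite.com/th/questions/where-to-celebrate-halloween-in-bangkok"
    print("en crawlable:", crawlable(en))
    print("th crawlable:", crawlable(th))
```

Under that rule the Thai question pages could not be crawled at all, so their Thai titles and introduction texts would never be read, which is why blocking them does not fit the goal described in the question.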