Geotargeting duplicate content to different regions - href and canonical tag confusion
-
If you duplicate content onto a sub-folder for say a new US geotargeted site (to target kw spelling differences) and, in addition to GWT geotargeting settings, implement the 'Canonical' and 'Hreflang' tags on these new pages to show G different region and language version (en-us). Then does the original/main site similar pages also need to have canonical and href tags ?
The main/original sites page I don't really want to target a specific country (although existing signals (hosting etc) will be UK (primary target of main site) but pages show up in other country searches too (which we want).
Im presuming fine to leave the original/main site as it currently is although wording in google blog/webmaster central articles etc are a bit confusing hence why im asking for anyone elses opinion/input on this.
Also is there are any benefit (or just best practice) to use 'www.example.com/en-us/...' in the subdirectory URL as opposed to just 'www.example.com/us/'
many thanks in advance to any commentators
-
Many thanks Gianluca !!
-
Hi,
I suggest you both to give a read to this post by DejanSEO, which is quite clear and - IMHO - points to the right interpretation of a somehow confused best practice.
-
Thats what i thought originally but getting confised when i read this page: http://googlewebmastercentral.blogspot.com.au/2011/12/new-markup-for-multilingual-content.html
Specifically this bit:
Annotating pages as substantially similar content
Optionally, for pages that have substantially the same content in the same language and are targeted at multiple countries, you may use the rel="canonical" link element to specify your preferred version. We’ll use that signal to focus on that version in search, while showing the local URLs to users where appropriate. For example, you could use this if you have the same product page in German, but want to target it separately to users searching on the Google properties for Germany, Austria, and Switzerland.
And read in conjunction with this article:
Specifically this bit:
The Effect Of Combining Canonical Tags & Hreflang Tags
Not forgetting that the canonical tags should only be used with content in the same language, when would we use both?
Well firstly, the use of both would involve what I usually call world languages such as English, Spanish, French or Portuguese. These languages are used in many countries and, whilst there are variations between the use of these languages in those countries, the variations are sometimes small.
Additionally, multinational publishers often save costs by using one version of the language for all countries speaking that general language, thus ignoring the regional variations. In other words, for Spain and Mexico, Google is presented with exactly the same content, letter for letter.
The canonical acknowledges that this is the same content. The Hreflang tag identifies which URL should be displayed in different sets of results.
So, in other words, canonical + Hreflang = same content + different URL.
Google knows the content is the same, but displays the correct URL for the Google domain search (e.g. google.com.mx will see the relevant URLs for Mexico displayed in the results).
-
With canonical tag it is a one way road:
You have Page A and Page B with the same content but you want to point out Page A
Page B has a canonical to Page A:
Page B will disappear from the Search Results transferring all the link juice that it has gained to Page A
If you have the same content in different languages then you should use hreflang telling search engines that the two are the same but in other language:
Page A and Page B will have both the following in their headers
This way you will not Geo-Target but Language-Target the two pages ;-)
-
thanks Istvan
but what about whether its a requirement, or suggested best practice, that if you have tags (say canonical) on one set of duplicate pages then you must also add to the other similar/dupe pages (on original site).
Can you have one but not the other without it causing issues or do you need both to stop duplicate issues ?
-
Sorry for responding late, but I somehow forgot to answer this one.
So basically I would consider putting HREFLANG to all of the pages (US, original and any other language). Please note that HREFLANG is connected to optimizing the same content on different languages and not for geo-targeting mainly.
The best example would be Belgium. You can have content in French and in Dutch, still you are optimizing for the same region.
-
Thanks Itsvan, its a good answer and further information! What im really trying to establish though is if its ok to ONLY add canonical & href tags to the US focused subdirectory site ? Do they need to be added to the main site too or can I leave them off (since dont want to geotarget the main site) ? Im confused by wording on google articles/bogs etc on this subject. Since think they say that if you put the tags on a duplicate page you need to also put tags with alternative region/lang tags on the corresponding dupe content page (although i dont want to since want to leave main site free of specific geotargeting). In other words is it a technical requirement/necessity to have tags on both sets of dupe content ?
-
Hi danarchism,
This is what we have on a quite big website:
1. Main site is geo-targeted for a specific country
2. sub-folders of the site are geo-targeted for other countries
3. On each Page in the header we have the HREFLANG to the other 9 languages we use on the site.
Still when we talk about SERP impressions we have many times overlays (Such as the Geo-Targeted content to the Netherlands will appear in the Google.be or Geo-Targeted content to Germany appears in Google.At).
I hope this helped,
Istvan
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate content analysis
Hi all,We have some pages being flagged as duplicates by the google search console. However, we believe the content on these pages is distinctly different (for example, they have completely different search results returned, different headings etc). An example of two pages google finds to be duplicates is below. if anyone can spot what might be causing the duplicate issue here, would very much appreciate suggestions! Thanks in advance.
Technical SEO | | Eric_S
Examples: https://www.vouchedfor.co.uk/IFA-financial-advisor-mortgage/harborne
https://www.vouchedfor.co.uk/accountant/harborne0 -
Non-standard HTML tags in content
I had coded my website's article content with a non-standard tag <cnt>that surrounded other standard tags that contained the article content, I.e.</cnt> , . The whole text was enclosed in a div that used Schema.org markup to identify the contents of the div as the articleBody. When looking at scraped data for stories in Webmaster Tools, the content of the story was there and identified as the articleBody correctly. It's recently been suggested by someone else that the presence of the non-standard <cnt>tags were actually making the content of the article uncrawlable by the Googlebot, this effectively rendering the content invisible. I did not believe this to be true, since the content appeared to be correctly indexed in Webmaster Tools, but for the sake of a test I agreed to removing them. In the last 6 weeks since they were removed, there have been no changes in impressions or traffic from organic search, which leads me to believe that the removal of the <cnt>tags actually had no effect, since the content was already being indexed successfully and nothing else has changed.</cnt></cnt> My question is whether or not an encapsulating non-standard tag as I've described would actually make the content invisible to Googlebot, or if it should not have made any difference so long as the correct Schema.org markup was in place? Thank you.
Technical SEO | | dlindsey0 -
Shopify Canonicals for Tagged Filters
I've been researching this topic endlessly and thought I had arrived at a solution but Screaming Frog indicates my solution was not successful. Problem: I used tags to filter my collections pages. The result, I discovered, was the creation of dozens and dozens, maybe hundreds, of additional collection URLs for each possible permutation of tag filters. I would like to make the collection page URL, with no tag filters, the canonical. Proposed Solution: I found the following code described somewhere as the solution: {% if template contains 'collection' and current_tags %} {% else %} {% endif %} However, I crawled my site with Screaming Frog and found that the canonical link element is still listed as the URL with the tags included. The crawler does recognizes the "noindex" instruction. Any ideas on what the best move is here?
Technical SEO | | vgusvg0 -
How to handle one section of duplicate content
Hi guys, i'm wondering if I can get some best practice advice in preparation for launching our new e-commerce website. For the new website we are creating location pages with a description and things to do which will lead the user to hotels in the location. For each hotel page which relates to the location we will have the same 'Things to do' content. This is what the content will look like on each page: Location page Location title (1-3 words) Location description (150-200 words) Things to do (200-250 words) Reasons to visit location (15 words) Hotel page Hotel name and address (10 words) Short description (25 words) Reasons to book hotel (15 words) Hotel description (100-200 words) Friendly message why to visit (15 words) Hotel reviews feed from trust pilot Types of break and information (100-200 words) Things to do (200-250 words) My question is how much will we penalised for having the same 'Things to do' content on say up to 10 hotels + 1 location page? In an ideal world we want to develop a piece of code which tells search engines that the original content lies on the location page but this will not be possible before we go live. I'm unsure whether we should just go and take the potential loss in traffic or remove the 'Things to do' section on hotel pages until we develop the piece of code?
Technical SEO | | CHGLTD1 -
Shopify Duplicate Content in products
Hello Moz Community, New to Moz and looking forward to beginning my journey towards SEO education and improving our clients' sites. Our client's website is a Shopify store. https://spiritsofthewestcoast.com/ Our first Moz reports show 686 duplicate content issues. I will show the first 4 as examples. https://spiritsofthewestcoast.com/collections/native-earrings-and-studs-in-silver-and-gold/products/haida-eagle-teardrop-earrings https://spiritsofthewestcoast.com/collections/native-earrings-and-studs-in-silver-and-gold/products/haida-orca-silver-earrings https://spiritsofthewestcoast.com/collections/native-earrings-and-studs-in-silver-and-gold/products/silver-oval-earrings https://spiritsofthewestcoast.com/collections/native-earrings-and-studs-in-silver-and-gold/products/haida-eagle-spirit-silver-earrings As you can see, URL titles are unique. But I know that the content in each of those products have very similar product descriptions but not exactly. But since they have been flagged as a site issue by Moz, I am guessing that the content is 95% duplicate. So can a rel=canonical be the right solution for this type of duplicate content? Or should I be considering adding new content to each of 686 products to drop below the 95% threshold? Or another solution that I may not be aware of. Thanks in advance for your assistance and expertise! Sean
Technical SEO | | TheUpdateCompany1 -
How to protect against duplicate content?
I just discovered that my company's 'dev website' (which mirrors our actual website, but which is where we add content before we put new content to our actual website) is being indexed by Google. My first thought is that I should add a rel=canonical tag to the actual website, so that Google knows that this duplicate content from the dev site is to be ignored. Is that the right move? Are there other things I should do? Thanks!
Technical SEO | | williammarlow0 -
Unique Title Tags OR Unique Content?
The content management system for my e-commerce site will not allow me to add both: unique content and unique title tags to each of my product category pages. Since I am forced to choose one or the other...which one is more important for rankings? And please dont answer "get a new CM system"...thats not an option. Thanks
Technical SEO | | rcarll0 -
Magento and Duplicate content
I have been working with Magento over the last few weeks and I am becoming increasingly frustrated with the way it is setup. If you go to a product page and remove the sub folders one by one you can reach the same product pages causing duplicate content. All magento sites seem to have this weakness. So use this site as an example because I know it is built on magento, http://www.gio-goi.com/men/clothing/tees/throve-t-short.html?cid=756 As you remove the tees then the clothing and men sub folders you can still reach the product page. My first querstion is how big an issue is this and two does anyone have any ideas of how to solve it? Also I was wondering how does google treat question marks in urls? Should you try and avoid them unless you are filtering? Thanks
Technical SEO | | gregster10001