Duplicate content and canonicalization confusion
-
Hello,
http://bit.ly/1b48Lmp and http://bit.ly/1BuJkUR pages have same content and their canonical refers to the page itself. Yet, they rank in search engines. Is it because they have been targeted to different geographical locations? If so, still the content is same.
Please help me clear this confusion.
Regards
-
I agree with you. It's all very confusing and little details make a BIG difference. Thanks for sticking with this.
-
Thanks a ton Donna for looking into the issue and helping at this level. I highly appreciate it
Their canonical tags confused me. As you have mentioned, the tags should have been one, I don't know why they are using two different ones. Probably, they have set the different geographic targets in Google Webmaster Tools and with the minor content variation and canonical tags, they want to signal Google to treat both the pages differently. I mean it's a big name in the world of ERP. They can't mess up with the canonical tags.
What do you think?
-
Okay. Let's start over looking at it from a goal perspective. I compared the two pages. Here is the difference between the two in terms of page text, highlighted in yellow - http://63.249.66.211/comparison.html. The differences are in the URL, the phone numbers at the top, a word here and there in the middle, and the 2nd block of text and photo under "Explore Our Solutions".
The first page, which I'll call India, has a canoncial tag pointing to itself. (http://www.sap.com/india/pc/bp/erp.html"/>) .
The second page, which I'll call UK, has a canoncial tag, also pointing to itself. (http://www.sap.com/uk/pc/bp/erp.html"/>).
- If you want both pages to rank and have authority, then you use the canonical tag. You need to use the same canonical tag on both pages. Right now they're different. That will essentially tell Google to treat the two pages as one; to show one or the other in search results, but considate their combined SEO value into one for ranking purposes.
- If you only want one page to rank, then noindex the other.
Does that make more sense?
-
Thanks for the reply Donna but my question is bit different. Could you please take a look at the rel canonical tag of the urls I posted. The content on both the pages is 100% same. The only difference is that they are targeted at different geographic locations. The canonical tags point to the page itself and not any master page.
-
This might help Shailendra - https://support.google.com/webmasters/answer/139066?hl=en. Skim down to (or search for) the part beginning with "This indicates the preferred URL", about half-way down the page.
Bottom line, Google attempts to respect canonical tags but it's no guarantee. Increase your chances by using "absolute paths rather than relative paths with the
rel="canonical"
link element". -
Thanks everyone for the response! But I am still confused. The two links that I have posted in my initial question have exactly the same content on both the pages (targeted at different geographic locations) and their canonical tags do not refer to any master page but to them itself, i.e. canonical tag on page A refers to A and canonical tag on page B refers to B. Please take a look at both the pages: http://bit.ly/1b48Lmp and http://bit.ly/1BuJkUR
Regards
-
Canonical pages still get indexed at Google's discretion.
A related question was asked in March 2013 that I think, explains what you're seeing. I've cut and pasted the relevant part below. Mememax is the author.
"Normally the only thing which will prevent a page from ranking is noindex tag. If you don't want to have it indexed just noindex it, if that page has been laready indexed, put the noindex tag and delete from index using GWT option.
Concerning the canonical tag thing, it will consolidate the seo value in one page but it won't prevent those page to appear in rankings, however you may have two cases:
-
the two or more pages are identical. In that case google may accept the canonicalization and show always the original page.
-
the two or more pages are slightly different, it's the case of paginated pages which are canonicalized using rel next/prev. In that sense the whole value will be consolidated in page 1 but then the page which will be shown in the rankings will be the one which responds to that query, for example if someone is looking for blue glass, google will return the page which shows blue glass listing if that's different from the first one."
-
-
Yes, if they were directly competing against each other, you'd expect one of them to drop out of the rankings. What are they both ranking for?
If they are both showing up in the same search, my guess would be that they are very new and Google hasn't noticed the duplication.
But if you see the ranking in different searches (like Google UK and Google India), then you are probably right, Google does not see them as duplicate since they are being shown to different audiences.
-
Hi,
I am sharing two Matt cutts video on this to clear your confusion.I hope it helps.
https://www.youtube.com/watch?v=GFf1gwr6HJw
Thanks
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate Content from long Site Title
Hello! I have a number of "Duplicate Title Errors" as my website has a long Site Title: Planit NZ: New Zealand Tours, Bus Passes & Travel Planning. Am I better off with a short title that is simply my website/business name: Planit NZ My thought was adding some keywords might help with my rankings. Thanks Matt
Technical SEO | | mkyhnn0 -
Development Website Duplicate Content Issue
Hi, We launched a client's website around 7th January 2013 (http://rollerbannerscheap.co.uk), we originally constructed the website on a development domain (http://dev.rollerbannerscheap.co.uk) which was active for around 6-8 months (the dev site was unblocked from search engines for the first 3-4 months, but then blocked again) before we migrated dev --> live. In late Jan 2013 changed the robots.txt file to allow search engines to index the website. A week later I accidentally logged into the DEV website and also changed the robots.txt file to allow the search engines to index it. This obviously caused a duplicate content issue as both sites were identical. I realised what I had done a couple of days later and blocked the dev site from the search engines with the robots.txt file. Most of the pages from the dev site had been de-indexed from Google apart from 3, the home page (dev.rollerbannerscheap.co.uk, and two blog pages). The live site has 184 pages indexed in Google. So I thought the last 3 dev pages would disappear after a few weeks. I checked back late February and the 3 dev site pages were still indexed in Google. I decided to 301 redirect the dev site to the live site to tell Google to rank the live site and to ignore the dev site content. I also checked the robots.txt file on the dev site and this was blocking search engines too. But still the dev site is being found in Google wherever the live site should be found. When I do find the dev site in Google it displays this; Roller Banners Cheap » admin dev.rollerbannerscheap.co.uk/ A description for this result is not available because of this site's robots.txt – learn more. This is really affecting our clients SEO plan and we can't seem to remove the dev site or rank the live site in Google. In GWT I have tried to remove the sub domain. When I visit remove URLs, I enter dev.rollerbannerscheap.co.uk but then it displays the URL as http://www.rollerbannerscheap.co.uk/dev.rollerbannerscheap.co.uk. I want to remove a sub domain not a page. Can anyone help please?
Technical SEO | | SO_UK0 -
Duplicate Page content / Rel=Cannonical
My SEO Moz crawl is showing duplicate content on my site. What is showing up are two articles I submitted to Submit your article (article submission service). I put their code in to my pages i.e. " <noscript><b>This article will only display in JavaScript enabled browsers.</b></noscript> " So do I need to delete these blog posts since they are showing up as dup content? I am having a difficult time understanding rel=cannonical. Isn't this for dup content on within one site? So I could not use rel="cannonical" in this instance? What is the best way to feature an article or press release written for another site, but that you want your clients to see? Rewritting seem ridiculous for a small business like ours. Can we just present the link? Thank you.
Technical SEO | | RoxBrock0 -
Testing for duplicate content and title tags
Hi there, I have been getting both Duplicate Page content and Duplicate Title content warnings on my crawl diagnostics report for one of my campaigns. I did my research, and implemented the preferred domain setting in Webmaster Tools. This did not resolve the crawl diagnostics warnings, and upon further research I discovered the preferred domain would only be noted by Google and not other bots like Roger. My only issue was that when I ran an SEOmoz crawl test on the same domain, I saw none of the duplicate content or title warnings yet they still appear on my crawl diagnostics report. I have now implemented a fix in my .htaccess file to 301 redirect to the www. domain. I want to check if it's worked, but since the crawl test did not show the issue last time I don't think I can rely on that. Can you help please? Thanks, Claire
Technical SEO | | SEOvet0 -
Duplicate Content on Product Pages
Hello I'm currently working on two sites and I had some general question's about duplicate content. For the first one each page is a different location, but the wording is identical on each; ie it says Instant Remote Support for Critical Issues, Same Day Onsite Support with a 3-4 hour response time, etc. Would I get penalized for this? Another question i have is, we offer Antivirus support for providers ie Norton, AVG,Bit Defender etc. I was wondering if we will get penalized for having the same first paragraph with only changing the name of the virus provider on each page? My last question is we provide services for multiple city's and towns in various states. Will I get penalized for having the same content on each page, such as towns and producuts and services we provide? Thanks.
Technical SEO | | ilyaelbert0 -
How to prevent duplicate content at a calendar page
Hi, I've a calender page which changes every day. The main url is
Technical SEO | | GeorgFranz
/calendar For every day, there is another url: /calendar/2012/09/12
/calendar/2012/09/13
/calendar/2012/09/14 So, if the 13th september arrives, the content of the page
/calendar/2012/09/13
will be shown at
/calendar So, it's duplicate content. What to do in this situation? a) Redirect from /calendar to /calendar/2012/09/13 with 301? (but the redirect changes the day after to /calendar/2012/09/14) b) Redirect from /calendar to /calendar/2012/09/13 with 302 (but I will loose the link juice of /calendar?) c) Add a canonical tag at /calendar (which leads to /calendar/2012/09/13) - but I will loose the power of /calendar (?) - and it will change every day... Any ideas or other suggestions? Best wishes, Georg.0 -
Duplicate content error from url generated
We are getting a duplicate content error, with "online form/" being returned numerous times. Upon inspecting the code, we are calling an input form via jQuery which is initially called by something like this: Opens Form Why would this be causing it the amend the URL and to be crawled?
Technical SEO | | pauledwards0 -
Canonicalization - duplicate homepage issues
I'm trying to work out the best way to resolve an issue where Google is seeing duplicate versions of a homepage, i.e. http://www.home.co.uk/Home.aspx and http://www.home.co.uk/ The site runs on Windows servers. I've tried implementing redirects for homepages before (for a different site on a linux server) and ended up with a loop, so although I know I can read lots of info (as I have been doing) and try again, I am really concerned about getting it wrong. Can anyone give me some advice on the best way to make Google take just one version of the page? Obviously link juice is also being diluted so I need to get this sorted asap. Thanks.
Technical SEO | | travelinnovations0