Duplicate content and canonicalization confusion
-
Hello,
http://bit.ly/1b48Lmp and http://bit.ly/1BuJkUR pages have same content and their canonical refers to the page itself. Yet, they rank in search engines. Is it because they have been targeted to different geographical locations? If so, still the content is same.
Please help me clear this confusion.
Regards
-
I agree with you. It's all very confusing and little details make a BIG difference. Thanks for sticking with this.
-
Thanks a ton Donna for looking into the issue and helping at this level. I highly appreciate it
Their canonical tags confused me. As you have mentioned, the tags should have been one, I don't know why they are using two different ones. Probably, they have set the different geographic targets in Google Webmaster Tools and with the minor content variation and canonical tags, they want to signal Google to treat both the pages differently. I mean it's a big name in the world of ERP. They can't mess up with the canonical tags.
What do you think?
-
Okay. Let's start over looking at it from a goal perspective. I compared the two pages. Here is the difference between the two in terms of page text, highlighted in yellow - http://63.249.66.211/comparison.html. The differences are in the URL, the phone numbers at the top, a word here and there in the middle, and the 2nd block of text and photo under "Explore Our Solutions".
The first page, which I'll call India, has a canoncial tag pointing to itself. (http://www.sap.com/india/pc/bp/erp.html"/>) .
The second page, which I'll call UK, has a canoncial tag, also pointing to itself. (http://www.sap.com/uk/pc/bp/erp.html"/>).
- If you want both pages to rank and have authority, then you use the canonical tag. You need to use the same canonical tag on both pages. Right now they're different. That will essentially tell Google to treat the two pages as one; to show one or the other in search results, but considate their combined SEO value into one for ranking purposes.
- If you only want one page to rank, then noindex the other.
Does that make more sense?
-
Thanks for the reply Donna but my question is bit different. Could you please take a look at the rel canonical tag of the urls I posted. The content on both the pages is 100% same. The only difference is that they are targeted at different geographic locations. The canonical tags point to the page itself and not any master page.
-
This might help Shailendra - https://support.google.com/webmasters/answer/139066?hl=en. Skim down to (or search for) the part beginning with "This indicates the preferred URL", about half-way down the page.
Bottom line, Google attempts to respect canonical tags but it's no guarantee. Increase your chances by using "absolute paths rather than relative paths with the
rel="canonical"
link element". -
Thanks everyone for the response! But I am still confused. The two links that I have posted in my initial question have exactly the same content on both the pages (targeted at different geographic locations) and their canonical tags do not refer to any master page but to them itself, i.e. canonical tag on page A refers to A and canonical tag on page B refers to B. Please take a look at both the pages: http://bit.ly/1b48Lmp and http://bit.ly/1BuJkUR
Regards
-
Canonical pages still get indexed at Google's discretion.
A related question was asked in March 2013 that I think, explains what you're seeing. I've cut and pasted the relevant part below. Mememax is the author.
"Normally the only thing which will prevent a page from ranking is noindex tag. If you don't want to have it indexed just noindex it, if that page has been laready indexed, put the noindex tag and delete from index using GWT option.
Concerning the canonical tag thing, it will consolidate the seo value in one page but it won't prevent those page to appear in rankings, however you may have two cases:
-
the two or more pages are identical. In that case google may accept the canonicalization and show always the original page.
-
the two or more pages are slightly different, it's the case of paginated pages which are canonicalized using rel next/prev. In that sense the whole value will be consolidated in page 1 but then the page which will be shown in the rankings will be the one which responds to that query, for example if someone is looking for blue glass, google will return the page which shows blue glass listing if that's different from the first one."
-
-
Yes, if they were directly competing against each other, you'd expect one of them to drop out of the rankings. What are they both ranking for?
If they are both showing up in the same search, my guess would be that they are very new and Google hasn't noticed the duplication.
But if you see the ranking in different searches (like Google UK and Google India), then you are probably right, Google does not see them as duplicate since they are being shown to different audiences.
-
Hi,
I am sharing two Matt cutts video on this to clear your confusion.I hope it helps.
https://www.youtube.com/watch?v=GFf1gwr6HJw
Thanks
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
When is Duplicate Content Duplicate Content
Hi, I was wondering exactly when duplicate content is duplicate content? Is it always when it is word-for-word or if it is similar? For example, we currently have an information page and I would like to add a FAQ to the website. There is, however, a crossover with the content and some of it is repeated. However, it is not written word for word. Could you please advise me? Thanks a lot Tom
Technical SEO | | National-Homebuyers0 -
Duplicate Content Mystery
Hi Moz community! I have an ongoing duplicate mystery going on here and I'm hoping someone here can answer my question. We have an Ecommerce site that has a variety of product pages and category pages. There are Rel canonicals in place, along with parameters in GWT, and there are also URL rewrites. Here are some scenarios, maybe you can give insight as to what’s exactly going on and how to fix it. All the duplicates look to be coming from category pages specifically. For example:
Technical SEO | | Ecom-Team-Access
This link re-writes: http://www.incipio.com/cases/tablet-cases/amazon-kindle-cases-sleeves.html?cat=407&color=152&price=20- To: http://www.incipio.com/cases/tablet-cases/amazon-kindle-cases-sleeves.html The rel canonical tag looks like this: http://www.incipio.com/cases/tablet-cases/amazon-kindle-cases-sleeves.html" /> The CONTENT is different, but the URLs are the same. It thinks that the product category view is the same as the all products view, even though there is a canonical in there telling it which one is the original. Some of them don’t have anything to do with each other. Take a look: Link identified as duplicate: http://www.incipio.com/cases/smartphone-cases/htc-smartphone-cases/htc-windows-phone-8x-cases.html?color=27&price=20- Link this is a duplicate of: http://www.incipio.com/cases/macbook-cases/macbook-pro-13in-cases.html Any idea as to what could be happening here?0 -
Duplicate Content Question
I have a client that operates a local service-based business. They are thinking of expanding that business to another geographic area (a drive several hours away in an affluent summer vacation area). The name of the existing business contains the name of the city, so it would not be well-suited to market 'City X' business in 'City Y'. My initial thought was to (for the most part) 'duplicate' the existing site onto a new site (brand new root domain). Much of the content would be the exact same. We could re-word some things so there aren't entire lengthy paragraphs of identical info, but it seems pointless to completely reinvent the wheel. We'll get as creative as possible, but certain things just wouldn't change. This seems like the most pragmatic thing to do given their goals, but I'm worried about duplicate content. It doesn't feel as though this is spammy though, so I'm not sure if there's cause for concern.
Technical SEO | | stevefidelity0 -
Duplicate Content based on www.www
In trying to knock down the most common errors on our site, we've noticed we do have an issue with dupicate content; however, most of the duplicate content errors are due to our site being indexed with www.www and not just www. I am perplexed as to how this is happening. Searching through IIS, I see nothing that would be causing this, and we have no hostname records setup that are www.www. Does anyone know of any other things that may cause this and how we can go about remedying it?
Technical SEO | | CredA0 -
Help With Joomla Duplicate Content
Need another set of eyes on my site from someone with Joomla experience. I'm running Joomla 2.5 (latest version) and SEOmoz is giving my duplicate content errors on a lot of my pages. I checked my sitemap, I checked my menus, and I checked my links, and I can't figure out how SEOmoz is finding the alternate paths to my content. Home page is: http://www.vipfishingcharters.com/ There's only one menu at the top. Take the first link "Dania Beach" under fishing charters for example. This generates the SEF url: http://www.vipfishingcharters.com/fishing-charters/broward-county/dania-beach-fishing-charters-and-fishing-boats.html Somehow SEOmoz (and presumably all other robots) are finding duplicate content at: http://www.vipfishingcharters.com/broward-county/dania-beach-fishing-charters-and-fishing-boats.html SEOmoz says the referrer is the homepage/root. The first URL is constructed using the menu aliases. The second one is constructed using the Joomla category and article alias. Where is it getting this and how can I stop it? <colgroup><col width="601"></colgroup>
Technical SEO | | NoahC0 -
Duplicate content
Greetings! I have inherited a problem that I am not sure how to fix. The website I am working on had a 302 redirect from its original home url (with all the link juice) to a newly designed page (with no real link juice). When the 302 redirect was removed, a duplicate content problem remained, since the new page had already been indexed by google. What is the best way to handle duplicate content? Thanks!
Technical SEO | | shedontdiet0 -
The Bible and Duplicate Content
We have our complete set of scriptures online, including the Bible at http://lds.org/scriptures. Users can browse to any of the volumes of scriptures. We've improved the user experience by allowing users to link to specific verses in context which will scroll to and highlight the linked verse. However, this creates a significant amount of duplicate content. For example, these links: http://lds.org/scriptures/nt/james/1.5 http://lds.org/scriptures/nt/james/1.5-10 http://lds.org/scriptures/nt/james/1 All of those will link to the same chapter in the book of James, yet the first two will highlight the verse 5 and verses 5-10 respectively. This is a good user experience because in other sections of our site and on blogs throughout the world webmasters link to specific verses so the reader can see the verse in context of the rest of the chapter. Another bible site has separate html pages for each verse individually and tends to outrank us because of this (and possibly some other reasons) for long tail chapter/verse queries. However, our tests indicated that the current version is preferred by users. We have a sitemap ready to publish which includes a URL for every chapter/verse. We hope this will improve indexing of some of the more popular verses. However, Googlebot is going to see some duplicate content as it crawls that sitemap! So the question is: is the sitemap a good idea realizing that we can't revert back to including each chapter/verse on its own unique page? We are also going to recommend that we create unique titles for each of the verses and pass a portion of the text from the verse into the meta description. Will this perhaps be enough to satisfy Googlebot that the pages are in fact unique? They certainly are from a user perspective. Thanks all for taking the time!
Technical SEO | | LDS-SEO0 -
Why are my pages getting duplicate content errors?
Studying the Duplicate Page Content report reveals that all (or many) of my pages are getting flagged as having duplicate content because the crawler thinks there are two versions of the same page: http://www.mapsalive.com/Features/audio.aspx http://www.mapsalive.com/Features/Audio.aspx The only difference is the capitalization. We don't have two versions of the page so I don't understand what I'm missing or how to correct this. Anyone have any thoughts for what to look for?
Technical SEO | | jkenyon0