Should we use the rel-canonical tag?
-
We have a secure version of our site, as we often gather sensitive business information from our clients.
Our https pages have been indexed as well as our http version.
-
Could it still be a problem to have an http and an https version of our site indexed by Google? Is this seen as being a duplicate site?
-
If so can this be resolved with a rel=canonical tag pointing to the http version?
Thanks
-
-
Agreed - this is generally an issue with relative paths, and job one is to fix it. In most cases, you really don't want these crawled at all. I do think rel=canonical is a good bet here - 301 redirects can get really tricky with http/https, and you can end up creating loops. It can be done right, but it's also easy to screw up, in my experience.
-
-
Yes, having 2 versions of the same content can be seen duplicate content and could cause issues.
-
Yes, include a canonical tag in the header (assuming both http & https pages are close to identical). This will help Google's crawler figure out which version of the page to show in the search results.
-
-
Yes, would suggest canonical as the easiest resolution -
And Irving is right PDF's are most definitely indexed, I am not sure how they are interpreted and if they would specifically count a dup content, but not sure this idea would EVER be something i would suggest as it it seems to have lots of negative repercussions.
I would most definitely agree that relative links is probably your issue, and if you canonical and remove inline relative links and make them http absolute this should resolve itself in a month or so.
-
I disagree
a) pdfs are both indexed AND read by crawlers.
b) even if you don't have navigation to the file sometimes Google can find it if it's in a folder that you are not blocking in robots.txt.
c) if someone links to it once on the web it's getting crawled and indexed.
If you have a https section that content should be behind a login and not accessible to the engines. Your problem sounds like your https pages have relative links on them and Google is crawling the https page and then following the relative links staying on https so you need to fix that and this will fix your site getting http pages indexed as dupe https.
Absolute http canonical tags will help but it not the solution. you need to fix the https leaking on your secure pages.
.
-
You can "no-index" them within the html - but if you really want a fun trick - when and if you are not able to get around mass amount of duped content and it isn't for the sake of rankings - example, MLS listings, etc
Change the content into a pdf - or file format - thus not being able to be crawled.
Once again - it will NOT be crawled - so don't go doing this to an entire site
But maybe your clients confidential data - can be submitted this way - and it will not get indexed - except for the subpage - but then you can no index that subpage.
Hope this helps.
Your pal
Chenzo
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Canonical tags for duplicate listings
Hi there, We are restructuring a website. The website originally lists jobs that will have duplicate content. We have tried to ask the client not to use duplicates but apparently their industry is not something they can control. The recommendations I had is to have categories (which will have the idea description for a group of jobs), and the job listing pages. The job listing pages will then have canonical tags pointing to the category page as the primary URL to be indexed. Another opinion came from a third party that this can be seen as if we are tricking Google and would get penalised, **Is that even true? **Why would Google penalise for this if thats their recommendations in the first place? This third party suggested using nofollow on the links to these listings, or even not not index them all together. What are your thoughts? Thanks Issa
Intermediate & Advanced SEO | | iQi0 -
How use Rel="canonical" for our Website
How is the best way to use Rel="canonical" for our website www.ofertasdeemail.com.br, for we can say goodbye for duplicated pages? I appreciate for every help. I also hope to contribute to the SEOmoz community. Sincerely,
Intermediate & Advanced SEO | | ZZNINTERNETMEDIAGROUP
Amador Goncalves0 -
Merging blog post tags within static page - Rel = Canonical?
As a blogger, I use a combination of categories and tags in order to organize my content. I do index tags because they've been very powerful for SEO purposes, but there are certain keywords in which I'd like to be able to create an entirely separate static page with the tagged posts merged onto it. So in other words, this is what I'd like the landing page to be: www.website.com/keyword as opposed to www.website.com/tags/keyword Because of this, I'm uncertain what I need to do with that tag page. With this, I would assume that www.website.com/tags/keywords needs to be indexed, but what would be the wise thing to do? Do I place a rel=canonical on www.website.com/tags/keyword to the static page? Do I do a simple re-direct? Do I just leave it indexed? Will it dilute my desired landing page? Would appreciate all comments and thoughts. Thank you!
Intermediate & Advanced SEO | | longview0 -
Real impact of canonical links?
I am responsible for 2 e-commerce websites. SEO Moz and Google Web Master tools both inform me regularly that on both sites there are many instances of duplicate titles, headings, decriptions and page content. Obviously from an SEO point of view I am more than a little concerned about this! Out product pages struggle to perform strongly despite the fact that our website is of a decent quality and we are leaders in our field. Our competitors rank above us when they add a product page, whereas we normal flit in between 8-10 or on the 2nd SERP. I know it is hard without viewing the site, but is duplicate content likely to be a strong, leading factor in this? I think it is, but want to put together a business case to spend the cash to sort it out....just need someone confirmation that this is worth sorting as a priority. Here are 2 examples of what I mean: 1) Category pages www.exampledomain.co.uk/category1.aspx We have filters on our category page (so the customer can sort products based on their price, colour, size etc.). When filters are used a new URL is generared. www.exampledomain.co.uk/category1.aspx?prices=0||10 www.exampledomain.co.uk/category1.aspx?prices=10||20 The content, titles, description is the same although the links are different. Do I need to set up a canonical tag on the page that reads: 2) Product pages Product pages on the websites have different URLs depending on how to arrive on them. You get 1 URL if you navigated to the page via the website navigation, but you get another different URL if you used the website search functionality to find the page. Example: Search link: www.exampledomain.co.uk/category1/Product1.aspx Navigation link: www.exampledomain.co.uk/12345/category1/Product1.aspx Again, do I need to set up a canonical tag for 1 of these link types so that the link benefit is not shared over 2 pages? Any feedback would be welcome! At the moment the ability to add canonical tags is locked down by our CMS (I know, rubbish!)...so website development would be needed - hence the need for a business case!
Intermediate & Advanced SEO | | DHS_SH0 -
Canonical vs noindex for blog tags
Our blog started to user tags & I know this is bad for Panda, but our product team wants use them for user experience. Should we canonizalize these tags to the original blog URL or noindex them?
Intermediate & Advanced SEO | | nicole.healthline0 -
What is the best canonical url to use for a product page?
I just helped a client redesign and launch a new website for their organic skin care company (www.hylunia.com). The site is built in Magento which by default creates MANY urls for each product. Which of these two do you think would be the best to use as the canonical version? http://www.hylunia.com/pure-hyaluronic-acid-solution
Intermediate & Advanced SEO | | danielmoss
or http://www.hylunia.com/products/face-care/facial-moisturizers/pure-hyaluronic-acid-solution ? I'm leaning on the latter, because it makes sense to me to have the breadcrumbs match the url string, and also it seems having more keywords in the url would help. However, it's obviously a very long url, and there might be some benefits to using the shorter version that I'm not aware of. Thanks in advance for sharing your thoughts. Best, Daniel0 -
Canonical Meta Tag
Can someone explain how this works and how necessary is it? For example, I have a new client, who is ranking WITHOUT the www in their domain, but they have a good deal of backlinks already that have www in it. When I set up google webmaster tools I made 2, one for WWW and one for WITHOUT and there are diffenet numbers of backlinks for each. I have no idea what do about this or if I should even do anything. Thanks
Intermediate & Advanced SEO | | TheGrid0 -
Rel=nofollow and SSL Certs
Will I lose or gain seo benefit from using rel=nofollow on my SSL certificate? every page on the site refers (links) to the cert and the server call to display the cert adds over 500ms to my page load speeds. <updated question=""> Is there a way to display the cert to cut down on load speeds? Also, would Google discount or penalize the site if the cert were nofollowed?</updated> Thoughts? Thanks in advance!
Intermediate & Advanced SEO | | AnthonyYoung0