Should we use the rel-canonical tag?
-
We have a secure version of our site, as we often gather sensitive business information from our clients.
Our https pages have been indexed as well as our http version.
-
Could it still be a problem to have an http and an https version of our site indexed by Google? Is this seen as being a duplicate site?
-
If so can this be resolved with a rel=canonical tag pointing to the http version?
Thanks
-
-
Agreed - this is generally an issue with relative paths, and job one is to fix it. In most cases, you really don't want these crawled at all. I do think rel=canonical is a good bet here - 301 redirects can get really tricky with http/https, and you can end up creating loops. It can be done right, but it's also easy to screw up, in my experience.
-
-
Yes, having 2 versions of the same content can be seen duplicate content and could cause issues.
-
Yes, include a canonical tag in the header (assuming both http & https pages are close to identical). This will help Google's crawler figure out which version of the page to show in the search results.
-
-
Yes, would suggest canonical as the easiest resolution -
And Irving is right PDF's are most definitely indexed, I am not sure how they are interpreted and if they would specifically count a dup content, but not sure this idea would EVER be something i would suggest as it it seems to have lots of negative repercussions.
I would most definitely agree that relative links is probably your issue, and if you canonical and remove inline relative links and make them http absolute this should resolve itself in a month or so.
-
I disagree
a) pdfs are both indexed AND read by crawlers.
b) even if you don't have navigation to the file sometimes Google can find it if it's in a folder that you are not blocking in robots.txt.
c) if someone links to it once on the web it's getting crawled and indexed.
If you have a https section that content should be behind a login and not accessible to the engines. Your problem sounds like your https pages have relative links on them and Google is crawling the https page and then following the relative links staying on https so you need to fix that and this will fix your site getting http pages indexed as dupe https.
Absolute http canonical tags will help but it not the solution. you need to fix the https leaking on your secure pages.
.
-
You can "no-index" them within the html - but if you really want a fun trick - when and if you are not able to get around mass amount of duped content and it isn't for the sake of rankings - example, MLS listings, etc
Change the content into a pdf - or file format - thus not being able to be crawled.
Once again - it will NOT be crawled - so don't go doing this to an entire site
But maybe your clients confidential data - can be submitted this way - and it will not get indexed - except for the subpage - but then you can no index that subpage.
Hope this helps.
Your pal
Chenzo
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Rel=Canonical Vs. 301 for blog articles
Over the last few years, my company has acquired numerous different companies -- some of which were acquired before that. Some of the products acquired were living on their previous company's parent site vs. having their own site dedicated to the product. The decision has been made that each product will have their own site moving forward. Since the product pages, blog articles and resource center landing pages (ex. whitepapers LPs) were living on the parent site, I'm struggling with the decision to 301 vs. rel=canonical those pages (with the new site being self canonicaled). I'm leaning toward take-down and 301 since rel=canonicals are simply suggestions to Google and a new domain can get all the help it can to start ranking. Are there any cons to doing so?
Intermediate & Advanced SEO | | mfcb0 -
Long Title Tags
Hi guys, We have product e-commerce title tags which are over 60 characters - around 80 plus. The reason we added them in there is to incorporate
Intermediate & Advanced SEO | | seowork214
more information for Google. The format of these title tags are: Name + Colour + Rug Type + Origin Name = for people searching for the name of the rug
Color = people searching for a specific color
Type = The type of rug (e.g. normal or designer)
Origin = Where the rug is for. So this title will cover people searching for: People searching for designer rugs, the specific colour and also where it comes from. This then results in the title tag going way over 60 characters - around 80-90 characters. -- Would it be wise to try and shrink it down to under 60 characters, and what would be a good approach to do this? Cheers.0 -
Pagination with rel=“next” and rel=“prev”
Hi Guys, Just wondering can anyone recommend any tools or good ways to check if rel=“next” and rel=“prev” attributes have been implemented properly across a large ecommerce based site? Cheers. rel=“next” and rel=“prev”
Intermediate & Advanced SEO | | jayoliverwright0 -
Rel=“next” and rel=“prev” on category pages and galleries
Hi there, I am running an WordPress blog and i was looking a couple of days on the source code of the categories. From a SEO point of view would make sense to include into the header of the categories the rel=“next” and rel=“prev” tags ? Same question would be for the image galleries . Should i add the rel=“next” and rel=“prev” tags on the image galleries ? So for example if i upload 10 images to a gallery, the user will check the post and see the gallery. It will click on an image and will redirected on the attachment page of that displays that image > from where he can click next to see the next image or prev for the previous image. Therefore should i add the rel=“next” and rel=“prev” tags here too ? Many thanks
Intermediate & Advanced SEO | | MirceazetelSerafim1 -
Using Canonical URL to poin to an external page
I was wondering if I can use a canonical URL that points to a page residing on external site? So a page like:
Intermediate & Advanced SEO | | llamb
www.site1.com/whatever.html will have a canonical link in its header to www.site2.com/whatever.html. Thanks.0 -
Canonical tag + HREFLANG vs NOINDEX: Redundant?
Hi, We launched our new site back in Sept 2013 and to control indexation and traffic, etc we only allowed the search engines to index single dimension pages such as just category, brand or collection but never both like category + brand, brand + collection or collection + catergory We are now opening indexing to double faceted page like category + brand and the new tag structure would be: For any other facet we're including a "noindex, follow" meta tag. 1. My question is if we're including a "noindex, follow" tag to select pages do we need to include a canonical or hreflang tag afterall? Should we include it either way for when we want to remove the "noindex"? 2. Is the x-default redundant? Thanks for any input. Cheers WMCA
Intermediate & Advanced SEO | | WMCA0 -
Canonical tags and product descriptions
I just wanted to check what you guys thought of this strategy for duplicate product descriptions. A sample product is a letter bracelet - a, b, c etc so there are 26 products with identical descriptions. It is going to be extremely difficult to come up with 25 new unique descriptions so with recommendation i'm looking to use the canonical tag. I can't set any to no-index because visitors will look for explicit letters. Because the titles only differ by the letter then a search for either letter bracelet letter a bracelet letter i bracelet will just return results for 'letter bracelet' due to stop words unless the searcher explicitly searches for 'letter "a" bracelet. So I reckon I can make 4 new unique descriptions. I research what are the most popular letters picking 5 from the top (excluding 'a' and 'i'). Equally share the remaining letters between those 5 and with each group set a canonical tag pointing to the primary letter of that group. Does this seem a sensible thing to do?
Intermediate & Advanced SEO | | MickEdwards0 -
Hierachy in a Title Tag Needed?
When coding the title tags of a website, is it important to note the hierarchy of the website so the search engine can find that page? So, for example, the title tag would be, for a subcategory: Webpage Subcategory>Category>Website name Does this help the search engine rankings at all? Or can the search engines figure out the hierarchy by reading a sitemap or the HTML readable navigation? Then you could focus on the descriptive keywords of the subcategory page for the title tag. Also, should you always include the site name in the title tag?
Intermediate & Advanced SEO | | ConnellyPartners0