How rel=canonical works with index, noindex ?
-
Hello all,
I had always wondered how the index,noindex affects to the canonical. And also if the canonical post should be included in the sitemap or not.
I posted this
http://www.comparativadebancos.co...
and with a rel=canonical to this that was published at the beginning of the month
http://www.comparativadebancos.co...
but then I have the first one in google
http://www.google.com/search?aq=f...
May be this is evident for you but, what is really doing the canonical? If I publish something with the canonical pointing to another page, will it still be indexed by google but with no penalty for duplicate content? Or the usual behaviour should have been to havent indexed the first post but just the second one?
Should I also place a noindex in the first post in addition to the canonical?
What am I missing here?
thanks
-
Antonio,
I came into this question a little late so I'm not sure how it was back when you asked it, but right now the problem I see is that the page that does exist ( http://www.comparativadebancos.com/mejores-depositos-bancarios-de-marzo-de-2011/ ) has a rel canonical tag pointing to the page that doesn't exist ( http://www.comparativadebancos.com/depositos/marzo/ ), which returns a 404 response code.
I think right now the best thing you can do would be to change the rel canonical tag on /mejores-depositos-bancarios-de-marzo-de-2011/ to be http://www.comparativadebancos.com/mejores-depositos-bancarios-de-marzo-de-2011/ .
-
I im saying that it is important to Google to tell them more what you want to use as your content without possible parameter "/" "www" adding a duplicate content penalty to your website.
-
Hi,
I agree that it will not help you to too much with stolen content. Unless Google has indexed you 1st they would probably give you 1st rights to the disputed content. The reason I believe you are getting with such good results on Google a non-indexed URL or what should be nonindexed is Google indexes everything regardless and from what Matt Cutts said "According to Google, the canonical link element is not considered to be a directive, but a hint that the web crawler will "honor strongly" "
my belief is Google is throwing more honor to dealing with the canonical.
I hope I was of some help.
Sincerely,
Thomas Zickell
-
Blueprint, as far as I understand it can't really be used to prevent people stealing your content because you need to have to similar versions and place the tag pointing to the one that is of lesser value or that you don't want to come up in place of the original. Or are you saying if you find some of your content elsewhere offsite you can place a canonical link to it, and this will tell the spiders it is your content rather than theres?
Antonio, if you have placed the tag on the newer page pointing to the older page you are telling the spiders that the newer page is the preferred/more original content.
-
I would say that rel=canonical is one of the single most vital parts of a website no matter how it's Written or hosted all must be set up to appropriately take traffic and simply tell Google I'm not trying to duplicate my content here is my <link rel="canonical" href="http://www.example.com/" /> and that way if anyone does haven't come across your content and try to make it their own they will be the ones penalized for stealing it not you. Always put this tag in the page that you have created and the one that you want Google to understand is your copy of your website content here is some info from Matt Cutts at Google as well as Wikipedia hope I am of help
http://www.mattcutts.com/blog/rel-canonical-html-head/
A canonical link element is an HTML element that helps webmasters prevent duplicate content issues by specifying the "canonical", or "preferred", version of a web page<sup id="cite_ref-googleblog_0-0" class="reference">[1]</sup><sup id="cite_ref-1" class="reference">[2]</sup><sup id="cite_ref-2" class="reference">[3]</sup> as part of search engine optimization.
Duplicate content issues occur when the same content is accessible from multiple URLs.<sup id="cite_ref-3" class="reference">[4]</sup> For example, <tt>http://www.example.com/page.html</tt> would be considered by search engines to be an entirely different page to<tt>http://www.example.com/page.html?parameter=1</tt>, even though both URLs return the same content. Another example is essentially the same (tabular) content, but sorted differently.
In February 2009, Google, Yahoo and Microsoft announced support for the canonical link element, which can be inserted into the section of a web page, to allow webmasters to prevent these issues.<sup id="cite_ref-4" class="reference">[5]</sup> The canonical link element helps webmasters make clear to the search engines which page should be credited as the original.
According to Google, the canonical link element is not considered to be a directive, but a hint that the web crawler will "honor strongly".<sup id="cite_ref-googleblog_0-1" class="reference">[1]</sup>
While the canonical link element has its benefits, Matt Cutts, who is the head of Google's webspam team, has claimed that the search engine prefers the use of 301 redirects. Cutts claims the preference for redirects is because Google's spiders can choose to ignore a canonical link element if they feel it is more beneficial to do so.<sup id="cite_ref-5" class="reference">[6]</sup>
[edit]Examples of the
canonical
link element<link rel="canonical" href="http://www.example.com/" />
<link rel="canonical" href="http://www.example.com/page.html" />
<link rel="canonical" href="http://www.example.com/directory/page.html" /> ```
-
you should give it time to settle down in the SERPS ... the results are muddy for a while but your canonicals will eventually show up if they have been implemented correctly.
-
I have already done it but my question come after this one
Where Rand suggest me to do the canonical thing I am explaining here. So my doubt is why it is indexing the new post better than the old one and how this is supposed to work.
From my understanding and also from your link, if I use rel=canonical is the "canonical" url the one that has to be indexed and not the one with "rel=canonical" but it has not been my case and now I have both indexed...
Any suggestion?
-
Is it the opposite. The new one has a rel=canonical to the old one because it was written with the same content that the old one but then it appears in the index.
Then the new one has been indexed and I thought it wasnt going to be indexed. But at the same time it ranks much higger than the old one...
-
According to Google a rel=canonical is just a hint 9although they say they strongly honour it) - http://googlewebmastercentral.blogspot.com/2009/02/specify-your-canonical.html. This might explain why your old page is still showing up int he results.
Has your new page been indexed yet?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Redirecting Canonical Hostnames
Hi, I want to rewrite all the url pages of "site.com" to "www.site.com". I read the moz redirection article and i concluded that this would be the best approach. RewriteCond %{HTTP_HOST} !^www.seomoz.org [NC]
Technical SEO | | bigrat95
RewriteRule (.*) http://www.seomoz.org/$1 [L,R=301]. But i recieved this error: Internal Server Error The server encountered an internal error or misconfiguration and was unable to complete your request. Please contact the server administrator, webmaster@localhost and inform them of the time the error occurred, and anything you might have done that may have caused the error. More information about this error may be available in the server error log. I tried this rewrite too... RewriteCond %{HTTP_HOST} !^www. [NC]
RewriteRule ^(.*)$ http://www.%{HTTP_HOST}/$1 [L,R=301] It worked but it just rewriting my domain** "site.com"** and not all the subs "site.com/fr/example.php" to "www.site.com" Why it doesn't work properly, it seem to be easy... Could it be a hosting problem? Is there another way to do it? <address> </address> <address> </address> <address> </address> <address> </address>0 -
Best way to handle pages with iframes that I don't want indexed? Noindex in the header?
I am doing a bit of SEO work for a friend, and the situation is the following: The site is a place to discuss articles on the web. When clicking on a link that has been posted, it sends the user to a URL on the main site that is URL.com/article/view. This page has a large iframe that contains the article itself, and a small bar at the top containing the article with various links to get back to the original site. I'd like to make sure that the comment pages (URL.com/article) are indexed instead of all of the URL.com/article/view pages, which won't really do much for SEO. However, all of these pages are indexed. What would be the best approach to make sure the iframe pages aren't indexed? My intuition is to just have a "noindex" in the header of those pages, and just make sure that the conversation pages themselves are properly linked throughout the site, so that they get indexed properly. Does this seem right? Thanks for the help...
Technical SEO | | jim_shook0 -
Rel Canonical tag using Wordpress SEO plugin
Hi team I hope this is the right forum for asking this question. I have a site http://hurunuivillage.com built on Wordpress 3.5.1 using a child theme on Genesis 1.9. We're using Joost's Wordpress SEO plugin and I thought it was configured correctly but the Crawl Diagnostics report has identified an issue with the Rel Canonical tag on the sites pages. I have not edited the plugin settings so am surprised the SEOMoz Crawl has picked up a problem. Example: Page URL is http://hurunuivillage.com/ Tag Value http://hurunuivillage.com/ (exactly the same) Page Authority 39 Linking Root Domains 23 Source Code Considering the popularity of the plugin I'm surprised I have not been able to find tutorials to find what I'm doing wrong or should be doing better. Thanks in advance. Best Nic
Technical SEO | | NicDale0 -
Expert Indexation challenge!
We have a major and strange indexation problem on our site for several languages for a while now. If I type in the search query "langsom computer" ("slow pc" in Danish) it used to display the page (www.spamfighter.com/SLOW-PCfighter/Lang_DA/) in the top 3. Now it displays this site instead as result #11 which is an entirely different product: http://www.spamfighter.com/VIRUSfighter/Lang_DA/ The same happens for some other languages. The French search: "Optimisez votre PC trop lent avec une meilleure performance" (Optimize your slow PC for better performance) displays: http://www.spamfighter.com/VIRUSfighter/Lang_FR/ which has nothing in common with the search and the page intended: http://www.spamfighter.com/SLOW-PCfighter/Lang_FR/ Anyone have ANY idea what this could be?
Technical SEO | | Crunchii0 -
IP addresses indexed?
I've met with a potential client who has a site with 1,000's of very specific part #'s which don't show in the SERP's on Google. They definitely have the issue of dynamic URL's - but the URL for the part # searches is an IP address rather than their domain name - example: 188.888.888.888/partssearch.php?pnum='1233445' I've not seen the IP address used like this for an external website - is this acceptable for SEO purposes? Thanks, Mark
Technical SEO | | DenverKelly0 -
Search Result Page, Index or Not?
I believe Google doesn't want to index and show other search result pages in there SERP.
Technical SEO | | DigitalJungle
So instead of adding "noindex, follow" tag i have changed the url in my search result page like this: Original
http://www.mysite.com/kb-search.aspx?=travelguide&type=wiki&s=3 To
http://www.mysite.com/travelguide/attraction-guide.html And the search result page contains the title of the articles, a short descriptions (300 chars.) and a link to the articles. Does it help? Or should i add noindex, follow tag? Helps Please?0 -
Rel="canonical" for PFDs?
Hello there, We have a lot of PDFs that seem to end up on other websites. I was wondering if there was a way to make sure that our website gets the credit/authority as the original creator. Besides linking directly from the PDF copy to our pages, is anyone aware of strategy for letting Google know that we are the original publishers? I know search engines can index HTML versions of PDFs, so is there anyway to get them to index a rel="canonical" tag as well? Thoughts/Ideas?
Technical SEO | | Tektronix0 -
Why Google did not index our domain?
Hi, We launched tmart 60 days ago and submitted to google, bing, yahoo 20 days later. But google had never indexed our website still when yahoo indexed it in one week. What we have checked or tried: 1. We got 20~50 inlinks in one month and now 81 inlinks via yahoo site explorer. 2. This domain has registered for 13 years and we purchased it from sedo last year. We
Technical SEO | | zt673
did not find any problems from domain archive pages. 3. Page similar: the homepage is 50% similar to one of our competitors when we just launched.
So we adjusted the page structure and modified the content one month later and decreased the similarity to 30% (by tools from webconfs.com) 4. Google Robots: googlebot crawled our website every day after we submitted for indexing.
We opened GWT account for it and added the xml sitemap last week. GWT said nothing
was wrong except the time of page loading. Our questions: Why google did not indexed our website? What should we do? Thanks, wu0