Will rel=canonical cause a page to be indexed?
-
Say I have 2 pages with duplicate content:
One of them is: http://www.originalsite.com/originalpage
This page is the one I want to be indexed on Google (domain rank already built, etc.)
http://www.originalpage.com is more of an ease-of-use domain, primarily for printed material. If both of these sites are identical, will a rel=canonical pointing to "http://www.originalsite.com/originalpage" cause it to be indexed? I do not plan on having any links on my site going to "http://www.originalsite.com/originalpage"; they would instead go to "http://www.originalpage.com".
-
Read your additional comment (to @Highland). If you canonical from a known page (indexed and linked to, internally and/or externally) to an unknown page with no links, it would act a bit like a 301-redirect, in theory. The target page (of the canonical) would start ranking as if it were the source page.
The problem is that that page isn't really canonical. You have a tag saying "This is the page" but every single other cue (internal links, inbound links, etc.) says that the non-canonical page is really canonical. In other words, your canonical tag says the opposite of everything else you're saying. That's generally not a good situation. If you want a page to be canonical, treat it that way. Sending Google mixed signals can get messy fast.
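One rough way to spot this kind of mixed signal on your own pages is sketched below in Python (standard library only). It assumes you already have the HTML of the pages in hand. The home page at http://www.originalsite.com/ and its link are invented for illustration, and the class and function names are hypothetical. It only compares what the canonical tag says with what the links say; it does not predict what Google will do.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class SignalParser(HTMLParser):
    """Records the canonical href and every <a href> found on one page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.canonical = None
        self.links = []

    def handle_starttag(self, tag, attrs):
        attr_map = dict(attrs)
        href = attr_map.get("href")
        if not href:
            return
        absolute = urljoin(self.base_url, href)  # resolve relative hrefs
        rel_values = (attr_map.get("rel") or "").lower().split()
        if tag == "link" and "canonical" in rel_values:
            self.canonical = absolute
        elif tag == "a":
            self.links.append(absolute)


def conflicting_canonicals(pages):
    """Return (source, target) pairs where the canonical target receives no
    links from the supplied pages, but the page carrying the tag does."""
    parsed = {}
    for url, html in pages.items():
        parser = SignalParser(url)
        parser.feed(html)
        parsed[url] = parser

    # Count how many links point at each URL across all supplied pages.
    inbound = {}
    for parser in parsed.values():
        for link in parser.links:
            inbound[link] = inbound.get(link, 0) + 1

    conflicts = []
    for url, parser in parsed.items():
        target = parser.canonical
        if target and target != url:
            if inbound.get(target, 0) == 0 and inbound.get(url, 0) > 0:
                conflicts.append((url, target))
    return conflicts


# Assumed sample data modelled on the question: links go to the short domain,
# while its canonical tag points at the page nobody links to.
sample_pages = {
    "http://www.originalpage.com/": (
        '<html><head><link rel="canonical" '
        'href="http://www.originalsite.com/originalpage"></head>'
        "<body></body></html>"
    ),
    "http://www.originalsite.com/": (
        '<html><body><a href="http://www.originalpage.com/">Our page</a>'
        "</body></html>"
    ),
}

conflicts = conflicting_canonicals(sample_pages)
print(conflicts)
```

Any pair it reports is a page whose tag names one URL as canonical while the links you control point somewhere else, which is the situation described above.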
-
Why would you point rel canonical to a page you don't want to rank?
-
I probably phrased that poorly. A simpler question: if there is a page that nobody knows about (it hasn't been submitted and there are no links to it), and the only way the outside world would ever know it exists is by looking at a rel="canonical" tag, will Google follow that canonical tag and index it?
-
I actually have a completely different experience, within a single domain rather than between 2 domains. Let's say my pages are:
http://www.originalsite.com/originalpage-1.html
http://www.originalsite.com/originalpage-2.html
http://www.originalsite.com/originalpage-3.html
Each of them is actually the same page as http://www.originalsite.com/originalpage.html, so all 4 pages contain a canonical tag pointing to the original page, http://www.originalsite.com/originalpage.html.
When I check the SERPs with a site: search, nothing except http://www.originalsite.com/originalpage.html shows up. However, if I do a cache: search for any of the 4 pages, http://www.originalsite.com/originalpage.html shows up. So Google identifies each of the URLs, but only returns http://www.originalsite.com/originalpage.html in my case.
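For anyone who wants to check a setup like this themselves, here is a minimal sketch in Python (standard library only) that pulls the rel="canonical" target out of each page's HTML and groups the URLs by that target. The HTML snippets are made-up stand-ins for the four URLs above, and the helper names are hypothetical. It only shows what the tags declare, not what Google has actually indexed.

```python
from html.parser import HTMLParser


class CanonicalFinder(HTMLParser):
    """Collects the href of the first <link rel="canonical"> tag."""

    def __init__(self):
        super().__init__()
        self.canonical = None

    def handle_starttag(self, tag, attrs):
        if tag != "link" or self.canonical is not None:
            return
        attr_map = dict(attrs)
        rel_values = (attr_map.get("rel") or "").lower().split()
        if "canonical" in rel_values:
            self.canonical = attr_map.get("href")


def find_canonical(html):
    """Return the canonical href declared in the HTML, or None."""
    finder = CanonicalFinder()
    finder.feed(html)
    return finder.canonical


def group_by_canonical(pages):
    """Map each canonical target to the list of URLs that point at it."""
    groups = {}
    for url, html in pages.items():
        target = find_canonical(html) or url  # no tag: page stands for itself
        groups.setdefault(target, []).append(url)
    return groups


ORIGINAL = "http://www.originalsite.com/originalpage.html"

# Assumed markup: every variant (and the original) carries the same tag.
HEAD = '<html><head><link rel="canonical" href="' + ORIGINAL + '" /></head></html>'

example_pages = {
    "http://www.originalsite.com/originalpage-1.html": HEAD,
    "http://www.originalsite.com/originalpage-2.html": HEAD,
    "http://www.originalsite.com/originalpage-3.html": HEAD,
    ORIGINAL: HEAD,
}

groups = group_by_canonical(example_pages)
for target, urls in groups.items():
    print(target, "<-", len(urls), "URL(s)")
```

If all four URLs land in a single group keyed by the original page, the tags are at least consistent with each other, which matches the behaviour I see in the site: results.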
-
Canonical doesn't prevent a page from being indexed. It lets you, the site owner, specify which of your duplicate pages to treat as the real page; otherwise Google will pick one. The page is still crawled and still in the index; it's just ignored for ranking purposes.