Will rel=canonical cause a page to be indexed?
-
Say I have 2 pages with duplicate content:
One of them is: http://www.originalsite.com/originalpage
This page is the one I want to be indexed on google (domain rank already built, etc.)
http://www.originalpage.com is more of an ease of use domain, primarily for printed material. If both of these sites are identical, will rel=canonical pointing to "http://www.originalsite.com/originalpage" cause it to be indexed? I do not plan on having any links on my site going to "http://www.originalsite.com/originalpage", they would instead go to "http://www.originalpage.com".
-
Read your additional comment (to @Highland). If you canonical from a known page (indexed and linked to, internally and/or externally) to an unknown page with no links, it would act a bit like a 301-redirect, in theory. The target page (of the canonical) would start ranking as if it were the source page.
The problem is that that page isn't really canonical. You have a tag saying "This is the page" but every single other cue (internal links, inbound links, etc.) says that the non-canonical page is really canonical. In other words, your canonical tag says the opposite of everything else you're saying. That's generally not a good situation. If you want a page to be canonical, treat it that way. Sending Google mixed signals can get messy fast.
-
Why would you point rel canonical to a page you don't want to rank?
-
I probably phrased poorly...simpler question: If there is a page that nobody knows about, it hasn't been submitted, there are no links to it...the only way the outside world would ever know it exists is if they looked at a rel="canonical" tag...will google follow that canonical tag and index it?
-
I actually have a completely different experience. Within the same domain, not between 2 domains. Lets say my page is http://www.originalsite.com/originalpage-1.html http://www.originalsite.com/originalpage-2.html http://www.originalsite.com/originalpage-3.html Each of them is actually http://www.originalsite.com/originalpage.html So each of the above pages (all 4) contain a canonical tag to the original page http://www.originalsite.com/originalpage.html What happens is when I check in the SERPS, nothing except http://www.originalsite.com/originalpage.html show up doing site: checks. However, if I do a cache: for any of the 4 pages, the http://www.originalsite.com/originalpage.html shows up. So Google identifies each of the URLs, but only returns http://www.originalsite.com/originalpage.html in my case.
-
Canonical doesn't prevent a page from being indexed. Canonical allows you, the end user, to specify which of your duplicate pages to treat as the real page. Otherwise Google will pick one. The page still is in the index and is still crawled, it's just ignored for ranking purposes.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
I hope someone can help me with page indexing problem
I have a problem with all video pages on www.tadibrothers.com.
Technical SEO | | TadiBrothers
I can not understand why google do not index all the video pages?
I never blocked them with the robots.txt file, there are no noindex/nofollow tags on the pages. The only video page that I found in search results is the main video category page: https://www.tadibrothers.com/videos and 1 video page out of 150 videos: https://www.tadibrothers.com/video/front-side-rear-view-cameras-for-backup-camera-systems I hope someone can point me to the right way0 -
How long after disallowing Googlebot from crawling a domain until those pages drop out of their index?
We recently had Google crawl a version of the site we that we had thought we had disallowed already. We have corrected the issue of them crawling the site, but pages from that version are still appearing in the search results (the version we want them to not index and serve up is our .us domain which should have been blocked to them). My question is this: How long should I expect that domain (the .us we don't want to appear) to stay in their index after disallowing their bot? Is this a matter of days, weeks, or months?
Technical SEO | | TLM0 -
how to set rel canonical on wordpress.com sites
I know how to do this with a wordpress.org site but I have a client that does not want to switch and without a plugin I am lost. any help would be greatly appreciated. Jeremy Wood
Technical SEO | | SOtBOrlando0 -
/index.php/ page
I was wondering if my system creates this page www my domain com/index.php/ is it better to block with robot.txt or just canonize?
Technical SEO | | ciznerguy0 -
Rel=cannonical vs. noindex.follow for paginated pages
I"m working on a real estate site that has multiple listing pages, e.g. http://www.hhcrealestate.com/manhattan-beach-mls-real-estate-listings I'm trying to get the main result page to rank for that particular geo-keyword, i.e. "manhattan beach homes for sale". I want to make sure all of the individual listings on the paginated pages, 2,3, 4 etc. still get indexed. Is it better to add to all of the paginated pages, i.e.manhattan-beach-mls-real-estate-listings-2, manhattan-beach-mls-real-estate-listings--3, manhattan-beach-mls-real-estate-listings-4, etc. or is it better to add noindex,follow to those pages?
Technical SEO | | fthead91 -
Will frequently adding and frequently removing pages from my site hinder any SEO?
Hi Guys, Just looking through our crawl diagnositcs and we have a ton errors, well over 5000 actually, on 404 pages that cannot be accessed. Our website runs a lot of "Hot Offers" that are time bound, so they expire at the end of each month and we remove the page via our CMS. It's making the crawl diagnositcs loook bad, but will this be hindering our seo and Google 'stuff' because they are finding thousands of 404 errors? Any advice would be greatly appreciated! Website: www.vospers.com Lee Greenhill
Technical SEO | | lee_greenhill0 -
Will using http ping, lastmod increase our indexation with Google?
If Google knows about our sitemaps and they’re being crawled on a daily basis, why should we use the http ping and /or list the index files in our robots.txt? Is there a benefit (i.e. improving indexability) to using both ping and listing index files in robots? Is there any benefit to listing the index sitemaps in robots if we’re pinging? If we provide a decent <lastmod>date is there going to be any difference in indexing rates between ping and the normal crawl that they do today?</lastmod> Do we need to all to cover our bases? thanks Marika
Technical SEO | | marika-1786190 -
Canonical on ecommerce pages
I have seen some competitors using the nofollow tag as well as canonical on all refinements and sorts on their ecommerce pages. Example being if you went to their hard drive category page and refined by 500gb hard drives then that page would have a canonical element to send it back to hard drives page without the refinement. I see how this could be good for control indexation and the amount pages Google crawls, but do you see problems in using the canonical tag this way? Also I have seen competitors have category page descriptions (describing what that type of product is) on all pagenation and refinements (the exact same block of text on all of the pages). Would this be a duplicate content problem or is it not that big of a deal since the content is only on their site so they are only competiting with themselves. Thanks for your help
Technical SEO | | Gordian0