Canonical URL availability
-
Hi
We have a website selling cellphones. They are available in different colors and with various data capacity, which slightly changes the URL.
For instance:
- Black iphone, 16GB: www.site.com/iphone(black,16,000000000010204783).html
- White iphone, 16GB: www.site.com/iphone(white,16,000000000010204783).html
- White iphone, 24GB: www.site.com/iphone(white,24,000000000010204783).html
Now, the canonical URL indicates a standard URL:
But this URL is never physically available. Instead, a user gets 301 redirected to one of the above URLs. Is this a problem? Does a URL have to be "physically" available if it is indicated as canonical?
-
Thanks Dirk for your great in-depth response!
I will now check with developers what the estimated effort would be. Making the canonical URL available will let me sleep better at night before releasing the new site version.
I think the risk shouldn't be huge if we cannot do this and will not waste too many ressources on this (unless, of course, we see a negative impact, which I will then report here;)Best,
Phil -
With a 301 you communicate that the requested resource is no longer available (The requested resource has been assigned a new permanent URI and any future references to this resource SHOULD use one of the returned URIs- source: http://www.w3.org/Protocols/rfc2616/rfc2616-sec10.html)
If you look at the definition of a canonical url - it indicates the preferred URL to use, so that the search results will be more likely to show users that URL structure. (Google attempts to respect this, but cannot guarantee this in all cases.)
So basically what you are telling to Google:
On your site you ask Google not to index site.com/A.htm - but rather to index url site.com/B.htm
On the url site.com/B.htm you put a 301 to site.com/C.htm - in other words force Google to index C.htm rather than B.htm (the 301 indicates that the page has permanently moved to a new location - so is no longer available on B.htm)So in fact - you ask Google not to index A.htm but C.htm instead. Rather than doing this in a complicated 2step process using both canonical & redirect it would be simpler & make more sense to directly put a canonical url on A.htm with C.htm as canonical.
In your case you could create www.site.com/iphone but if it's identical to www.site.com/iphone(black,16,000000000010204783).html I don't think you will gain a lot (especially if it requires a lot of development)
rgds,
Dirk
-
Thank you Dirk!
I did look at the article you pointed out, but could not initially find that information:
"Double-check that your rel=canonical target exists (it’s not an error or “soft 404”)"However, for me this is not 100% conclusive. The page does exist, in a way, but it's redirected. I know that to be on the safe side, we should better make it available. But as it would mean a lot of additional programming effort, I am trying to find out if it really is necessary. Thats' why I was hoping someone already has some experience with this...
-
Normally a canonical url should be physically available - see also: http://googlewebmastercentral.blogspot.be/2013/04/5-common-mistakes-with-relcanonical.html
With a canonical you indicate the Search engines which page you want to have listed in the SERP's. A page which is 301'd to another page will never get listed in the results.
In your case - it's probably better to use the url where your are redirecting to as canonical - or to create a page www.site.com/iphone that is not redirected
rgds,
Dirk
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Does google ignore ? in url?
Hi Guys, Have a site which ends ?v=6cc98ba2045f for all its URLs. Example: https://domain.com/products/cashmere/robes/?v=6cc98ba2045f Just wondering does Google ignore what is after the ?. Also any ideas what that is? Cheers.
Intermediate & Advanced SEO | | CarolynSC0 -
Will disallowing URL's in the robots.txt file stop those URL's being indexed by Google
I found a lot of duplicate title tags showing in Google Webmaster Tools. When I visited the URL's that these duplicates belonged to, I found that they were just images from a gallery that we didn't particularly want Google to index. There is no benefit to the end user in these image pages being indexed in Google. Our developer has told us that these urls are created by a module and are not "real" pages in the CMS. They would like to add the following to our robots.txt file Disallow: /catalog/product/gallery/ QUESTION: If the these pages are already indexed by Google, will this adjustment to the robots.txt file help to remove the pages from the index? We don't want these pages to be found.
Intermediate & Advanced SEO | | andyheath0 -
URL rewrite traffic drop
Hello, A while ago (Sep. 19 2013) we had a new url structure upgrade for products pages within our website (with all the needed 301 redirects in place,internal links & sitemaps updates), but our new urls lost the serps of the old ones and with that we experienced a big traffic drop (and since September I can't see any sign of recovery).
Intermediate & Advanced SEO | | Silviu
Here are just 3 examples of old and coresponding new urls: http://www.nobelcom.com/phone-cards/calling-Mexico-from-United-States-1-182.html
http://www.nobelcom.com/Mexico-phone-cards-182.html http://www.nobelcom.com/es/phone-cards/calling-Mexico-from-United-States-1-182.html
http://www.nobelcom.com/es/Mexico-tarjetas-telefonicas-182.html http://www.nobelcom.com/phone-cards/calling-Angola-Cell-from-Canada-55-407.html
http://www.nobelcom.com/Angola-Cell-phone-cards/from-Canada-55-407.html We followed every seo/usability rule and have no clue why this happened. Any ideea? Cheers,
S.0 -
Recommended URL Structure
Hello, We are currently adding a new section of content on our site related to Marketing and more specifically 'Digital Marketing' (research reports, trend studies, etc). Over time (several months, or 1-3 years) we will add more 'general' marketing content. My question is which of the following URL structures makes more sense from an SEO perspective (and how best to quantify the benefit of one over another): www.mysite.com/marketing/digital/research/... www.mysite.com/digital-marketing/research/.. Thanks, Mike
Intermediate & Advanced SEO | | mike-gart0 -
What is the best canonical url to use for a product page?
I just helped a client redesign and launch a new website for their organic skin care company (www.hylunia.com). The site is built in Magento which by default creates MANY urls for each product. Which of these two do you think would be the best to use as the canonical version? http://www.hylunia.com/pure-hyaluronic-acid-solution
Intermediate & Advanced SEO | | danielmoss
or http://www.hylunia.com/products/face-care/facial-moisturizers/pure-hyaluronic-acid-solution ? I'm leaning on the latter, because it makes sense to me to have the breadcrumbs match the url string, and also it seems having more keywords in the url would help. However, it's obviously a very long url, and there might be some benefits to using the shorter version that I'm not aware of. Thanks in advance for sharing your thoughts. Best, Daniel0 -
Cross Sub Domain Canonical Links
I currently have 1 website, but am planning on dividing it into sub-domains specific to geographic locations such as xxx.co.uk, xxx.it, xxx.es, etc... We are working on creating original content for the sub-sites, however upon launch many will be duplicate pages. Is there a problem with cross sub-domain canonical links? Thanks!
Intermediate & Advanced SEO | | theLotter0 -
Googlebot crawling partial URLs
Hi guys, I've checked my email this morning and I've got a number of 404 errors over the weekend where Google has tried to crawl some of my existing pages but not found the full URL. Instead of hitting 'domain.com/folder/complete-pagename.php' it's hit 'domain.com/folder/comp'. This is definitely Googlebot/2.1; http://www.google.com/bot.html (66.249.72.53) but I can't find where it would have found only the partial URL. It certainly wasn't on the domain it's crawling and I can't find any links from external sites pointing to us with the incorrect URL. GoogleBot is doing the same thing across a single domain but in different sub-folders. Having checked Webmaster Tools there aren't any hard 404s and the soft ones aren't related and haven't occured since August. I'm really confused as to how this is happening.. Thanks!
Intermediate & Advanced SEO | | panini0 -
Quick URL structure question
Say you've got 5,000 articles. Each of these are from 2-3 generations of taxonomy. For example: example.com/motherboard/pc/asus39450 example.com/soundcard/pc/hp39 example.com/ethernet/software/freeware/stuffit294 None of the articles were SUPER popular as is, but they still bring in a bit of residual traffic combined. Few thousand or so a day. You're switching to a brand new platform. Awesome new structure, taxonomy, etc. The real deal. But, historically, you don't have the old taxonomy functions. The articles above, if created today, file under example.com/hardware/ This is the way it is from here on out. But what to do with the historical files? keep the original URL structure, in the new system. Readers might be confused if they try to reach example.com/motherboard, but at least you retain all SEO weight and these articles are all older anyways. Who cares? Grab some lunch. change the urls to /hardware/, and redirect everything the right way. Lose some rank maybe, but its a smooth operation, nice and neat. Grab some dinner. change the urls to /hardware/ DONT redirect, surprise Google with 5k articles about old computer hardware. Magical traffic splurge, go skydiving. Panic, cry into your pillow. Get job signing receipts at CostCo Thoughts?
Intermediate & Advanced SEO | | EricPacifico0