Canonical Query
-
If Google decides to ignore your canonical and indexes numerous versions of a page, does that count as duplicate content?
We've got a large number of canonicals being ignored by Google, so I'm just trying to gauge whether it's an issue or not.
-
Hi Ruth,
Appreciate your response. We're trying to get these sorted at a code level, but we currently have six different issues, each causing its own problems, along with a variety of other features not working correctly. (The joys of working with a 10-year-old system that is behind in a few areas.)
You say the following:
- Make sure that the pages your canonical tags point to are very similar to the pages the tags are on - if they're too different, Google may decide they both need to be indexed.
Is it strange that the canonicals on pages that are not exact duplicates (category filters on ecommerce) are the main ones being obeyed, while the product canonicals (on exact duplicates, apart from changes to the breadcrumbs) are the ones being ignored?
Some of these pages are receiving search traffic, but not a massive amount, at least compared to the true versions of these pages: some of the duplicates get tens to hundreds of clicks, while the canonical pages get thousands or tens of thousands.
Would it be a viable strategy to deal with these by redirecting the non-canonical URLs to their canonical versions (as a short-term fix until we can get the issues sorted)?
Final query: if Google ignores the canonical, is this potentially going to penalise us? If the answer is believed to be yes, then it'll be a higher-priority item to deal with.
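For illustration, the short-term redirect idea above could be sketched roughly as follows. This is only a sketch under assumptions: the paths in `CANONICAL_FOR` are hypothetical, the function name is made up for this example, and whether a permanent (301) or temporary (302) status suits a stop-gap fix is a judgment call, so the status is left as a parameter.

```python
def resolve_redirect(path, canonical_for, status=301):
    """Return (status, target) when `path` is a known duplicate, else None."""
    target = canonical_for.get(path)
    if target is None or target == path:
        return None  # unknown path, or already the canonical URL
    if canonical_for.get(target, target) != target:
        # The target is itself mapped elsewhere: refuse rather than build a chain.
        raise ValueError(f"redirect chain: {path} -> {target}")
    return status, target


# Hypothetical duplicate -> canonical mapping (illustrative paths only).
CANONICAL_FOR = {
    "/recommended/blue-widget": "/products/blue-widget",
    "/sale/widgets/blue-widget": "/products/blue-widget",
}

print(resolve_redirect("/recommended/blue-widget", CANONICAL_FOR))
print(resolve_redirect("/products/blue-widget", CANONICAL_FOR))
```

Keeping the mapping as plain data means it can be dropped again once the underlying URL generation is fixed at a code level.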
-
Google can definitely choose to ignore the canonical tag, especially if they think that the page in question is a better solution to a query. I agree with the other respondents that the best possible solution would be to fix this at a code level, so the duplicate content isn't an issue on your site anymore. In the meantime, some things to try:
- Make sure that your internal hierarchy makes the canonical versions more important than the duplicate versions, i.e. they appear farther up in your site nav and have more internal links pointing to them.
- Try building some external links to those pages as well, where you can.
- Make sure that the pages your canonical tags point to are very similar to the pages the tags are on - if they're too different, Google may decide they both need to be indexed.
Are any of the duplicate pages receiving organic search traffic? If not, it may be that Google has indexed them but understands they're not as important. Again, though, the best possible solution would be to fix this at a code level.
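As a rough way to check the "very similar" point above, the visible text of a duplicate page and its canonical target can be compared. The sketch below uses only the Python standard library; the two HTML snippets are made-up examples, and the resulting score is just a word-level similarity ratio, not anything Google publishes or uses.

```python
from difflib import SequenceMatcher
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects visible words, skipping <script> and <style> contents."""

    def __init__(self):
        super().__init__()
        self.words = []
        self._skip = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        if not self._skip:
            self.words.extend(data.split())


def visible_words(html):
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.words


def similarity(html_a, html_b):
    """Word-level similarity between two pages, from 0.0 to 1.0."""
    matcher = SequenceMatcher(None, visible_words(html_a),
                              visible_words(html_b), autojunk=False)
    return matcher.ratio()


# Hypothetical pages that differ only in their breadcrumb.
PAGE_A = "<html><body><nav>Home Shoes</nav><h1>Red Shoe</h1><p>Great red shoe for running</p></body></html>"
PAGE_B = "<html><body><nav>Home Sale</nav><h1>Red Shoe</h1><p>Great red shoe for running</p></body></html>"

print(round(similarity(PAGE_A, PAGE_B), 3))
```

Pairs that score low here would be candidates for a closer manual look before relying on a canonical tag between them.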
-
Sent an email, have you received it?
-
Hey Tom,
Thanks, I will check it out on DeepCrawl and hope to find out what is going on.
Tom
-
Hi Tom,
I use Moz, Screaming Frog and this canonical checker: https://chrome.google.com/webstore/detail/canonical/dcckfeohihhlbeobohobibjbdobjbhbo?utm_source=chrome-app-launcher-info-dialog
I'm sure that these canonicals are set up correctly.
I will send you an email to the email you have included on your profile.
Thanks,
Tom
-
It sounds to me like your problem is your CMS and your inability to access Google Webmaster Tools. If you're going off Google Analytics, that's not going to tell you the entire story. Use Moz, DeepCrawl, or Screaming Frog to determine whether or not your canonicals are set up correctly.
It is possible that they're being blocked by some code error and not being picked up by Googlebot.
Please run your site through the tools suggested, and let me know if you need help in the form of somebody to run those tools for you. I am willing to bet that it is a code error, not Google deciding to ignore properly set up canonicals.
Google Analytics will show you whenever somebody has clicked on a URL; it does not mean that the bot is following that URL.
Without seeing more, I really couldn't tell you much more, unfortunately. You can private message me with your domain if you'd like, and I will check it out.
Hope this helps,
Tom
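For anyone without access to those tools, a very basic check of a page's canonical tag can be sketched with the Python standard library. The checks below (tag missing, more than one tag, tag outside the head, href not absolute) are assumptions about common misconfigurations rather than an official list, and the function takes HTML you have already fetched.

```python
from html.parser import HTMLParser
from urllib.parse import urlsplit


class CanonicalFinder(HTMLParser):
    """Records every <link rel="canonical"> and whether it sat inside <head>."""

    def __init__(self):
        super().__init__()
        self.in_head = False
        self.found = []  # list of (href, was_inside_head)

    def handle_starttag(self, tag, attrs):
        if tag == "head":
            self.in_head = True
        elif tag == "link":
            attr = dict(attrs)
            if "canonical" in (attr.get("rel") or "").lower().split():
                self.found.append((attr.get("href"), self.in_head))

    def handle_endtag(self, tag):
        if tag == "head":
            self.in_head = False


def audit_canonical(html):
    finder = CanonicalFinder()
    finder.feed(html)
    finder.close()
    problems = []
    if not finder.found:
        problems.append("no canonical tag found")
    if len(finder.found) > 1:
        problems.append("multiple canonical tags")
    for href, in_head in finder.found:
        if not in_head:
            problems.append("canonical outside <head>")
        if not href or not urlsplit(href).scheme:
            problems.append("canonical href missing or not absolute")
    return {"hrefs": [h for h, _ in finder.found], "problems": problems}


# Hypothetical page used only to show the call.
SAMPLE = '<html><head><link rel="canonical" href="https://example.com/p/1"></head><body></body></html>'
print(audit_canonical(SAMPLE))
```

An empty `problems` list only means the tag looks well-formed; it says nothing about whether Google will honour it.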
-
Thank you for your responses. Hopefully someone who may have experienced this before will be able to contribute. It seems there's very little in this area about the potential impacts.
-
I believe you could be at risk of duplicate content issues. If it were my client, I'd definitely consider this a code-red issue and attack it from all possible angles.
-
Yep, clean URLs there.
So, do you believe that Google ignoring these canonicals is something we should be worried about? (Basically, should I set this as a high priority so development sorts these issues out?)
-
Hmm... the only other thing I can think of is that your XML sitemap may contain these additional URL strings, but I assume you've already got clean URLs there.
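A quick way to test that assumption is to scan the sitemap for entries that carry a query string. The sketch below assumes a standard `urlset` sitemap using the sitemaps.org namespace and that "additional URL strings" means query parameters; the sample XML and its URLs are made up.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlsplit

# Namespace used by standard XML sitemaps.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def urls_with_query(sitemap_xml):
    """Return the <loc> URLs in a sitemap that contain a query string."""
    root = ET.fromstring(sitemap_xml)
    flagged = []
    for loc in root.findall(".//sm:loc", SITEMAP_NS):
        url = (loc.text or "").strip()
        if urlsplit(url).query:
            flagged.append(url)
    return flagged


# Hypothetical sitemap content for illustration.
SAMPLE_SITEMAP = (
    '<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">'
    "<url><loc>https://example.com/p/1</loc></url>"
    "<url><loc>https://example.com/p/1?rec=abc</loc></url>"
    "</urlset>"
)

print(urls_with_query(SAMPLE_SITEMAP))
```

If the duplicates differ by path rather than by parameters, the same loop could instead compare each URL against a list of known canonical paths.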
-
Yeah, they're definitely right. As a whole, Google agrees with our canonicals, but there are various batches that Google chooses to ignore.
Unfortunately I don't have access to Search Console; I have access to GA, but that's it. I have to rely on third-party tools and other things to try to see the impact. We also have a very restrictive platform, which requires things to go through development. So I'm just trying to gauge the seriousness of this issue so that I can put together a priority list.
To put the scale into perspective, it looks as if Google is ignoring the canonicals on the majority of our product URLs (thanks to a product recommendation software we use) and is using a different URL path. Same with breadcrumbs.
255k indexed pages; the ignored canonicals that I've found run to about 15k from just the two sources above.
-
That's odd; I've never seen a case where Google ignored canonical tags. Since I don't have an example, I have to ask: are your canonical tags in the right place?
Another thing you might try: have you set up parameter handling in Search Console?