Canonical Query
-
If Google decides to ignore your canonical and indexes numerous versions, does that count as duplicate content?
We've got a large amount of canonicals ignored by Google, so I'm just trying to gauge if it's an issue or not.
-
Hi Ruth,
Appreciate your response. Trying to get these sorted at a code level, but we currently have six different issues all providing various issues, along with a variety of other features not working correctly. (The joys of working with a 10 year old system that is behind in a few areas)
You say the following:
- Make sure that the pages your canonical tags point to are very similar to the pages the tags are on - if they're too different, Google may decide they both need to be indexed.
Is it strange that the canonicals that are not the exact duplicates (category filters on ecommerce) are the main ones that are obeyed, the product canonicals (with exact duplicates, excluding changes to the breadcrumbs) are the ones being ignored.
There are pages that are receiving search traffic, but not a massive amount (atleast compared to the true versions of these pages, some of these pages get 10s to 100s of clicks, the canonical pages get thousands/tens of thousands)
Would a viable strategy to try and deal with these by redirecting these non-canonical urls to their canonical format? (short term until we can get issues sorted)
Final query, if Google ignores the canonical is this potentially going to be penalising us? If the answer is believed to be yes then it'll be a higher priority item to deal with.
-
Google can definitely choose to ignore the canonical tag, especially if they think that the page in question is a better solution to a query. I agree with the other respondents that the best possible solution would be to fix this at a code level, so the duplicate content isn't an issue on your site anymore. In the meantime, some things to try:
- Make sure that your internal hierarchy makes the canonical versions more important than the duplicate versions, i.e. they appear farther up in your site nav and have more internal links pointing to them.
- Try building some external links to those pages as well, where you can.
- Make sure that the pages your canonical tags point to are very similar to the pages the tags are on - if they're too different, Google may decide they both need to be indexed.
Are any of the duplicate pages receiving organic search traffic? If not, it may be that Google has indexed them but understands they're not as important. Again, though, the best possible solution would be to fix this at a code level.
-
Sent an email, have you received it?
-
Hey Tom,
Thanks will check it out on Deep crawl hope to find out what is going on.
Tom
-
Hi Tom,
I use Moz, Screaming Frog and this canonical checker: https://chrome.google.com/webstore/detail/canonical/dcckfeohihhlbeobohobibjbdobjbhbo?utm_source=chrome-app-launcher-info-dialog I'm sure that these canonicals are set up correctly.
I will send you an email to the email you have included on your profile.
Thanks,
Tom
-
It sounds to me like your problem is your CMS and your inability to access Google Webmaster tools. If you're going off of Google analytics, that's not going to tell you entire story. Use Moz, Deep Crawl, or screaming frog to determine other or not your canonicals are set up correctly.
It is possible that they're being blocked I some code error. And not being picked up by Googlebot.
Please run your site through the tools suggested and let me know if you need help in the form of somebody to run those tools for you I am willing to add that it is a code error, not Google deciding to ignore properly set up canonicals.
Google Analytics will show you whenever somebody has clicked on it does not mean that the bot is following that URL.
Without seeing more I really couldn't tell you much more unfortunately. If you can private message me with your domain if you'd like and I will check it out.
Hope this helps,Tom
Tom
-
Thank you for your responses. Hopefully someone who may have experienced this before will be able to contribute. It seems there's very little in this area about the potential impacts.
-
I believe you could be at risk of duplicate content issues. If it were my client, I'd definitely consider this a code-red issue and attack it from all possible angles.
-
Yep clean URLs there.
So, do you believe that Google ignoring these canonicals is something we should be worried about? (Basically setting a high priority so development sorts these issues out)
-
Hmm...only other thing I can think of is your that XML sitemap may contain these additional URL strings, but I assume you've already got clean URLs there.
-
Yeah they're definitely right, as a whole our canonicals Google agree with, but there's various batches that Google chooses to ignore.
Unfortunately I don't have access to search console, I have access to GA but that's it. I have to rely on third party tools and other things to try and see the impact. We also have a very restrictive platform which requires things to go through development. So i'm just trying to gauge the seriousness of this issue so that I can do a priority list.
To put the scale into perspective, it looks as if Google is ignoring the majority of our product URLs (thanks to a product recommendation software we use) and is using a different url path. Same with breadcrumbs.
255k indexed pages, ignored canonicals that i've found run to about 15k from just the two above.
-
That's odd, I've never seen a case where Google ignored canonical tags. Since I don't have an example, I have to ask, are your canonical tags in the right place?
Another thing you might try, have you set up parameter handling in Search Console?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Dynamic referenced canonical pages based on IP region and link equity question
Hi all, My website uses relative URLs that has PHP to read a users IP address, and update the page's referenced canonical tag to an region specific absolute URL for ranking / search results. E.g. www.example.com/category/product - relative URL referenced for internal links / external linkbuilding If a US IP address hits this link, the URL is the same, but canonicalisation is updated in the source to reference www.example.com**/us/**category/product, so all ranking considerations are pointed to that page instead. None of these region specific pages are actually used internally within the site. This decision was done so external links / blog content would fit a user no matter where they were coming from. I'm assuming this is an issue in trying to pass link equity with Googlebot, because it is splitting the strength between different absolute canonical pages depending on what IP it's using to crawl said links (as the relative URL will dynamically alter the canonical reference which is what ranking in SERPs) Any assistance or information no matter how small would be invaluable. Thanks!
Intermediate & Advanced SEO | | MattBassos0 -
Adding Canonical Tags in WYSIWYG Section of Subscription Based Sites
Our company has a paid subscription-based site that only allows us to add HTML in the WYSIWYG section, not in the backend of each individual page. Because we are an e-commerce site, we have many duplicate page issues. Is there a way for us to add or hide the canonical code in the WYSIWYG section instead of us having to make all of our pages significantly different?
Intermediate & Advanced SEO | | expobranders0 -
Sitemap Query
I've decided to write my own sitemap because frankly, the automated ones pull all kinds of out of I don't know where. So to get around that, manual it is. But I have some products appear in various categories, should I still list every product in each category in the sitemap, regardless of some being duplicates, or should I choose the most relevant category and list them there? I do have a canonical URL extension which should resolve any duplicate content I have.
Intermediate & Advanced SEO | | moon-boots0 -
Use Nonindex or Canonical on product tags of a e-commerce site
I run a e-commerce site and we have many product tags. These product tags come up as "Duplicate Page Content" when Moz does it's crawl. I was wondering if I should use Nonindex or Canonical? The tags all go to the same product when used so I figure I would just nonindex them but was wondering what's the best for SEO?
Intermediate & Advanced SEO | | EmmettButler1 -
Google Indexing Duplicate URLs : Ignoring Robots & Canonical Tags
Hi Moz Community, We have the following robots command that should prevent URLs with tracking parameters being indexed. Disallow: /*? We have noticed google has started indexing pages that are using tracking parameters. Example below. http://www.oakfurnitureland.co.uk/furniture/original-rustic-solid-oak-4-drawer-storage-coffee-table/1149.html http://www.oakfurnitureland.co.uk/furniture/original-rustic-solid-oak-4-drawer-storage-coffee-table/1149.html?ec=affee77a60fe4867 These pages are identified as duplicate content yet have the correct canonical tags: https://www.google.co.uk/search?num=100&site=&source=hp&q=site%3Ahttp%3A%2F%2Fwww.oakfurnitureland.co.uk%2Ffurniture%2Foriginal-rustic-solid-oak-4-drawer-storage-coffee-table%2F1149.html&oq=site%3Ahttp%3A%2F%2Fwww.oakfurnitureland.co.uk%2Ffurniture%2Foriginal-rustic-solid-oak-4-drawer-storage-coffee-table%2F1149.html&gs_l=hp.3..0i10j0l9.4201.5461.0.5879.8.8.0.0.0.0.82.376.7.7.0....0...1c.1.58.hp..3.5.268.0.JTW91YEkjh4 With various affiliate feeds available for our site, we effectively have duplicate versions of every page due to the tracking query that Google seems to be willing to index, ignoring both robots rules & canonical tags. Can anyone shed any light onto the situation?
Intermediate & Advanced SEO | | JBGlobalSEO0 -
How to Remove Joomla Canonical and Duplicate Page Content
I've attempted to follow advice from the Q&A section. Currently on the site www.cherrycreekspine.com, I've edited the .htaccess file to help with 301s - all pages redirect to www.cherrycreekspine.com. Secondly, I'd added the canonical statement in the header of the web pages. I have cut the Duplicate Page Content in half ... now I have a remaining 40 pages to fix up. This is my practice site to try and understand what SEOmoz can do for me. I've looked at some of your videos on Youtube ... I feel like I'm scrambling around to the Q&A and the internet to understand this product. I'm reading the beginners guide.... any other resources would be helpful.
Intermediate & Advanced SEO | | deskstudio0