Canonical Query
-
If Google decides to ignore your canonical and indexes numerous versions, does that count as duplicate content?
We've got a large amount of canonicals ignored by Google, so I'm just trying to gauge if it's an issue or not.
-
Hi Ruth,
Appreciate your response. Trying to get these sorted at a code level, but we currently have six different issues all providing various issues, along with a variety of other features not working correctly. (The joys of working with a 10 year old system that is behind in a few areas)
You say the following:
- Make sure that the pages your canonical tags point to are very similar to the pages the tags are on - if they're too different, Google may decide they both need to be indexed.
Is it strange that the canonicals that are not the exact duplicates (category filters on ecommerce) are the main ones that are obeyed, the product canonicals (with exact duplicates, excluding changes to the breadcrumbs) are the ones being ignored.
There are pages that are receiving search traffic, but not a massive amount (atleast compared to the true versions of these pages, some of these pages get 10s to 100s of clicks, the canonical pages get thousands/tens of thousands)
Would a viable strategy to try and deal with these by redirecting these non-canonical urls to their canonical format? (short term until we can get issues sorted)
Final query, if Google ignores the canonical is this potentially going to be penalising us? If the answer is believed to be yes then it'll be a higher priority item to deal with.
-
Google can definitely choose to ignore the canonical tag, especially if they think that the page in question is a better solution to a query. I agree with the other respondents that the best possible solution would be to fix this at a code level, so the duplicate content isn't an issue on your site anymore. In the meantime, some things to try:
- Make sure that your internal hierarchy makes the canonical versions more important than the duplicate versions, i.e. they appear farther up in your site nav and have more internal links pointing to them.
- Try building some external links to those pages as well, where you can.
- Make sure that the pages your canonical tags point to are very similar to the pages the tags are on - if they're too different, Google may decide they both need to be indexed.
Are any of the duplicate pages receiving organic search traffic? If not, it may be that Google has indexed them but understands they're not as important. Again, though, the best possible solution would be to fix this at a code level.
-
Sent an email, have you received it?
-
Hey Tom,
Thanks will check it out on Deep crawl hope to find out what is going on.
Tom
-
Hi Tom,
I use Moz, Screaming Frog and this canonical checker: https://chrome.google.com/webstore/detail/canonical/dcckfeohihhlbeobohobibjbdobjbhbo?utm_source=chrome-app-launcher-info-dialog I'm sure that these canonicals are set up correctly.
I will send you an email to the email you have included on your profile.
Thanks,
Tom
-
It sounds to me like your problem is your CMS and your inability to access Google Webmaster tools. If you're going off of Google analytics, that's not going to tell you entire story. Use Moz, Deep Crawl, or screaming frog to determine other or not your canonicals are set up correctly.
It is possible that they're being blocked I some code error. And not being picked up by Googlebot.
Please run your site through the tools suggested and let me know if you need help in the form of somebody to run those tools for you I am willing to add that it is a code error, not Google deciding to ignore properly set up canonicals.
Google Analytics will show you whenever somebody has clicked on it does not mean that the bot is following that URL.
Without seeing more I really couldn't tell you much more unfortunately. If you can private message me with your domain if you'd like and I will check it out.
Hope this helps,Tom
Tom
-
Thank you for your responses. Hopefully someone who may have experienced this before will be able to contribute. It seems there's very little in this area about the potential impacts.
-
I believe you could be at risk of duplicate content issues. If it were my client, I'd definitely consider this a code-red issue and attack it from all possible angles.
-
Yep clean URLs there.
So, do you believe that Google ignoring these canonicals is something we should be worried about? (Basically setting a high priority so development sorts these issues out)
-
Hmm...only other thing I can think of is your that XML sitemap may contain these additional URL strings, but I assume you've already got clean URLs there.
-
Yeah they're definitely right, as a whole our canonicals Google agree with, but there's various batches that Google chooses to ignore.
Unfortunately I don't have access to search console, I have access to GA but that's it. I have to rely on third party tools and other things to try and see the impact. We also have a very restrictive platform which requires things to go through development. So i'm just trying to gauge the seriousness of this issue so that I can do a priority list.
To put the scale into perspective, it looks as if Google is ignoring the majority of our product URLs (thanks to a product recommendation software we use) and is using a different url path. Same with breadcrumbs.
255k indexed pages, ignored canonicals that i've found run to about 15k from just the two above.
-
That's odd, I've never seen a case where Google ignored canonical tags. Since I don't have an example, I have to ask, are your canonical tags in the right place?
Another thing you might try, have you set up parameter handling in Search Console?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Best Practice Approaches to Canonicals vs. Indexing in Google Sitemap vs. No Follow Tags
Hi There, I am working on the following website: https://wave.com.au/ I have become aware that there are different pages that are competing for the same keywords. For example, I just started to update a core, category page - Anaesthetics (https://wave.com.au/job-specialties/anaesthetics/) to focus mainly around the keywords ‘Anaesthetist Jobs’. But I have recognized that there are ongoing landing pages that contain pretty similar content: https://wave.com.au/anaesthetists/ https://wave.com.au/asa/ We want to direct organic traffic to our core pages e.g. (https://wave.com.au/job-specialties/anaesthetics/). This then leads me to have to deal with the duplicate pages with either a canonical link (content manageable) or maybe alternatively adding a no-follow tag or updating the robots.txt. Our resident developer also suggested that it might be good to use Google Index in the sitemap to tell Google that these are of less value? What is the best approach? Should I add a canonical link to the landing pages pointing it to the category page? Or alternatively, should I use the Google Index? Or even another approach? Any advice would be greatly appreciated. Thanks!
Intermediate & Advanced SEO | | Wavelength_International0 -
News articles on our website are being indexed, but not showing up for search queries.
News articles on distributed.com are being indexed by Google, but not showing up for any search queries. In Google Search, I can copy and paste the entire first paragraph of the article, and the listing still won't show up in search results. For example, https://distributed.com/news/dtcc-moves-closer-blockchain-powered-trades doesn't rank AT ALL for "DTCC Moves Closer to Blockchain-Powered Trades", the title of the article. We've tried the following so far: re-submitted sitemap to search console checked manual actions in search console checked for any no-index/no-follow tags Please help us solve this SEO mystery!
Intermediate & Advanced SEO | | BTC_Inc0 -
Landing pages for paid traffic and the use of noindex vs canonical
A client of mine has a lot of differentiated landing pages with only a few changes on each, but with the same intent and goal as the generic version. The generic version of the landing page is included in navigation, sitemap and is indexed on Google. The purpose of the differentiated landing pages is to include the city and some minor changes in the text/imagery to best fit the Adwords text. Other than that, the intent and purpose of the pages are the same as the main / generic page. They are not to be indexed, nor am I trying to have hidden pages linking to the generic and indexed one (I'm not going the blackhat way). So – I want to avoid that the duplicate landing pages are being indexed (obviously), but I'm not sure if I should use noindex (nofollow as well?) or rel=canonical, since these landing pages are localized campaign versions of the generic page with more or less only paid traffic to them. I don't want to be accidentally penalized, but I still need the generic / main page to rank as high as possible... What would be your recommendation on this issue?
Intermediate & Advanced SEO | | ostesmorbrod0 -
Possible problem with new site (GWT no queries/very low index vs. submitted)
Hi everyone, I recently launched a new website for a small business loan company in the Dallas area. The site has been live for roughly a month and a half. I submitted everything to GWT as usual, including my sitemap. I am not sure what's going on with the site, as there is no activity from GWT in the impressions or queries. The submit vs. index is 24/3 (and hasn't moved). Also the queries graph on the overview stops at 3/18/2015... On another note, when I go to Crawl > Sitemaps, it shows that there were pages indexed during the month of march and then on April 3 it drops from 17 to 2 and never increases. Google says there are no errors or issues found, but I feel like there's something wrong. When I do site:, my URLs do pop up which makes me believe there's just a problem with my GWT. With that being said, I'm not happy THINKING there's something wrong. I need to actually know what the problem is. The only thing I can think of that I have done is purchase SSL for the site, but when I search what pages are indexed using www. it shows all the HTTPS URLS, so that would tell me that the site is getting indexed without a problem? Does anyone have a clue as to what might be happening? I will attach some screen shots so that you can get a better idea... KQ2366i D5xBNZf mF7kkgW
Intermediate & Advanced SEO | | jameswesleyhunt0 -
Previously owned domain & canonical
Hi, I've recently joined the business and as part of the cleanup process I got told that we owned this domain preferredsafaris.com with some very similar content to our main site southernafricatravel.com. We're no longer owns the preferredsafaris.com domain but looking at Google's cache for it we realised that the title, meta description & page shown when looking at the 'cached page' is for our current domain even though it is showing the 'correct' URL there. I imagine this might have something to do with canonical set on those pages but the weird thing is all those pages now render 404 & do not show a canonical in the source code. I have used Google Removal Tool https://www.google.com/webmasters/tools/removals for all those URLs & Google says that it has removed them & yet they're still showing. What do you suggest? Any potential issue in regards to duplicate content here? Cheers, Julien
Intermediate & Advanced SEO | | SouthernAfricaTravel0 -
Redirect 301 or Canonical.
Hello all, I have a page with a long post title and url path name (more than 70 caracters and 115). This page has many visits but I am changing the SEO website structure according to SEOMOz and forums guidelines so: I WILL CREATE A DUPLICATE PAGE WITH THE SAME INFO. This issue has been marked as an issue in the SEO tools, for long names>70 and url path names>115 My question is which option should I use and you would recommend me? 1. OPTION 1: Ideally I would like to keep the old post, so I should use the canonical tag, but my main concern is if the search engines in terms of SEO, even the canonical has been done, will penalise my SEO as there is still a post with bad SEO optimising, or if this is not the case because I already used the canonical. 2. OPTION 2: Eliminate the post and redirection 301 to the new page to keep the juice. I would prefer option 1, as I keep both post and page, but only if searchengines do not penalise my SEO as they detect a long post name and url path name. Thank you verty much, Antonio
Intermediate & Advanced SEO | | aalcocer20030 -
To "Rel canon" or not to "Rel canon" that is the question
Looking for some input on a SEO situation that I'm struggling with. I guess you could say it's a usability vs Google situation. The situation is as follows: On a specific shop (lets say it's selling t-shirts). The products are sorted as follows each t-shit have a master and x number of variants (a color). we have a product listing in this listing all the different colors (variants) are shown. When you click one of the t-shirts (eg: blue) you get redirected to the product master, where some code on the page tells the master that it should change the color selectors to the blue color. This information the page gets from a query string in the URL. Now I could let Google index each URL for each color, and sort it out that way. except for the fact that the text doesn't change at all. Only thing that changes is the product image and that is changed with ajax in such a way that Google, most likely, won't notice that fact. ergo producing "duplicate content" problems. Ok! So I could sort this problem with a "rel canon" but then we are in a situation where the only thing that tells Google that we are talking about a blue t-shirt is the link to the master from the product listing. We end up in a situation where the master is the only one getting indexed, not a problem except for when people come from google directly to the product, I have no way of telling what color the costumer is looking for and hence won't know what image to serve her. Now I could tell my client that they have to write a unique text for each varient but with 100 of thousands of variant combinations this is not realistic ir a real good solution. I kinda need a new idea, any input idea or brain wave would be very welcome. 🙂
Intermediate & Advanced SEO | | ReneReinholdt0 -
Should I use the canonical tag on all my mobile pages?
I've seen flavors of this question asked but did not see the exact response I was looking for. If I have a site at: www.site.com And I am creating a mobile version at: m.site.com (let's say a responsive design is not feasible at this time) And all the content on m.site.com is duplicative of the content on www.site.com What's the best way to handle that from an SEO perspective? Should I put a canonical tag on every mobile page pointing back to the www page? I assume that is better than a 'no index' tag on all pages of the mobile site?
Intermediate & Advanced SEO | | hbrown1080