Self referencing canonicals AND duplicate URLs. Have I set them up correctly?
-
Hi team,
We've recently redesigned our website.
Originally we had separate product listings for every product. Even if there was one design in two colours, each colour had its own listing.
With the redesign we merged all of these identical products to help with duplicate content. Customers can now browse the different stone colours available in that design from a single product listing (bottom left of screen under 'select a stone' on a product page)
When the customer changes the stone colour, the product images change to the new colour and its product code is appended to the end of the existing URL. eg:
http://www.mountainjade.co.nz/necklaces/assorted-jades-open-koru-necklace-jc1725/ (original listing)
http://www.mountainjade.co.nz/necklaces/assorted-jades-open-koru-necklace-jc1725/?sku=JC1725BL (black selected)
We have the following self referencing canonicals on all product pages [current-page:url:absolute], yet MOZ is telling me I have alot of duplicate content on pages with the above example.
Have I implemented the canonicals correctly? Is this why Moz is flagging the listings as duplicate?
-
If you've got that path anywhere in your navigation or other internal linking, you'd want to remove that or update it to /shop/necklaces/. The next step would be to 301 redirect /shop/necklaces/necklace/ to /shop/necklaces/ just in case you've got any links pointing to it - this will get your users where they want to go and also let search engines know you've relocated the page.
-
One last question,
How exactly would I remove /shop/necklaces/necklace/?
Sorry if that's a stupid question. I just want to know a bit more before I take it to our dev.
Thanks.
-
Thanks for this Logan!
I really appreciate the help.
-
As Yossi said, configuring parameters in Search Console should help - _but, _that's only going to help you out in Google.
Adding a disallow for those parameters in the robots file will help solve the problem in other search engines.
The thin content is definitely contributing as well. Moz identifies dupes based on a source code match between any two pages of 90% or higher. When you consider all your template code is the same across every page, thin content isn't enough to differentiate the source code.
I also noticed on one of those screenshots that you got a one dupe of /shop/necklaces/ and /shop/necklaces/necklace/. If you can, I recommend removing that second one with doubled up 'necklace' folders, that's going to cause a lot of dupes as well.
-
Hi Logan,
Thanks for looking into the canonicals for me. I'm glad to hear they appear to be configured correctly.
There are alot of duplicate page issues, with 109 in total at the moment.
Some are similar to the above example, some are URLS that contain refined search parameters (price, design etc), but most are just products which are almost identical. I think this is because most product pages have thin generic content, so for those examples we're in the process of writing unique product descriptions and adding unique imagery.
I've attached a few screenshot if you'd like to take a look. Your thoughts would be much appreciated
-
Thanks so much for the reply Yossi.
Great tip about using GSC URL parameter tools. I'll definitely implement that.
Appreciate it.
Jake
-
Jacob, as Logan wrote it looks like the canonicals are good to go.. (i just did a small sampling though..)
Not sure how your URLs are set but if the "sku=XXX" parameters are used only for color variations of a specific product, then you can use the URL paramater setting in Google Search Console.This will make your life easier, and it will ensure that no duplicate content is crawled by Google. But URL parameters must be used with caution
good luck
Yossi -
Hi Jacob,
I took a look at your site, and the canonicals appear to be configured correctly. When you look at your duplicates in the Site Crawl report in Moz, and you click the + next to where it says "1 duplicate", what are you seeing? Is it a URL set like the example you've used above, or something else?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Google has deindexed a page it thinks is set to 'noindex', but is in fact still set to 'index'
A page on our WordPress powered website has had an error message thrown up in GSC to say it is included in the sitemap but set to 'noindex'. The page has also been removed from Google's search results. Page is https://www.onlinemortgageadvisor.co.uk/bad-credit-mortgages/how-to-get-a-mortgage-with-bad-credit/ Looking at the page code, plus using Screaming Frog and Ahrefs crawlers, the page is very clearly still set to 'index'. The SEO plugin we use has not been changed to 'noindex' the page. I have asked for it to be reindexed via GSC but I'm concerned why Google thinks this page was asked to be noindexed. Can anyone help with this one? Has anyone seen this before, been hit with this recently, got any advice...?
Technical SEO | | d.bird0 -
Duplicate content issue
Moz crawl diagnostic tool is giving me a heap of duplicate content for each event on my website... http://www.ticketarena.co.uk/events/Mint-Festival-7/ http://www.ticketarena.co.uk/events/Mint-Festival-7/index.html Should i use a 301 redirect on the second link? i was unaware that this was classed as duplicate content. I thought it was just the way the CMS system was set up? Can anyone shed any light on this please. Thanks
Technical SEO | | Alexogilvie0 -
URL removals
Hello there, I found out that some pages of the site have two different URL's pointing at the same page generating duplicate content, title and description. Is there a way to block one of them? cheers
Technical SEO | | PremioOscar0 -
Are duplicate page titles fixed by the canonical tag
Google Web Master Tools is saying that some of my pages have duplicate page titles because of pagination. However, I have implemented the canonical tag on the paginated pages which I thought would keep my site from being penalized for duplicate page titles. Is this correct? Or does canonical tag only relate to duplicate content issues?
Technical SEO | | Santaur0 -
Am I self canabalizing?
Hi All, I'm continuing my search to see why my ranking is slipping and slipping; I thought maybe self cannibalizing is an issue. I guessed this as my competitors don't have three pages ranking highly for a particular key word; it is just the one page that ranks highly for them - for all of the important keywords. I have three sometimes four pages ranking (albeit not highly ranking) for a particular keyword. An earlier post said that the pageranker tool on SEOMOZ highlighted self cannibalization; I've checked my pages on this tool and they all get A's and passed the cannibalization check. Does this Absolutely mean that cannibalization is not theproblem for me? Thanks,
Technical SEO | | girlie0 -
Multiple URLs in CMS - duplicate content issue?
So about a month ago, we finally ported our site over to a content management system called Umbraco. Overall, it's okay, and certainly better than what we had before (i.e. nothing - just static pages). However, I did discover a problem with the URL management within the system. We had a number of pages that existed as follows: sparkenergy.com/state/name However, they exist now within certain folders, like so: sparkenergy.com/about-us/service-map/name So we had an aliasing system set up whereby you could call the URL basically whatever you want, so that allowed us to retain the old URL structure. However, we have found that the alias does not override, but just adds another option to finding a page. Which means the same pages can open under at least two different URLs, such as http://www.sparkenergy.com/state/texas and http://www.sparkenergy.com/about-us/service-map/texas. I've tried pointing to the aliased URL in other parts of the site with the rel canonical tag, without success. How much of a problem is this with respect to duplicate content? Should we bite the bullet, remove the aliased URLs and do 301s to the new folder structure?
Technical SEO | | ufmedia0 -
Is a 302 redirect the correct redirect from a root URL to a detail page?
Hi guys The widely followed SEO best practice is that 301 redirects should be used instead of 302 redirects when it is a permanent redirect that is required. Matt Cutts said last year that 302 redirects should "only" be used for temporary redirects. http://www.seomoz.org/blog/whiteboard-interview-googles-matt-cutts-on-redirects-trust-more For a site that I am looking at the SEO Moz Crawll Diagnostics tool lists as an issue that the URL / redirects to www.abc.com/Pages/default.aspx with a 302 redirect. On further searching I found that on a Google Support forum (http://www.google.com/support/forum/p/Webmasters/thread?tid=276539078ba67f48&hl=en) that a Google Employee had said "For what it's worth, a 302 redirect is the correct redirect from a root URL to a detail page (such as from "/" to "/sites/bursa/"). This is one of the few situations where a 302 redirect is preferred over a 301 redirect." Can anyone confirm if it is the case that "a 302 redirect is the correct redirect from a root URL to a detail page"? And if so why as I haven't found an explanation. If it is the correct best practice then should redirects of this nature be removed from displaying as issues in the SEO Moz Crawll Diagnostics tool Thanks for your help
Technical SEO | | CPU0 -
Duplicate content and URL's
Hi Guys, Hope you are all well. Just a quick question which you will find nice and easy 🙂 I am just about to work through duplicate content pages and URL changes. Firstly, With the duplicate content issue i am finding the seo friendly URL i would normally direct to in some cases has less links, authority and root domain to it than some of the unseo friendly URL's. will this harm me if i still 301 redirect them to the seo friendly URL. Also, With the url changed it is going to be a huge job to change all the url so they are friendly and the CMS system is poor. Is there a better way of doing this? It has been suggested that we create a new webpage with a friendly URL and redirect all the pages to that. Will this lose all the weight as it will be a brand new page? Thank you for your help guys your legends!! Cheers Wayne
Technical SEO | | wazza19850