Self referencing canonicals AND duplicate URLs. Have I set them up correctly?
-
Hi team,
We've recently redesigned our website.
Originally we had separate product listings for every product. Even if there was one design in two colours, each colour had its own listing.
With the redesign we merged all of these identical products to help with duplicate content. Customers can now browse the different stone colours available in that design from a single product listing (bottom left of screen under 'select a stone' on a product page)
When the customer changes the stone colour, the product images change to the new colour and its product code is appended to the end of the existing URL. eg:
http://www.mountainjade.co.nz/necklaces/assorted-jades-open-koru-necklace-jc1725/ (original listing)
http://www.mountainjade.co.nz/necklaces/assorted-jades-open-koru-necklace-jc1725/?sku=JC1725BL (black selected)
We have the following self referencing canonicals on all product pages [current-page:url:absolute], yet MOZ is telling me I have alot of duplicate content on pages with the above example.
Have I implemented the canonicals correctly? Is this why Moz is flagging the listings as duplicate?
-
If you've got that path anywhere in your navigation or other internal linking, you'd want to remove that or update it to /shop/necklaces/. The next step would be to 301 redirect /shop/necklaces/necklace/ to /shop/necklaces/ just in case you've got any links pointing to it - this will get your users where they want to go and also let search engines know you've relocated the page.
-
One last question,
How exactly would I remove /shop/necklaces/necklace/?
Sorry if that's a stupid question. I just want to know a bit more before I take it to our dev.
Thanks.
-
Thanks for this Logan!
I really appreciate the help.
-
As Yossi said, configuring parameters in Search Console should help - _but, _that's only going to help you out in Google.
Adding a disallow for those parameters in the robots file will help solve the problem in other search engines.
The thin content is definitely contributing as well. Moz identifies dupes based on a source code match between any two pages of 90% or higher. When you consider all your template code is the same across every page, thin content isn't enough to differentiate the source code.
I also noticed on one of those screenshots that you got a one dupe of /shop/necklaces/ and /shop/necklaces/necklace/. If you can, I recommend removing that second one with doubled up 'necklace' folders, that's going to cause a lot of dupes as well.
-
Hi Logan,
Thanks for looking into the canonicals for me. I'm glad to hear they appear to be configured correctly.
There are alot of duplicate page issues, with 109 in total at the moment.
Some are similar to the above example, some are URLS that contain refined search parameters (price, design etc), but most are just products which are almost identical. I think this is because most product pages have thin generic content, so for those examples we're in the process of writing unique product descriptions and adding unique imagery.
I've attached a few screenshot if you'd like to take a look. Your thoughts would be much appreciated
-
Thanks so much for the reply Yossi.
Great tip about using GSC URL parameter tools. I'll definitely implement that.
Appreciate it.
Jake
-
Jacob, as Logan wrote it looks like the canonicals are good to go.. (i just did a small sampling though..)
Not sure how your URLs are set but if the "sku=XXX" parameters are used only for color variations of a specific product, then you can use the URL paramater setting in Google Search Console.This will make your life easier, and it will ensure that no duplicate content is crawled by Google. But URL parameters must be used with caution
good luck
Yossi -
Hi Jacob,
I took a look at your site, and the canonicals appear to be configured correctly. When you look at your duplicates in the Site Crawl report in Moz, and you click the + next to where it says "1 duplicate", what are you seeing? Is it a URL set like the example you've used above, or something else?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
What to do with existing URL when replatforming and new URL is the same?
We are changing CMS from WordPress to Uberflip. If there is a URL that remains the same I believe we should not create a redirect. However, what happens to the old page? Should it be deleted?
Technical SEO | | maland0 -
Duplicate content analysis
Hi all,We have some pages being flagged as duplicates by the google search console. However, we believe the content on these pages is distinctly different (for example, they have completely different search results returned, different headings etc). An example of two pages google finds to be duplicates is below. if anyone can spot what might be causing the duplicate issue here, would very much appreciate suggestions! Thanks in advance.
Technical SEO | | Eric_S
Examples: https://www.vouchedfor.co.uk/IFA-financial-advisor-mortgage/harborne
https://www.vouchedfor.co.uk/accountant/harborne0 -
Canonical question
I have at least three duplicate main pages on my website: www.augustbullocklaw.com www.augustbullocklaw.com/index augustbullocklaw.com I want the first one, www.augustbullocklaw.com to be the main page. I put this code on the index page and uploaded it to my site: http://www.augustbullocklaw.com/canonical-version-of-page/" rel="canonical" /> This code now appears on all three pages shown above. Did I do this correctly? I surmise that www.augustbullocklaw.com is pointing to itself. Is that ok? I don't know how to take the cononical code off the page that is the page I want to be the main page. (I don't know how to remove it from www.augustbullocklaw.com, but leave it on www.augustbullocklaw.com/index and augustbullocklaw.com) Thanks
Technical SEO | | Augster990 -
I need help with a PHP canonical URL tags
I found a little difficult for me to do a canonical tag in my PHP. On-Page Report Card We check to make sure that IF you use canonical URL tags, it points to the right page. If the canonical tag points to a different URL, engines will not count this page as the reference resource and thus, it won't have an opportunity to rank. If you've not made this page the rel=canonical target, change the reference to this URL. NOTE: For pages not employing canonical URL tags, this factor does not apply. I don't know how to tidy my PHP Any suggestion.
Technical SEO | | lnietob0 -
Category URL Duplicate Content
I've recently been hired as the web developer for a company with an existing web site. Their web architecture includes category names in product urls, and of course we have many products in multiple categories thus generating duplicate content. According to the SEOMoz Site Crawl, we have roughly 1600 pages of duplicate content, I expect primarily from this issue. This is out of roughly 3600 pages crawled. My questions are: 1. Fixing this for the long term will obviously mean restructuring the URLs for the site. Is this worthwhile and what will the ramifications be of performing such a move? 2. How can I determine the level and extent of the effects of this duplicated content? 3. Is it possible the best course of action is to do nothing? The site has many, many other issues, and I'm not sure how highly to prioritize this problem. In addition, the IT man is highly doubtful this is causing an SEO issue, and I'm going to need to be able to back up any action I request. I do feel I will need to strongly justify any possible risks this level of site change could cause. Thanks in advance, and please let me know if any more information is needed.
Technical SEO | | MagnetsUSA0 -
Crawl reveals hundreds of urls with multiple urls in the url string
The latest crawl of my site revealed hundreds of duplicate page content and duplicate page title errors. When I looked it was from a large number of urls with urls appended to them at the end. For example: http://www.test-site.com/page1.html/page14.html or http://www.test-site.com/page4.html/page12.html/page16.html some of them go on for a hundred characters. I am totally stymied, as are the people at my ISP and the person who talked to me on the phone from SEOMoz. Does anyone know what's going on? Thanks So much for any help you can offer! Jean
Technical SEO | | JeanYates0 -
Should Canonical URLs be used in Wordpress?
Wordpress offers Canonical URLs in the "All in one SEO" settings. I know that canonical tags for page content will cause the search engine to ignore the content, but I don't understand this setting in Wordpress. The Canonical URLs box for my blog had been checked until a couple weeks ago. I unchecked it (removing the canonical tag) and now I have about 300 duplicate content pages acccording to my SEOMoz reports. It appears that it's just the blog tag in the url now that is causing the confusion. Here's an example of the same url with two tags: http://www.rmtracking.com/blog/tag/aclu/ http://www.rmtracking.com/blog/tag/rfid/ Should I activate the canonical URL setting in Wordpress again. If not, how can I fix this? Your assistance is greatly appreciated. Regards, Brad
Technical SEO | | BradBorst0