Self referencing canonicals AND duplicate URLs. Have I set them up correctly?
-
Hi team,
We've recently redesigned our website.
Originally we had separate product listings for every product. Even if there was one design in two colours, each colour had its own listing.
With the redesign we merged all of these identical products to help with duplicate content. Customers can now browse the different stone colours available in that design from a single product listing (bottom left of screen under 'select a stone' on a product page)
When the customer changes the stone colour, the product images change to the new colour and its product code is appended to the end of the existing URL. eg:
http://www.mountainjade.co.nz/necklaces/assorted-jades-open-koru-necklace-jc1725/ (original listing)
http://www.mountainjade.co.nz/necklaces/assorted-jades-open-koru-necklace-jc1725/?sku=JC1725BL (black selected)
We have the following self referencing canonicals on all product pages [current-page:url:absolute], yet MOZ is telling me I have alot of duplicate content on pages with the above example.
Have I implemented the canonicals correctly? Is this why Moz is flagging the listings as duplicate?
-
If you've got that path anywhere in your navigation or other internal linking, you'd want to remove that or update it to /shop/necklaces/. The next step would be to 301 redirect /shop/necklaces/necklace/ to /shop/necklaces/ just in case you've got any links pointing to it - this will get your users where they want to go and also let search engines know you've relocated the page.
-
One last question,
How exactly would I remove /shop/necklaces/necklace/?
Sorry if that's a stupid question. I just want to know a bit more before I take it to our dev.
Thanks.
-
Thanks for this Logan!
I really appreciate the help.
-
As Yossi said, configuring parameters in Search Console should help - _but, _that's only going to help you out in Google.
Adding a disallow for those parameters in the robots file will help solve the problem in other search engines.
The thin content is definitely contributing as well. Moz identifies dupes based on a source code match between any two pages of 90% or higher. When you consider all your template code is the same across every page, thin content isn't enough to differentiate the source code.
I also noticed on one of those screenshots that you got a one dupe of /shop/necklaces/ and /shop/necklaces/necklace/. If you can, I recommend removing that second one with doubled up 'necklace' folders, that's going to cause a lot of dupes as well.
-
Hi Logan,
Thanks for looking into the canonicals for me. I'm glad to hear they appear to be configured correctly.
There are alot of duplicate page issues, with 109 in total at the moment.
Some are similar to the above example, some are URLS that contain refined search parameters (price, design etc), but most are just products which are almost identical. I think this is because most product pages have thin generic content, so for those examples we're in the process of writing unique product descriptions and adding unique imagery.
I've attached a few screenshot if you'd like to take a look. Your thoughts would be much appreciated
-
Thanks so much for the reply Yossi.
Great tip about using GSC URL parameter tools. I'll definitely implement that.
Appreciate it.
Jake
-
Jacob, as Logan wrote it looks like the canonicals are good to go.. (i just did a small sampling though..)
Not sure how your URLs are set but if the "sku=XXX" parameters are used only for color variations of a specific product, then you can use the URL paramater setting in Google Search Console.This will make your life easier, and it will ensure that no duplicate content is crawled by Google. But URL parameters must be used with caution
good luck
Yossi -
Hi Jacob,
I took a look at your site, and the canonicals appear to be configured correctly. When you look at your duplicates in the Site Crawl report in Moz, and you click the + next to where it says "1 duplicate", what are you seeing? Is it a URL set like the example you've used above, or something else?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Home page URL
Hi, I work on this site: http://www.towerhousetraining.co.uk/about-us. This is the home page URL. Should this be 301'd to: http://www.towerhousetraining.co.uk? I have created a site map, which I submitted to Google Webmaster Tools, which includes these URL's: /about-us, /training-we-offer & /contact-us. There are a total of 3 pages on the website. Webmaster tools has only indexed 2 out of 3 pages. I think this is something to do with the /about-us URL, as when I do a site: search, these pages appear: www.towerhousetraining.co.uk/, /training-we-offer & /contact-us. I am not sure why Google has indexed the home page as www.towerhousetraining.co.uk/ and not /about-us? Is it a bad idea in general not to have your homepage as your root domain? I added a to the homepage, but am wondering if this was the right thing to do? Any help would be appreciated.
Technical SEO | | CWseo0 -
Second URL
Hi We have a .com and a .co.uk Main website is .co.uk, we also have a landing page for the .com If we redirect the .com to the .co.uk, will it create duplicate content ... May seem like a silly question, but want to be sure that that the visitors cant access our website at both urls, as that would be duplicate content Thanks in advance John
Technical SEO | | Johnny4B0 -
Duplicate pages
Hi Can anyone tell me why SEO MOZ thinks these paes are duplicates when they're clearly not? Thanks very much Kate http://www.katetooncopywriter.com.au/how-to-be-a-freelance-copywriter/picture-1-58/ http://www.katetooncopywriter.com.au/portfolio/clients/other/ http://www.katetooncopywriter.com.au/portfolio/clients/travel/ http://www.katetooncopywriter.com.au/webservices/what-i-do/blog-copywriter/
Technical SEO | | ToonyWoony0 -
Duplicate content
I have two page, where the second makes a duplicate content from the first Example:www.mysite.com/mypagewww.mysite.com/mysecondpageIf i insert still making duplicate content?Best regards,Wendel
Technical SEO | | peopleinteractive0 -
Is 100% duplicate content always duplicate?
Bit of a strange question here that would be keen on getting the opinions of others on. Let's say we have a web page which is 1000 lines line, pulling content from 5 websites (the content itself is duplicate, say rss headlines, for example). Obviously any content on it's own will be viewed by Google as being duplicate and so will suffer for it. However, given one of the ways duplicate content is considered is a page being x% the same as another page, be it your own site or someone elses. In the case of our duplicate page, while 100% of the content is duplicate, the page is no more than 20% identical to another page so would it technically be picked up as duplicate. Hope that makes sense? My reason for asking is I want to pull latest tweets, news and rss from leading sites onto a site I am developing. Obviously the site will have it's own content too but also want to pull in external.
Technical SEO | | Grumpy_Carl0 -
Canonical efficiency
Hi, I'm creating recommendations for one of my client's site. It's a news site highly based on a regional aspect. One of the main features would be that you can navigate on a high level, we call it inter-regional (with all the regions news) and on the regional level (with only news related to the region) which act as a filter which means that most of my content will be duplicate. To allow the user to navigate the site on the two levels means that all the news pages will be duplicated, one with the inter-regional URL and one with the regional URL. Example: http://www.sitename.com/category/2011/11/07/name-of-the-article http://www.sitename.com/region-name/category/2011/11/07/name-of-the-article The regional URL is the official one, since it has all the keywords I want, and I'm planning to have a canonical on both version with the regional URL. Is there a risk that this would affect my ranking? Any alternatives? I read that I could prevent SE to crawl inter-regional articles using my robot.txt but I'm not fond of that. Thanks!
Technical SEO | | Pherogab0 -
Blog URLs
I read somewhere - pretty sure is was in Art of SEO - that having dates in the blog permalink URLs was a bad idea. e.g. /blog/2011/3/my-blog-post/ However, looking at Wordpress best practice, it's also not a good idea to have a URL without a number - it's more resource hungry if you don't , apparently. e.g. /blog/my-blog-post/ Does anyone have any views on this? Thanks Ben
Technical SEO | | atticus70 -
Duplicate content
I have to sentences that I want to optimize to different pages for. sentence number one is travel to ibiza by boat sentence number to is travel to ibiza by ferry My question is, can I have the same content on both pages exept for the keywords or will Google treat that as duplicate content and punish me? And If yes, where goes the limit/border for duplicate content?
Technical SEO | | stlastla0