Duplicate Content & Canonicals
-
I am a bit confused about canonicals and whether they are "working" properly on my site. In Webmaster Tools, I'm showing about 13,000 pages flagged for duplicate content, but nearly all of them are showing two pages, one URL as the root and a second with parameters. Case in point, these two are showing as duplicate content:
http://www.gallerydirect.com/art/product/vincent-van-gogh/starry-night
We have a canonical tag on each of the pages pointing to the one without the parameters. Pages with other parameters don't show as duplicates, just one root and one dupe per listing.
So, am I not using the canonical tag properly? It is clearly listed as:
Is the tag perhaps not formatted properly (I saw someone somewhere state that there needs to be a /> after the URL, but that seems rather picky for Google)?
Suggestions?
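For reference, the setup described in this question (every parameterized variant pointing at the parameter-free URL) can be sketched in a few lines of Python. This is only an illustration, not the site's actual code: the helper names `canonical_url` and `canonical_link_tag` are hypothetical, and it assumes the canonical version is always the URL with the query string removed.

```python
from html import escape
from urllib.parse import urlsplit, urlunsplit


def canonical_url(url: str) -> str:
    """Return the URL with its query string and fragment removed."""
    parts = urlsplit(url)
    return urlunsplit((parts.scheme, parts.netloc, parts.path, "", ""))


def canonical_link_tag(url: str) -> str:
    """Build the <link rel="canonical"> element for any variant of a page."""
    href = escape(canonical_url(url), quote=True)
    return f'<link rel="canonical" href="{href}" />'


# A parameterized variant of the product page discussed in this thread.
variant = (
    "http://www.gallerydirect.com/art/product/vincent-van-gogh/starry-night"
    "?substrate_id=3&product_style_id=8&frame_id=63&size=25x20"
)
print(canonical_link_tag(variant))
```

Every variant of a listing should produce the same tag, which goes in the page's head.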
-
Thanks, Dr. Pete.
I'll discuss the options with our dev team and see which one will cause the least amount of developer caffeine consumption.
-
Argh... sorry, I didn't even check/see that. Yeah, that may be a real problem - you're basically sending two canonicalization signals that are in conflict. Is there any way to hide the defaults? If the canonicals point to (A), but then (A) redirects to (B), Google may just ignore the canonical.
Unfortunately, your options are to either: (1) hope for the best, (2) canonical to the uglier URL, or (3) kill the redirect and set the default parameters on the server-side (without resetting the URL).
I am primarily seeing the canonical URL in Google's index, so I'm not sure it's actually causing you harm. It's just not an ideal situation.
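The conflict described above (the canonical points to A, but A redirects to B) is easy to test for once you have a list of the site's redirects. Here is a minimal Python sketch. The redirect map, the example.com URLs, the `frame_id` parameter, and the function names are all made up for illustration. In practice you would build the map from a crawl or from server configuration.

```python
def resolve(url: str, redirects: dict[str, str], max_hops: int = 10) -> str:
    """Follow entries in a redirect map until reaching a URL that does not redirect."""
    seen = {url}
    for _ in range(max_hops):
        target = redirects.get(url)
        if target is None:
            return url
        if target in seen:
            raise ValueError(f"redirect loop at {target}")
        seen.add(target)
        url = target
    raise ValueError("too many redirects")


def canonical_conflict(canonical_href: str, redirects: dict[str, str]) -> bool:
    """True when the canonical target itself redirects somewhere else."""
    return resolve(canonical_href, redirects) != canonical_href


# Hypothetical URLs: the clean URL redirects to a version with default parameters.
clean = "http://www.example.com/art/product/starry-night"
with_defaults = clean + "?frame_id=1"
redirects = {clean: with_defaults}

print(canonical_conflict(clean, redirects))          # canonical -> clean URL: conflict
print(canonical_conflict(with_defaults, redirects))  # canonical -> final URL: no conflict
```

Options (2) and (3) both make this check pass. Option (2) changes the canonical to the URL that actually resolves, while option (3) removes the redirect entry altogether.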
-
Dr. Pete:
I'm looking into it to be sure, but I believe that you are correct in that this is an ad-tracking URL.
A follow-up question:
The URL that is the canonical version of each page would be in the format of
http://www.gallerydirect.com/art/product/vincent-van-gogh/starry-night
However, this exact URL redirects to one with default parameters for substrate, style and frame size:
Should we change our canonical from the first URL (without the parameters) to the second URL with the parameters? Or is that a moot point with Google?
-
While the properly closed tag should have "... />", that's generally only an issue in very isolated cases. I've never seen it interfere with a canonical tag. It's a harmless change to make (and it is more correct), but my gut reaction is that this will make no difference. Google should be honoring these canonicals.
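As a rough illustration of why the trailing "/>" rarely matters, here is a short Python sketch using the standard library's `html.parser`. It shows only how one tolerant HTML parser treats the two forms and says nothing definitive about Google's own parser. The class and function names are hypothetical.

```python
from html.parser import HTMLParser


class CanonicalFinder(HTMLParser):
    """Collect the href of every <link rel="canonical"> element in a document."""

    def __init__(self) -> None:
        super().__init__()
        self.hrefs: list[str] = []

    def handle_starttag(self, tag, attrs):
        # Self-closing tags (<link ... />) also reach this method, because the
        # default handle_startendtag() delegates to handle_starttag().
        attr = dict(attrs)
        rel = (attr.get("rel") or "").lower().split()
        if tag == "link" and "canonical" in rel and attr.get("href"):
            self.hrefs.append(attr["href"])


def find_canonicals(html: str) -> list[str]:
    """Return all canonical hrefs found in the given HTML, in document order."""
    finder = CanonicalFinder()
    finder.feed(html)
    finder.close()
    return finder.hrefs


url = "http://www.gallerydirect.com/art/product/vincent-van-gogh/starry-night"
unclosed = f'<head><link rel="canonical" href="{url}"></head>'
self_closed = f'<head><link rel="canonical" href="{url}" /></head>'

print(find_canonicals(unclosed) == find_canonicals(self_closed))
```

The same helper can also flag pages that carry more than one canonical tag, since it returns every match it finds.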
One odd thing I'm seeing. If I dig into the index, I'm finding the following page:
This may be an ad-tracking URL (?) and it's redirecting somehow (but not with a 301 or 302) to the non-canonical URL. This may be sending a mixed signal, and ideally it would redirect to the canonical version of the URL. I'm not sure where this version is coming from, so it's a bit hard to diagnose.
-
Hi Darin
The tag is not working because if you go into Google and enter the URL: http://www.gallerydirect.com/art/product/vincent-van-gogh/starry-night?substrate_id=3&product_style_id=8&frame_id=63&size=25x20 you will see that it is being indexed on Google.
If it's being indexed, then it runs the risk of duplicate content issues.
The tag definitely does need the /> at the end, so the correct usage of the tag would be: <link rel="canonical" href="http://www.gallerydirect.com/art/product/vincent-van-gogh/starry-night" />
I think if you implement that small change, there shouldn't be any problems.
Hope this helps.