To "Rel canon" or not to "Rel canon" that is the question
-
Looking for some input on an SEO situation that I'm struggling with. I guess you could say it's a usability vs. Google situation. The situation is as follows:
On a specific shop (let's say it's selling t-shirts), the products are organized as follows: each t-shirt has a master and x number of variants (one per color).
We have a product listing, and in this listing all the different colors (variants) are shown. When you click one of the t-shirts (e.g. blue) you get redirected to the product master, where some code on the page tells the master to switch the color selectors to blue. The page gets this information from a query string in the URL.
Now I could let Google index each URL for each color, and sort it out that way, except for the fact that the text doesn't change at all. The only thing that changes is the product image, and that is changed with AJAX in such a way that Google, most likely, won't notice it. Ergo, "duplicate content" problems.
OK! So I could solve this problem with a "rel canon", but then we are in a situation where the only thing that tells Google that we are talking about a blue t-shirt is the link to the master from the product listing.
We end up in a situation where the master is the only one getting indexed. That's not a problem except when people come from Google directly to the product: I have no way of telling what color the customer is looking for and hence won't know what image to serve her.
Now I could tell my client that they have to write a unique text for each variant, but with hundreds of thousands of variant combinations this is not realistic, or a really good solution.
I kinda need a new idea; any input, idea or brain wave would be very welcome.
-
Unfortunately, there are still a lot of gaps in how Google handles even the typical e-commerce site. Even issues like search pagination are incredibly complicated on large sites, and Google's answers are inconsistent at best. The only thing I'd say for sure is that I no longer believe the "let us handle it" advice. I've seen it go wrong too many times. I've become a big believer in controlling your own indexation.
-
I completely agree on every point (as I tried to explain above) and I could not come up with a better solution myself, but I thought I might give you guys a chance before jumping on the rel-canon bandwagon.
To be honest, I didn't expect any amazing ideas, but one could hope that I hadn't thought about everything. Unfortunately, it seems I had.
Thx for your time, everyone.
-
I'm afraid there's no perfect solution. The canonical tag probably is the best bet here - the risk of letting thousands of near-duplicates into the index is much greater than the cost of not landing people on specific colors.
Keep in mind that, once Google removes the color variants, only the "master" product page will appear in search. So, users won't really come into the site with a color intent (except in their heads). Whether that's good or bad for usability isn't clear. On the one hand, it would be nice to rank for every color and have users with a color in mind land on that specific product. On the other hand, some users don't have a color in mind (they know what they like when they see it), and landing on the main product pages shows them all available options. It really depends on your customers, but there are pros and cons, in terms of usability and conversion.
There's no magic Option #3, though - I'm 99% confident saying that. The risks of indexing all color variants post-Panda are relatively high, and I think you'll gain more from consolidating than you'll lose by leaving them all.
-
Hi and thx for your reply.
I agree with you, as I tried to explain in my post. But this doesn't really help me with the users from Google not getting served the correct picture, possibly leading to a high bounce rate. Plus I have the added problem that Google will see the master as less relevant for the colors as keywords, since the keyword won't be in the page title, h1, h2, etc. So all in all the page will have a very low relevance for the key phrase "blue t-shirt".
Hence I'm looking for a different solution.
-
This is exactly the kind of situation that rel="canonical" exists for. Product color is one of those classic examples SEOs bring up when explaining canonicalization. Don't trust Google to figure things out on their own - make it clear to them that these pages are related and should be treated as such.
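For what it's worth, here is a minimal sketch of what that could look like on the server side, assuming the color arrives in a query parameter called `color` (the thread never names the parameter, and `shop.example.com` is just a placeholder). Every variant URL ends up pointing at the master as its canonical:

```python
from html import escape
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Assumption: "color" is the query parameter that selects the variant.
# The original post only says the color comes from "a query string in the URL".
VARIANT_PARAMS = {"color"}


def canonical_url(url: str) -> str:
    """Return the master URL: the same URL minus variant-only parameters."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key not in VARIANT_PARAMS
    ]
    # Drop the fragment too; it is not part of what the server is asked for.
    return urlunsplit((parts.scheme, parts.netloc, parts.path, urlencode(kept), ""))


def canonical_link_tag(url: str) -> str:
    """Build the <link> element to place in the <head> of every variant page."""
    href = escape(canonical_url(url), quote=True)
    return f'<link rel="canonical" href="{href}">'


if __name__ == "__main__":
    print(canonical_link_tag("https://shop.example.com/t-shirt?color=blue"))
```

Other parameters are left alone here, so whether something like a size parameter should also be stripped is a per-shop decision.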
-
or maybe my explanation is just crappy
-
Ah, sorry. I misunderstood the question then.
-
Hi there and thanks for your input. But what you mention is exactly what I already have (maybe I just explained it badly). I was kinda looking for a different, amazingly brilliant solution that I hadn't thought of myself.
But your thoughts and time are very much appreciated. If you have any other ideas, do let me know.
-
Hi Rene,
My first impression after reading your question was that I had met the same situation with faceted navigation.
Still, this is something different. My advice would be to put the rel=canonical on and get rid of the duplicate content. This way you will have one default image, and the visitor can then choose what they need (and you just reload the image).
Writing all those hundreds of thousands of unique texts wouldn't be the solution, I believe. Still, you can use a parameter in the "facets" such as a #color fragment, so if people would like to share this content with their friends they can pass along a visitor-friendly URL. That would be my choice.
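To make the #color idea a bit more concrete, here is a rough sketch of the URL handling. The function names, the example domain and the color list are all made up for illustration. One caveat: the fragment is never sent to the server, so on a real shop the reading part would have to run in the page's own script in the browser; Python is used here only to show the logic.

```python
from urllib.parse import quote, unquote, urlsplit, urlunsplit


def share_url(master_url: str, color: str) -> str:
    """Attach the chosen color to the master URL as a fragment, e.g. '#blue'."""
    parts = urlsplit(master_url)
    return urlunsplit(parts._replace(fragment=quote(color, safe="")))


def pick_color(url: str, available: set, default: str) -> str:
    """Read the color from the fragment; fall back to the default image color."""
    fragment = unquote(urlsplit(url).fragment)
    return fragment if fragment in available else default


if __name__ == "__main__":
    colors = {"blue", "red", "light blue"}  # hypothetical variant list
    link = share_url("https://shop.example.com/t-shirt", "blue")
    print(link)
    print(pick_color(link, colors, default="red"))
```

Since the part after the # does not change which page is requested, the shared link still resolves to the single master page, which is the point of combining it with the canonical.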
I hope that helped,
Istvan