Duplicity Problems - What to do with similar products in e-commerce?
-
Hello,
I have an eCommerce website with hundreds of similar products. On some occasions, besides for their measurements they are completely identical.
The titles are kept different by using the stock reference and the meta descriptions also use their measurements.
However, I'm gettingDuplicate Page Content errors by the MOZ crawler.
This is more than understandable since the products are very similar -
WHAT SHOULD I DO???I noticed a similar situation in BlueNile (the diamond ecommerce site) - They have numerous almost identical pages, see example:
http://www.bluenile.com/round-diamond-1-carat-or-less-ideal-cut-g-color-vs1-clarity_LD02424873
http://www.bluenile.com/round-diamond-1-carat-or-less-ideal-cut-g-color-vs1-clarity_LD02430168
For some reason, they did on each page a canonical to it's self...
I wanted to add...
It is impossible to add different descriptive texts due to the amount of products and to the rapidness they are sold (each product is unique - similar to the diamonds in the BlueNile example).
-
Dear Cyrus,
I completely agree that there is no good and added value with the stock id and measurements for Google but I felt like I had no choice.
I didn't want to start putting canonical between the pages because every other day an item is sold and then I would need to change the canonical to a similar existing item.
Are you saying that when a page makes a canonical to himself Google does not index it? Or treats it as a non original page (a copied page) even if I don't specify from where it is copied?
Please see the following question I asked that is about this matter and got a different response: http://www.seomoz.org/q/is-there-a-reason-to-put-a-canonical-to-yourself-interesting-case
Thanks
-
First, let me explain the SEOmoz duplicate content errors. These are issued anytime the HTML of a page is 95% similar to another page (this means the entire code, not just the text). It sounds like this is what is happening in your case.
Blue Nile solves this dilemma with the canonical tag. They are basically telling the search engines to consolidate all the pages into one for ranking purposes. The downside of this is that any page that doesn't point to itself isn't going to rank.
You stated that each title and description are differentiated using the "stock reference" and "measurements." The big question is... are these important for ranking? By this I mean do your customers search Google for your products by stock number and/or measurements?
If it were me, and without knowing more about your situation, I would try to consolidate your product pages as much as possible and use the canonical tag, similar to Blue Nile, on near-duplicate pages (strictly speaking, Google states the canonical tag is only for exact duplicates, but in the real world they are more flexible)
Hope this helps! Best of luck with your SEO.
-
Thanks for the reply but I am unable to create the 40% unique content.
My case is exactly like the BlueNile sample I gave on top...
These are extremely similar products but still each is unique because of slight differences (that are important to the buyers). I have thousands of products and each product is one of a kind - when it is sold - it is removed to the "sold items" section.
There is no way (and no point since each product can be sold once) to write a description to so many products that are constantly changing.
-
Your errors can be incurred for a number of reasons. You need to ensure you have a enough unique content per page, If you only have a few words or character of text related to any particular item and only a few unique words in the Title tag you will be flagged for duplication. Expand unique text where you can and ensure only Primary Brand Keywords are in the Title tag such that each page should have a majority of unique text. If your URLs are dynamic in nature investigate opportunities to make them Human Readable and in a structured format. SEOmoz has written numerous guides on URL structure. Place unique content wherever you can in images files names, alt text etc... Think minimum of 40% content differential per page including the site template. Too many links in a navigation can impact you if you have limited body content on a page.
-
It looks like on those two examples its just the table% and depth % that are different? Any way you could just combine the similar products, and just make it a option to select the different table % and depth%?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate Content... Really?
Hi all, My site is www.actronics.eu Moz reports virtually every product page as duplicate content, flagged as HIGH PRIORITY!. I know why. Moz classes a page as duplicate if >95% content/code similar. There's very little I can do about this as although our products are different, the content is very similar, albeit a few part numbers and vehicle make/model. Here's an example:
Intermediate & Advanced SEO | | seowoody
http://www.actronics.eu/en/shop/audi-a4-8d-b5-1994-2000-abs-ecu-en/bosch-5-3
http://www.actronics.eu/en/shop/bmw-3-series-e36-1990-1998-abs-ecu-en/ate-34-51 Now, multiply this by ~2,000 products X 7 different languages and you'll see we have a big dupe content issue (according to Moz's Crawl Diagnostics report). I say "according to Moz..." as I do not know if this is actually an issue for Google? 90% of our products pages rank, albeit some much better than others? So what is the solution? We're not trying to deceive Google in any way so it would seem unfair to be hit with a dupe content penalty, this is a legit dilemma where our product differ by as little as a part number. One ugly solution would be to remove header / sidebar / footer on our product pages as I've demonstrated here - http://woodberry.me.uk/test-page2-minimal-v2.html since this removes A LOT of page bloat (code) and would bring the page difference down to 80% duplicate.
(This is the tool I'm using for checking http://www.webconfs.com/similar-page-checker.php) Other "prettier" solutions would greatly appreciated. I look forward to hearing your thoughts. Thanks,
Woody 🙂1 -
How are you taking you e-commerce site forward in 2014
Hi MOZland, With a new (our first e-commerce) client, we're going through a massive learning curve in handling a site of substantial size and complexity for the first time. While we've weeded out most of the on-page stuff that needed sorting, and we're in the process of dumping poor links implemented by previous SEO/online marketing efforts, do you have any suggestions about how to take a big e-commerce site forward in 2014, especially concerning technical pitfalls and link building efforts (and given that guest blogging has become something of a faux pas). Cheers, M
Intermediate & Advanced SEO | | Martin_S0 -
Product descriptions & Duplicate Content: between fears and reality
Hello everybody, I've been reading quite a lot recently about this topic and I would like to have your opinion about the following conclusion: ecommerce websites should have their own product descriptions if they can manage it (it will be beneficial for their SERPs rankings) but the ones who cannot won't be penalized by having the same product descriptions (or part of the same descriptions) IF it is only a "small" part of their content (user reviews, similar products, etc). What I mean is that among the signals that Google uses to guess which sites should be penalized or not, there is the ratio "quantity of duplicate content VS quantity of content in the page" : having 5-10 % of a page text corresponding to duplicate content might not be harmed while a page which has 50-75 % of a content page duplicated from an other site... what do you think? Can the "internal" duplicated content (for example 3 pages about the same product which is having 3 diferent colors -> 1 page per product color) be considered as "bad" as the "external" duplicated content (same product description on diferent sites) ? Thanks in advance for your opinions!
Intermediate & Advanced SEO | | Kuantokusta0 -
Certain Product Pages Not Indexing
Hey All, We discovered an issue where new product pages on our site were not getting indexed because a "noindex" tag was inadvertently being added to section when those pages were created. We removed the noindex tag in late April and some of the pages that had not been previously indexed are now showing up, but others are still not getting indexed and I'd appreciate some help on why this could be. Here is an example of a page that was not in the index but is now showing after removal of noindex: http://www.cloud9living.com/san-diego/gaslamp-quarter-food-tour And here is an example of a page that is still not showing in the index: http://www.cloud9living.com/atlanta/race-a-ferrari UPDATE: The above page is now showing after I manually submitted it in WMT. I had previously submitted another page like a month ago and it was still not indexing so I thought the manual submission was a dead end. However, it just so happens that the above URL just had its Page Title and H1 updated to something more specific and less duplicative so I am currently running a test to see if that's the problem with these pages not indexing. Will update this soon. Any suggestions? Thanks!
Intermediate & Advanced SEO | | GManSEO0 -
Two Sites Similar content?
I just started working at this company last month. We started to add new content to pages like http://www.rockymountainatvmc.com/t/49/-/181/1137/Bridgestone-Motorcycle-Tires. This is their main site. Then i realized it also put the new content on their sister site http://www.jakewilson.com/t/52/-/343/1137/Bridgestone-Motorcycle-Tires. the first site is the main site and I think will get credit for the unique new content. The second one I do not think will get credit and will more than likely be counted as duplicate content. We are changing this so it will no longer be the same. However, I am curious to see ways people think we could fix this issues? Also is it effecting both sits for just the second one?
Intermediate & Advanced SEO | | DoRM0 -
Issue with duplicate content in blog
I have blog where all the pages r get indexed, with rich content in it. But In blogs tag and category url are also get indexed. i have just added my blog in seomoz pro, and i have checked my Crawl Diagnostics Summary in that its showing me that some of your blog content are same. For Example: www.abcdef.com/watches/cool-watches-of-2012/ these url is already get indexed, but i have asigned some tag and catgeory fo these url also which have also get indexed with the same content. so how shall i stop search engines to do not crawl these tag and categories pages. if i have more no - follow tags in my blog does it gives negative impact to search engines, any alternate way to tell search engines to stop crawling these category and tag pages.
Intermediate & Advanced SEO | | sumit600 -
Duplicate content mess
One website I'm working with keeps a HTML archive of content from various magazines they publish. Some articles were repeated across different magazines, sometimes up to 5 times. These articles were also used as content elsewhere on the same website, resulting in up to 10 duplicates of the same article on one website. With regards to the 5 that are duplicates but not contained in the magazine, I can delete (resulting in 404) all but the highest value of each (most don't have any external links). There are hundreds of occurrences of this and it seems unfeasible to 301 or noindex them. After seeing how their system works I can canonical the remaining duplicate that isn't contained in the magazine to the corresponding original magazine version - but I can't canonical any of the other versions in the magazines to the original. I can't delete the other duplicates as they're part of the content of a particular issue of a magazine. The best thing I can think of doing is adding a link in the magazine duplicates to the original article, something along the lines of "This article originally appeared in...", though I get the impression the client wouldn't want to reveal that they used to share so much content across different magazines. The duplicate pages across the different magazines do differ slightly as a result of the different Contents menu for each magazine. Do you think it's a case of what I'm doing will be better than how it was, or is there something further I can do? Is adding the links enough? Thanks. 🙂
Intermediate & Advanced SEO | | Alex-Harford0 -
Duplicate content for area listings
Hi, I was slightly affected by the panda update on the 14th oct generaly dropping by about 5-8 spots in the serps for my main keywords, since then I've been giving my site a good looking over. On a site I've got city listings urls for certain widget companys, the thing is many areas and thus urls will have the same company listed. What would be the best way of solving this duplicate content as google may be seeing it? I was thinking of one page per company and prominenly listing the areas they operate so still hopefully get ranked for area searches. But i'd be losing the city names in the url as I've got them now for example: mywidgetsite.com/findmagicwidgets/new-york.html mywidgetsite.com/findmagicwidgets/atlanta.html Any ideas on how best to proceed? Cheers!
Intermediate & Advanced SEO | | NetGeek0