Similar product pages
-
I have 27,000 products on my website, each shown on its own webpage. Google has indexed almost all of them (about 25,000), but the SEOmoz report flags them as duplicate content. Indeed, most of each page is identical; only the description and price of the product change, which is no more than 2% of the total content of the page. At the bottom of the product page the alternatives for the product are shown, mainly other colors. So within one family of products, which can contain 50 products, the site creates 50 webpages, each showing the product and its family. That's why nearly everything on the page is identical within a family of products.
My guess is that, since Google indexed them all, I should not worry about duplicate content.
Is my guess correct?
Thanks in advance for a quick answer.
Rik
-
Are you telling me now that I shouldn't worry that much about SEOmoz flagging 6000 duplicate content errors, as long as my pages get indexed by Google and my site ranks well?
-
You're right on that count. Google doesn't exactly open-source their algos and share them with everyone. We did just change the way we detect duplicate content, and made a blog post about it yesterday. Fewer false positives should exist regarding duplicate content, but things like this will probably still get flagged.
-
OK, I understand. I saw others doing the same; I just never understood why they grouped products where the color is the only variation. Now you've given me a headache, as grouping my products on one page would be a very major change in my approach. If I want to set up a canonical for the preferred version, I need to work out how to do this programmatically: I may have 27,000 different product pages, but HTML-wise I have only one page. But this can't be that hard to figure out.
I still have one doubt, although you might be right. My site is 4 years old and for almost 3 years it was number one in organic Google results in Belgium, against a competition of more than 4 million. If Google had at any point considered my approach of one page per product to be duplicate content, I would never have gotten there.
Your comment is highly appreciated, as is donford's.
Thanks.
-
I think this is exactly what you are looking for -
http://maileohye.com/seo-tips-for-e-commerce-sites/
And personally, I feel that these pages will be considered duplicates, so you need to fix them by setting up a canonical for the preferred version. Or, if there is one standalone page, you need to block the other pages/color variations from search engines. Let me know what you think.
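Since the site builds every product page from one template, the canonical tag can be generated in code. Here is a minimal sketch in Python, assuming a lookup from each product code to the preferred variant of its family. The product codes, the `PREFERRED_VARIANT` table, and the `BASE_URL` pattern are all made up for illustration; on a real site they would come from the catalog.

```python
from html import escape

# Hypothetical data: each product code maps to the code of the preferred
# (canonical) variant of its family. A real site would read this from
# its product database.
PREFERRED_VARIANT = {
    "AB100-RED": "AB100-RED",
    "AB100-BLUE": "AB100-RED",
    "AB100-GREEN": "AB100-RED",
}

# Assumed URL pattern, not taken from the site discussed in this thread.
BASE_URL = "https://www.example.com/product/"


def canonical_tag(product_code: str) -> str:
    """Return the <link rel="canonical"> element for a product page.

    A product code missing from the table is treated as its own
    canonical version.
    """
    preferred = PREFERRED_VARIANT.get(product_code, product_code)
    href = escape(BASE_URL + preferred, quote=True)
    return f'<link rel="canonical" href="{href}">'


if __name__ == "__main__":
    # Print the tag that the template would place in the <head> of each page.
    for code in PREFERRED_VARIANT:
        print(code, canonical_tag(code))
```

The template would emit the returned element inside the `<head>` of every product page, so all color variations of a family point to the same preferred URL.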
-
The pages you show are (in my eyes) completely different, much more different than the ones I have.
Two samples, both indexed, where only the color and the price differ. The site ranks well in Google, and the pages are shown when entering the product code. The pages themselves have very low trust (1).
As the site ranks very well, the pages are indexed, and SEOmoz still shows a duplicate content error, the algorithms used by Google and SEOmoz must be different.
-
It sounds like you're on target if your pages are ranking well in the SERPs. As for the percentage of difference, I don't know, but I can show you an example using our o-ring pages.
AS001 size o-rings in 2 different materials
Basically the same information, but I added some material-specific information to each page to help differentiate them in the search engines. Again, I'm not sure what percentage of difference that is, but we have no duplicate content warnings here at SEOmoz.
-
Thanks for your answer, and yes, it makes sense. However, my products are organized by brand: every brand has families and every family has product codes. Since the product code is the keyword (many visitors look for a product using its product code), the search engines have no problem showing the right page. If a visitor uses the family name or the brand name in a search, my site has specific landing pages for those searches.
I asked my question because I am worried that my site will drop in the organic rankings because Google could consider all those product pages duplicate content. I never worried before because I've been in the top 3 for years now. I only started to worry when I started using SEOmoz and it gave me thousands of duplicate content errors.
Mainly: how much must a page differ in content to be considered unique and not risk a penalty from Google for duplicate content? Or, put otherwise, how much must a page differ for SEOmoz to no longer consider it duplicate content?
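Nobody in this thread gives a threshold, but the overlap between two pages can at least be measured. Below is a rough sketch in Python that compares the visible text of two pages using word shingles and Jaccard similarity. This is a generic technique chosen for illustration, not the algorithm Google or SEOmoz uses, and the function names and the shingle size of 3 are my own assumptions.

```python
import re


def shingles(text: str, size: int = 3) -> set:
    """Split text into overlapping word sequences ("shingles") of `size` words."""
    words = re.findall(r"\w+", text.lower())
    if len(words) < size:
        # Text shorter than one shingle: treat the whole text as one shingle.
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + size]) for i in range(len(words) - size + 1)}


def similarity(text_a: str, text_b: str, size: int = 3) -> float:
    """Jaccard similarity of the two shingle sets, from 0.0 to 1.0."""
    set_a, set_b = shingles(text_a, size), shingles(text_b, size)
    if not set_a and not set_b:
        return 1.0  # two empty texts are trivially identical
    return len(set_a & set_b) / len(set_a | set_b)


if __name__ == "__main__":
    # Made-up page texts that differ only in color and price.
    page_red = "Widget AB100 in red. Price 10 euro. Also available in other colors."
    page_blue = "Widget AB100 in blue. Price 12 euro. Also available in other colors."
    print(f"similarity: {similarity(page_red, page_blue):.2f}")
```

Running this over pairs of pages from the same product family gives a number to track while adding unique content to each page, even though the score at which any particular tool stops flagging a page is not known.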
-
This is common for eCommerce sites. The error should be taken seriously; however, there usually isn't an easy fix. The end goal, of course, is not necessarily for Google or any other search engine to index the pages, but rather that the right page is returned in the results when a particular keyword is searched.
Last year I upgraded our ecommerce site, which sells o-rings (very similar products). I spent a lot of extra time trying to differentiate each o-ring page from the others. In the end it paid off: no duplicate content, all pages indexed, and placing well in the SERPs.
So, to answer your question: while having the pages in Google's index is helpful, I would say you should worry about the duplicate content error, as it indicates that crawlers and search engines may have trouble returning the correct page for your specific keywords.
Hope that makes sense.