E-Commerce Duplicate Content
-
Hello all
We have an e-commerce website with approximately 3,000 products. Many of the products are displayed in multiple categories which in turn generates a different URL!
Accross the entire site I have noticed that the product pages are always outranked by competitors who have lower page authority, domain authority, total links etc etc.
I am convinced this is down to duplicate content issues. I understand there is no direct penalty but how would this affect our rankings? Is page rank split between all the duplicates, which in turn lowers it's ranking potential?
I have looked for a way to identify duplicate content using Google analytics but i've been unsuccessful. If the duplicate content is the issue and page rank is divided am i best using canonical or 301 redirects?
Sorry if this is an obvious question but If i'm correct we could see a huge improvement in rankings accross the board. Wow!
Cheers
Todd
-
When Google finds more than one document (ie URL) with the same content, it has to define which of them is the representative document of the cluster. In doing this it looks at inbound link metrics, essentially, plus date of the page, pagerank and other factors. In this decision, it can be wrong, indexing a page that can hurt you indexation (consider this situation: it indexes as representative document page 2 of a listing page in descending order: new items in this category end to be at page 2 or later and are less likely to be discovered).
The canonical tag can be a good solution, even if it is a hint and not a rule to Google...
-
Great stuff thanks!...
-
SEOMOZ had an awesome whiteboard on this.
http://www.seomoz.org/blog/whiteboard-friday-faceted-navigation
Some more additional resources:
http://www.seomoz.org/ugc/dealing-with-faceted-navigation-a-case-study
Matt Cutts on faceted navigation:
http://www.stonetemple.com/articles/interview-matt-cutts-012510.shtml
Hope they help you
-
Thanks again! Unfortunately our system was built in house from scratch with no consideration for duplicate content
To be honest the product pages that I'm worried about have very few or no inbound links so maybe this isn't such a huge issue.
I have picked up on the fact almost all our pages including the homepage work on www and non www so maybe creating a 301 redirect for these will help also.
I will test the conical tag on a range of pages and mointor the results, hopefully our rankings will increase and I can look at some kind of strategy to roll this out.
Cheers for the help!
-
Google will select the most authortive aka whichever has the most links.
If you have a ton of inbound links I would recommend doing lots of research before inserting that tag. Find out which pages have the authority and don't throw it away.
This was a plague of eCommerce for years. Luckly most of the newest moden platfroms have caught up.
-
The duplicate item pages will not be indexed but visited the google bot. He will consider this page to be the one linked in the canonical tag.
I hope you won't have to set the urls manually !
-
Thanks for the quick response chaps! So if we have 9 duplicates for example will Google index all 9 pages or decide on 1 and never revisit the rest.
I couldn't see any duplicate URLs in the top content report.
We have over 3,000 products so it will be fun adding canonical tags to all the necessary pages
-
Toddy,
For every product of your site, you should identify its main category (the one that will be indexed). When seeing a product with a different category url, use the rel=canonical tag to give google the good url. This works well with e-commerce site.
You may also apply this logic between categories, as some listing between two categories are sometimes very similar.
For more information about the rel=canonical tag, see these resources :
http://www.google.com/support/webmasters/bin/answer.py?hl=en&answer=139394
-
The only "penalty" is the fact you could potentially spread your link juice across those multiple pages. Example:
You have 104 links to the same product, but they are equally pointed a 4 unique URLs. Now you technically have 26 links on whatever page Google 'selects' as your authority page.
Your competition has 100 links to the same product which only has 1 page.
With that type of setup your competition is always going to have that authority page ranked abouve you.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate Page Content
Hello, After crawling our site Moz is detecting high priority duplicate page content for our product and article listing pages, For example http://store.bmiresearch.com/bangladesh/power and http://store.bmiresearch.com/newzealand/power are being listed as duplicate pages although they have seperate URLs, page titles and H1 tags. They have the same product listed but I would have thought the differentiation in other areas would be sufficient for these to not be deemed as duplicate pages. Is it likely this issue will be impacting on our search rankings? If so are there any recommendations as to how this issue can be overcome. Thanks
Technical SEO | | carlsutherland0 -
Uservoice and Duplicate Page Content
Hello All, I'm having an issue where the my UserVoice account is creating duplicate page content (image attached). Any ideas on how to resolve the problem? A couple solutions we're looking into: moving the uservoice content inside the app, so it won't get crawled, but that's all we got for now. Thank you very much for your time any insight would be helpful. Sincerely,
Technical SEO | | JonnyBird1
Jon Birdsong SalesLoft duplicate duplicate0 -
Does turning website content into PDFs for document sharing sites cause duplicate content?
Website content is 9 tutorials published to unique urls with a contents page linking to each lesson. If I make a PDF version for distribution of document sharing websites, will it create a duplicate content issue? The objective is to get a half decent link, traffic to supplementary opt-in downloads.
Technical SEO | | designquotes0 -
Worpress Tags Duplicate Content
I just fixed a tags duplicate content issue. I have noindexed the tags. Was wondering if anyone has ever fixed this issue and how long did it take you to recover from it? Just kind of want to know for a piece of mind.
Technical SEO | | deaddogdesign0 -
How to Solve Duplicate Page Content Issue?
I have created one campaign over SEOmoz tools for my website. I have found 89 duplicate content issue from report. Please, look in to Duplicate Page Content Issue. I am quite confuse to resolve this issue. Can any one suggest me best solution to resolve it?
Technical SEO | | CommercePundit0 -
Large Scale Ecommerce. How To Deal With Duplicate Content
Hi, One of our clients has a store with over 30,000 indexed pages but less then 10,000 individual products and make a few hundred static pages. Ive crawled the site in Xenu (it took 12 hours!) and found it to by a complex mess caused by years of hack add ons which has caused duplicate pages, and weird dynamic parameters being indexed The inbound link structure is diversified over duplicate pages, PDFS, images so I need to be careful in treating everything correctly. I can likely identify & segment blocks of 'thousands' of URLs and parameters which need to be blocked, Im just not entirely sure the best method. Dynamic Parameters I can see the option in GWT to block these - is it that simple? (do I need to ensure they are deinxeded and 301d? Duplicate Pages Would the best approach be to mass 301 these pages and then apply a no-index tag and wait for it to be crawled? Thanks for your help.
Technical SEO | | LukeyJamo0 -
Duplicate Content issue
I have been asked to review an old website to an identify opportunities for increasing search engine traffic. Whilst reviewing the site I came across a strange loop. On each page there is a link to printer friendly version: http://www.websitename.co.uk/index.php?pageid=7&printfriendly=yes That page also has a link to a printer friendly version http://www.websitename.co.uk/index.php?pageid=7&printfriendly=yes&printfriendly=yes and so on and so on....... Some of these pages are being included in Google's index. I appreciate that this can't be a good thing, however, I am not 100% sure as to the extent to which it is a bad thing and the priority that should be given to getting it sorted. Just wandering what views people have on the issues this may cause?
Technical SEO | | CPLDistribution0 -
Mapping Internal Links (Which are causing duplicate content)
I'm working on a site that is throwing off a -lot- of duplicate content for its size. A lot of it appears to be coming from bad links within the site itself, which were caused when it was ported over from static HTML to Expression Engine (by someone else). I'm finding EE an incredibly frustrating platform to work with, as it appears to be directing 404's on sub-pages to the page directly above that subpage, without actually providing a 404 response. It's very weird. Does anyone have any recommendations on software to clearly map out a site's internal link structure so that I can find what bad links are pointing to the wrong pages?
Technical SEO | | BedeFahey0