E-Commerce Duplicate Content
-
Hello all
We have an e-commerce website with approximately 3,000 products. Many of the products are displayed in multiple categories which in turn generates a different URL!
Accross the entire site I have noticed that the product pages are always outranked by competitors who have lower page authority, domain authority, total links etc etc.
I am convinced this is down to duplicate content issues. I understand there is no direct penalty but how would this affect our rankings? Is page rank split between all the duplicates, which in turn lowers it's ranking potential?
I have looked for a way to identify duplicate content using Google analytics but i've been unsuccessful. If the duplicate content is the issue and page rank is divided am i best using canonical or 301 redirects?
Sorry if this is an obvious question but If i'm correct we could see a huge improvement in rankings accross the board. Wow!
Cheers
Todd
-
When Google finds more than one document (ie URL) with the same content, it has to define which of them is the representative document of the cluster. In doing this it looks at inbound link metrics, essentially, plus date of the page, pagerank and other factors. In this decision, it can be wrong, indexing a page that can hurt you indexation (consider this situation: it indexes as representative document page 2 of a listing page in descending order: new items in this category end to be at page 2 or later and are less likely to be discovered).
The canonical tag can be a good solution, even if it is a hint and not a rule to Google...
-
Great stuff thanks!...
-
SEOMOZ had an awesome whiteboard on this.
http://www.seomoz.org/blog/whiteboard-friday-faceted-navigation
Some more additional resources:
http://www.seomoz.org/ugc/dealing-with-faceted-navigation-a-case-study
Matt Cutts on faceted navigation:
http://www.stonetemple.com/articles/interview-matt-cutts-012510.shtml
Hope they help you
-
Thanks again! Unfortunately our system was built in house from scratch with no consideration for duplicate content
To be honest the product pages that I'm worried about have very few or no inbound links so maybe this isn't such a huge issue.
I have picked up on the fact almost all our pages including the homepage work on www and non www so maybe creating a 301 redirect for these will help also.
I will test the conical tag on a range of pages and mointor the results, hopefully our rankings will increase and I can look at some kind of strategy to roll this out.
Cheers for the help!
-
Google will select the most authortive aka whichever has the most links.
If you have a ton of inbound links I would recommend doing lots of research before inserting that tag. Find out which pages have the authority and don't throw it away.
This was a plague of eCommerce for years. Luckly most of the newest moden platfroms have caught up.
-
The duplicate item pages will not be indexed but visited the google bot. He will consider this page to be the one linked in the canonical tag.
I hope you won't have to set the urls manually !
-
Thanks for the quick response chaps! So if we have 9 duplicates for example will Google index all 9 pages or decide on 1 and never revisit the rest.
I couldn't see any duplicate URLs in the top content report.
We have over 3,000 products so it will be fun adding canonical tags to all the necessary pages
-
Toddy,
For every product of your site, you should identify its main category (the one that will be indexed). When seeing a product with a different category url, use the rel=canonical tag to give google the good url. This works well with e-commerce site.
You may also apply this logic between categories, as some listing between two categories are sometimes very similar.
For more information about the rel=canonical tag, see these resources :
http://www.google.com/support/webmasters/bin/answer.py?hl=en&answer=139394
-
The only "penalty" is the fact you could potentially spread your link juice across those multiple pages. Example:
You have 104 links to the same product, but they are equally pointed a 4 unique URLs. Now you technically have 26 links on whatever page Google 'selects' as your authority page.
Your competition has 100 links to the same product which only has 1 page.
With that type of setup your competition is always going to have that authority page ranked abouve you.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
ViewState and Duplicate Content
Our site keeps getting duplicated content flagged as an issue... however, the pages being grouped together have very little in common on-page. One area which does seem to recur across them is the ViewState. There's a minimum of 150 lines across the ones we've investigated. Could this be causing the reports?
Technical SEO | | RobLev0 -
WordPress Duplicate Content Caused By Categories
Hello, We have a wordpress blog that has around 250 categories. Due to our platform we have a hierarchy structure for 3 separate stores. For example iPhone > Apps > Books. Placing a blog post in the books category automatically places it into iPhone and iPhone/Apps category, causing 3 instances of any blog post in this category. Is this an issue? I have seen 2 schools of thought on categories, 1 index follow and 2 noindex follow. I know some of our categories get indexed, but with so many, maybe it is better to noindex them. We also considered reducing our categories to 10 to 12 and use tags to provide the indexed site navigation as follows: Reviews (category) iPhone Book App, iPhone App Store (tags) but this seems a little redundant? Anyone want to take this on? thank you Mike
Technical SEO | | crazymikesapps10 -
Duplicate content due to credit card testing
I recently launched a site - http://www.footballtriviaquestions.co.uk and the site uses Paypal. In order to test the PayPal functionality I set up a zapto.org domain via a permanent IP service that points directly to the computer I've written the website on. It appears that Google has now indexed the zapto.org website. Will this cause problems to my main website, as the zapto.org website will pretty much contain content that is an exact duplicate of what is held on the main website. I've looked in Google webmaster tools for the main website and it doesn't mention any duplicate content, but I'm currently not in the top 50 ranking for "football trivia questions' on Google despite SEOMoz ranking my home page with an A rating. The page does rank at position 16 in Yahoo and Bing. This seems odd to me, although I do have very few back links pointing to my site. If the duplicate content is likely to be causing me problems what would be the best way to knock the zapto.org results out of Google
Technical SEO | | ipr1010 -
Development Website Duplicate Content Issue
Hi, We launched a client's website around 7th January 2013 (http://rollerbannerscheap.co.uk), we originally constructed the website on a development domain (http://dev.rollerbannerscheap.co.uk) which was active for around 6-8 months (the dev site was unblocked from search engines for the first 3-4 months, but then blocked again) before we migrated dev --> live. In late Jan 2013 changed the robots.txt file to allow search engines to index the website. A week later I accidentally logged into the DEV website and also changed the robots.txt file to allow the search engines to index it. This obviously caused a duplicate content issue as both sites were identical. I realised what I had done a couple of days later and blocked the dev site from the search engines with the robots.txt file. Most of the pages from the dev site had been de-indexed from Google apart from 3, the home page (dev.rollerbannerscheap.co.uk, and two blog pages). The live site has 184 pages indexed in Google. So I thought the last 3 dev pages would disappear after a few weeks. I checked back late February and the 3 dev site pages were still indexed in Google. I decided to 301 redirect the dev site to the live site to tell Google to rank the live site and to ignore the dev site content. I also checked the robots.txt file on the dev site and this was blocking search engines too. But still the dev site is being found in Google wherever the live site should be found. When I do find the dev site in Google it displays this; Roller Banners Cheap » admin <cite>dev.rollerbannerscheap.co.uk/</cite><a id="srsl_0" class="pplsrsla" tabindex="0" data-ved="0CEQQ5hkwAA" data-url="http://dev.rollerbannerscheap.co.uk/" data-title="Roller Banners Cheap » admin" data-sli="srsl_0" data-ci="srslc_0" data-vli="srslcl_0" data-slg="webres"></a>A description for this result is not available because of this site's robots.txt – learn more.This is really affecting our clients SEO plan and we can't seem to remove the dev site or rank the live site in Google.Please can anyone help?
Technical SEO | | SO_UK0 -
Duplicate Content
Hi, we need some help on resolving this duplicate content issue,. We have redirected both domains to this magento website. I guess now Google considered this as duplicate content. Our client wants both domain name to go to the same magento store. What is the safe way of letting Google know these are same company? Or this is not ideal to do this? thanks
Technical SEO | | solution.advisor0 -
Duplicate content + wordpress tags
According to SEOMoz platform, one of my wordpress websites deals with duplicate content because of the tags I use. How should I fix it? Is it loyal to remove tag links from the post pages?
Technical SEO | | giankar0 -
Snippets on every page considered duplicate content?
If I create a page that pulls a 10 snippets of information from various external site, would that content be considered duplicate content? If I link to the source, would it be recommended to use a "nofollow" tag?
Technical SEO | | nicole.healthline0 -
Duplicate content
Greetings! I have inherited a problem that I am not sure how to fix. The website I am working on had a 302 redirect from its original home url (with all the link juice) to a newly designed page (with no real link juice). When the 302 redirect was removed, a duplicate content problem remained, since the new page had already been indexed by google. What is the best way to handle duplicate content? Thanks!
Technical SEO | | shedontdiet0