E-Commerce Duplicate Content
-
Hello all
We have an e-commerce website with approximately 3,000 products. Many of the products are displayed in multiple categories which in turn generates a different URL!
Accross the entire site I have noticed that the product pages are always outranked by competitors who have lower page authority, domain authority, total links etc etc.
I am convinced this is down to duplicate content issues. I understand there is no direct penalty but how would this affect our rankings? Is page rank split between all the duplicates, which in turn lowers it's ranking potential?
I have looked for a way to identify duplicate content using Google analytics but i've been unsuccessful. If the duplicate content is the issue and page rank is divided am i best using canonical or 301 redirects?
Sorry if this is an obvious question but If i'm correct we could see a huge improvement in rankings accross the board. Wow!
Cheers
Todd
-
When Google finds more than one document (ie URL) with the same content, it has to define which of them is the representative document of the cluster. In doing this it looks at inbound link metrics, essentially, plus date of the page, pagerank and other factors. In this decision, it can be wrong, indexing a page that can hurt you indexation (consider this situation: it indexes as representative document page 2 of a listing page in descending order: new items in this category end to be at page 2 or later and are less likely to be discovered).
The canonical tag can be a good solution, even if it is a hint and not a rule to Google...
-
Great stuff thanks!...
-
SEOMOZ had an awesome whiteboard on this.
http://www.seomoz.org/blog/whiteboard-friday-faceted-navigation
Some more additional resources:
http://www.seomoz.org/ugc/dealing-with-faceted-navigation-a-case-study
Matt Cutts on faceted navigation:
http://www.stonetemple.com/articles/interview-matt-cutts-012510.shtml
Hope they help you
-
Thanks again! Unfortunately our system was built in house from scratch with no consideration for duplicate content
To be honest the product pages that I'm worried about have very few or no inbound links so maybe this isn't such a huge issue.
I have picked up on the fact almost all our pages including the homepage work on www and non www so maybe creating a 301 redirect for these will help also.
I will test the conical tag on a range of pages and mointor the results, hopefully our rankings will increase and I can look at some kind of strategy to roll this out.
Cheers for the help!
-
Google will select the most authortive aka whichever has the most links.
If you have a ton of inbound links I would recommend doing lots of research before inserting that tag. Find out which pages have the authority and don't throw it away.
This was a plague of eCommerce for years. Luckly most of the newest moden platfroms have caught up.
-
The duplicate item pages will not be indexed but visited the google bot. He will consider this page to be the one linked in the canonical tag.
I hope you won't have to set the urls manually !
-
Thanks for the quick response chaps! So if we have 9 duplicates for example will Google index all 9 pages or decide on 1 and never revisit the rest.
I couldn't see any duplicate URLs in the top content report.
We have over 3,000 products so it will be fun adding canonical tags to all the necessary pages
-
Toddy,
For every product of your site, you should identify its main category (the one that will be indexed). When seeing a product with a different category url, use the rel=canonical tag to give google the good url. This works well with e-commerce site.
You may also apply this logic between categories, as some listing between two categories are sometimes very similar.
For more information about the rel=canonical tag, see these resources :
http://www.google.com/support/webmasters/bin/answer.py?hl=en&answer=139394
-
The only "penalty" is the fact you could potentially spread your link juice across those multiple pages. Example:
You have 104 links to the same product, but they are equally pointed a 4 unique URLs. Now you technically have 26 links on whatever page Google 'selects' as your authority page.
Your competition has 100 links to the same product which only has 1 page.
With that type of setup your competition is always going to have that authority page ranked abouve you.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate Content Mystery
Hi Moz community! I have an ongoing duplicate mystery going on here and I'm hoping someone here can answer my question. We have an Ecommerce site that has a variety of product pages and category pages. There are Rel canonicals in place, along with parameters in GWT, and there are also URL rewrites. Here are some scenarios, maybe you can give insight as to what’s exactly going on and how to fix it. All the duplicates look to be coming from category pages specifically. For example:
Technical SEO | | Ecom-Team-Access
This link re-writes: http://www.incipio.com/cases/tablet-cases/amazon-kindle-cases-sleeves.html?cat=407&color=152&price=20- To: http://www.incipio.com/cases/tablet-cases/amazon-kindle-cases-sleeves.html The rel canonical tag looks like this: http://www.incipio.com/cases/tablet-cases/amazon-kindle-cases-sleeves.html" /> The CONTENT is different, but the URLs are the same. It thinks that the product category view is the same as the all products view, even though there is a canonical in there telling it which one is the original. Some of them don’t have anything to do with each other. Take a look: Link identified as duplicate: http://www.incipio.com/cases/smartphone-cases/htc-smartphone-cases/htc-windows-phone-8x-cases.html?color=27&price=20- Link this is a duplicate of: http://www.incipio.com/cases/macbook-cases/macbook-pro-13in-cases.html Any idea as to what could be happening here?0 -
Duplicate content for vehicle inventory.
Hey all, In the automotive industry... When uploading vehicle inventory to a website I'm concerned with duplicate content issues. For example, 1 vehicle is uploaded to the main manufacturers website, then again to the actual dealerships website & then again to Craigslist & even sometimes to a group site. The information is all the same, description, notes, car details & images. What would you all recommend for alleviating duplicate content issues? Should I be using the rel canonical back to the manufacturers website? Once the vehicle is sold all pages disappear. Thanks so much for any advice.
Technical SEO | | DCochrane0 -
Duplicate page content & titles on the same domain
Hey, My website: http://www.electromarket.co.uk is running Magento Enterprise. The issue I'm running into is that the URLs can be shortened and modified to display different things on the website itself. Here's a few examples. Product Page URL: http://www.electromarket.co.uk/speakers-audio-equipment/dj-pa-speakers/studio-bedroom-monitors/bba0051 OR I could remove everything in the URL and just have: http://www.electromarket.co.uk/bba0051 and the link will work just as well. Now my problem is, these two URL's load the same page title, same content, same everything, because essentially they are the very same web page. But how do I tell Google that? Do I need to tell Google that? And would I benefit by using a redirect for the shorter URLs? Thanks!
Technical SEO | | tomhall900 -
How different does content need to be to avoid a duplicate content penalty?
I'm implementing landing pages that are optimized for specific keywords. Some of them are substantially the same as another page (perhaps 10-15 words different). Are the landing pages likely to be identified by search engines as duplicate content? How different do two pages need to be to avoid the duplicate penalty?
Technical SEO | | WayneBlankenbeckler0 -
Duplicate content problem from an index.php file
Hi One of my sites is flagging a duplicate content problem which is affecting the search rankings. The duplicate problem is caused by http://www.mydomain.com/index.php which has a page rank of 26 How can I sort the duplicate content problem, as the main page should just be http://www.mydomain.com which has a page rank of 42 and is the stronger page with stronger links etc Many Thanks
Technical SEO | | ocelot0 -
Need help with Joomla duplicate content issues
One of my campaigns is for a Joomla site (http://genesisstudios.com) and when my full crawl was done and I review the report, I have significant duplicate content issues. They seem to come from the automatic creation of /rss pages. For example: http://www.genesisstudios.com/loose is the page but the duplicate content shows up as http://www.genesisstudios.com/loose/rss It appears that Joomla creates feeds for every page automatically and I'm not sure how to address the problem they create. I have been chasing down duplicate content issues for some time and thought they were gone, but now I have about 40 more instances of this type. It also appears that even though there is a canonicalization plugin present and enabled, the crawl report shows 'false' for and rel= canonicalization tags Anyone got any ideas? Thanks so much... Scott | |
Technical SEO | | sdennison0 -
404's and duplicate content.
I have real estate based websites that add new pages when new listings are added to the market and then deletes pages when the property is sold. My concern is that there are a significant amount of 404's created and the listing pages that are added are going to be the same as others in my market who use the same IDX provider. I can go with a different IDX provider that uses IFrame which doesn't create new pages but I used a IFrame before and my time on site was 3min w/ 2.5 pgs per visit and now it's 7.5 pg/visit with 6+min on the site. The new pages create new content daily so is fresh content and better on site metrics (with the 404's) better or less 404's, no dup content and shorter onsite metrics better? Any thoughts on this issue? Any advice would be appreciated
Technical SEO | | AnthonyLasVegas0 -
Duplicate homepage content
Hi, I recently did a site crawl using seomoz crawl test My homepage seems to have 3 cases of duplicate content.. These are the urls www.example.ie/ www.example..ie/%5B%7E19%7E%5D www.example..ie/index.htm Does anyone have any advise on this? What impact does this have on my seo?
Technical SEO | | Socialdude0