Duplicate Content... Really?
-
Hi all,
My site is www.actronics.eu
Moz reports virtually every product page as duplicate content, flagged as HIGH PRIORITY!
I know why.
Moz classes a page as duplicate if more than 95% of its content/code is similar to another page.
There's very little I can do about this: although our products are different, the content is very similar, differing only in a few part numbers and the vehicle make/model.
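To show how a figure like 95% can arise, here is a rough Python sketch. It is not Moz's algorithm (that isn't described anywhere in this thread); it just compares two token lists with the standard library's difflib. TEMPLATE is a made-up 200-token stand-in for the shared header, sidebar and footer, and the product text is loosely based on the two example pages in this post.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Return a 0..1 similarity ratio between two texts, compared word by word."""
    # autojunk=False keeps difflib from discarding frequently repeated tokens.
    return SequenceMatcher(None, a.split(), b.split(), autojunk=False).ratio()


# Hypothetical shared template (header, sidebar, footer); not real site markup.
TEMPLATE = " ".join(f"boilerplate{i}" for i in range(200))

# Assumed product text: only the make/model and part number differ.
unique_a = "Audi A4 ABS ECU Bosch 5.3"
unique_b = "BMW 3-series ABS ECU ATE 34-51"

with_template = similarity(f"{TEMPLATE} {unique_a}", f"{TEMPLATE} {unique_b}")
without_template = similarity(unique_a, unique_b)

print(f"with template:    {with_template:.1%}")
print(f"without template: {without_template:.1%}")
```

In this toy setup the pages score about 98% similar with the shared template and about 33% without it, so the score is driven mostly by the template rather than by the product text.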
Here's an example:
http://www.actronics.eu/en/shop/audi-a4-8d-b5-1994-2000-abs-ecu-en/bosch-5-3
http://www.actronics.eu/en/shop/bmw-3-series-e36-1990-1998-abs-ecu-en/ate-34-51
Now, multiply this by ~2,000 products x 7 different languages and you'll see we have a big dupe content issue (according to Moz's Crawl Diagnostics report).
I say "according to Moz" because I do not know whether this is actually an issue for Google. 90% of our product pages rank, albeit some much better than others.
So what is the solution? We're not trying to deceive Google in any way, so it would seem unfair to be hit with a dupe content penalty. This is a legitimate dilemma where our products differ by as little as a part number.
One ugly solution would be to remove the header / sidebar / footer on our product pages, as I've demonstrated here - http://woodberry.me.uk/test-page2-minimal-v2.html - since this removes A LOT of page bloat (code) and would bring the pages down to 80% duplicate.
(This is the tool I'm using for checking: http://www.webconfs.com/similar-page-checker.php)
Other "prettier" solutions would be greatly appreciated. I look forward to hearing your thoughts.
Thanks,
Woody
-
Hey David
Thanks for the reply.
3. Use a plugin to apply rich snippet markup to the individual product pages, adding another layer of "uniqueness"
I had thought about this already and was looking into the MPN (Manufacturer Part Number) attribute for products (https://schema.org/mpn). However, it's not clear whether, like SKU, the MPN needs to be unique to a ProductModel (https://schema.org/ProductModel).
If that were the case, I'd have a problem, as there are multiple MPNs per ProductModel.
I see https://schema.org/isVariantOf too, which could be useful?
Anyone with experience of Schema?
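To make the question concrete, here is a minimal Python sketch of one way the markup could be modelled: one ProductModel per part number, each pointing at a shared base model through isVariantOf. This is an assumption about the modelling, not a confirmed answer on how search engines treat mpn uniqueness. The helper name variant_jsonld is hypothetical, and the product name and part numbers are the ones quoted elsewhere in this thread.

```python
import json


def variant_jsonld(base_name: str, mpn: str, vehicle: str) -> dict:
    """Build a schema.org ProductModel for one part number (hypothetical helper)."""
    return {
        "@context": "https://schema.org",
        "@type": "ProductModel",
        "name": f"{base_name} for {vehicle}",
        "mpn": mpn,
        # Assumed modelling: every part-number variant points at one base model.
        "isVariantOf": {"@type": "ProductModel", "name": base_name},
    }


variants = [
    variant_jsonld("ATE MK70", "10097003153", "Audi A3"),
    variant_jsonld("ATE MK70", "9659136980", "Peugeot 206"),
]

# Each dict would be serialised into its own JSON-LD script block.
print(json.dumps(variants[0], indent=2))
```

Modelled this way, each variant carries exactly one mpn, which would sidestep the "multiple MPNs per ProductModel" worry if uniqueness does turn out to matter.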
-
First, why were you looking at the reports? Have you seen some type of ranking loss that you are trying to remedy?
Second, the Moz tools are just tools that give you an overview of where you are and of potential areas where your site can be improved. They work, but they are not dedicated to any one type of website, i.e. e-commerce vs static or content-based.
To get the unique pages you seek, it may be possible to use JavaScript to load content for the part number variables. As stated before, your site is being seen as duplicate because only a few things change from page to page.
Possible fixes:
1. Use dynamic coding to load part number variables, such as drop-down menus for alternate versions or parts or models. This will also give you fewer pages to direct your backlinks to.
2. Have more top level pages based around the category, and focus on getting the category pages ranking rather than the individual part pages. Again, focus your backlinking efforts on these pages.
3. Use a plugin to apply rich snippet markup to the individual product pages, adding another layer of "uniqueness"
-
The pages were not intended strictly for SEO value; they were mainly built for user value, i.e. returning a 100% focused page on the part number they searched for. Remember, many people use Google as a navigational tool, and they also consider the product to be the part no. they searched for, not the main manufacturer of the product (ATE).
I understand what you are saying though and think building stronger product pages is the way to go, although I will try on a subset of pages and monitor results.
Now to decide which approach to take to yield the best results:
a.) SEO focus on ATE MK70 (list all the vehicle makes/models/years this product works on, including a list of part numbers)
or...
b.) SEO focus on vehicle makes/model (then list all the manufacturers of suitable products, with corresponding part numbers)
Thanks,
Woody
-
This is one of the things Panda was trying to discourage (creating pages strictly for SEO value as opposed to user value that have thin content).
Consolidating and building out a single page is the way to go. Google will still crawl the product numbers, and they will be on a much stronger page. Even if they're not in the URL and title, a more valuable page nearly always wins out.
Not only that, you're playing with fire right now. If you haven't been hit by Panda yet, your odds are much higher with the numerous little pages.
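As a rough illustration of the consolidation idea, here is a small Python sketch that groups part numbers under their product so that one page can list them all. The row layout and the function names are hypothetical; the two part numbers are the ones quoted elsewhere in this thread.

```python
from collections import defaultdict

# Hypothetical catalogue rows: (product, vehicle, part number).
ROWS = [
    ("ATE MK70", "Audi A3", "10097003153"),
    ("ATE MK70", "Peugeot 206", "9659136980"),
]


def group_by_product(rows):
    """Collect every (vehicle, part number) pair under its product name."""
    grouped = defaultdict(list)
    for product, vehicle, part_number in rows:
        grouped[product].append((vehicle, part_number))
    return dict(grouped)


def render_listing(product, fitments):
    """Render a plain-text listing of all part numbers for one product page."""
    lines = [product]
    lines += [f"- {vehicle}: {part_number}" for vehicle, part_number in sorted(fitments)]
    return "\n".join(lines)


pages = group_by_product(ROWS)
listing = render_listing("ATE MK70", pages["ATE MK70"])
print(listing)
```

Two rows that would otherwise be two thin pages end up as one page that still carries both part numbers in crawlable text.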
-
Thanks guys
William
What's the thought process of creating a bunch of new pages, even though it's the same product, just referred to differently by different companies? Just for the unique URLs and titles?
Samuel
Would you want to create a separate page for "red Honda Civic," "green Honda civic," and countless other colors? Of course not.
To hopefully address both questions with one answer: the reason for building separate pages was to give SEO focus to the unique part numbers and the product type by vehicle make / model / year.
Very few people in the industry search for the product by name, it's always by part number. In fact, I'd go as far as to say there's few who would actually know the brand of "the product", that being ATE MK70 in our example above.
I understand the logic of building a strong single product page with all these part numbers listed, but would this page really rank well for searches on part number? Bear in mind, unlike the red, green, blue Honda Civic example, where there are perhaps a dozen different colours, we're talking literally hundreds of part numbers per product, plus variations in their formatting.
I welcome further conversation and ideas on this
Thanks so far guys!
-
Thanks for the question. I'm not able to go through your site at the moment, but I would ask: Do you really need a separate page for every single make, model, and part number? Correct me if I'm wrong, but this seems to be what you're doing. If so, you're just asking for a Panda penalty.
Here's a basic example: Say that you sell Honda Civics. Would you want to create a separate page for "red Honda Civic," "green Honda civic," and countless other colors? Of course not. All of the content would be entirely the same except for the listed color throughout each title and page's text.
I'd take a look at Amazon as an example. Say that I go to a page for a certain T-shirt. The same page for that individual product will include all of the color variations within that single product page. Each color variation is not a new page and URL (or if it is, it has a rel=canonical tag back to the main product page -- I don't remember). I'd look to this example as a way that you can vastly cut down the number of product pages so that each one is truly unique, valuable, and useful to both search engines and customers.
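If variant URLs are kept, the rel=canonical option mentioned above could be sketched in Python roughly as follows. The function name canonical_tag is hypothetical, and the URL is the main product page quoted elsewhere in this thread; how the tag is injected into the page head depends on the shop platform, which isn't known here.

```python
from html import escape


def canonical_tag(main_url: str) -> str:
    """Return the link element a variant page would carry in its <head>."""
    # Escape the URL so characters such as & or quotes cannot break the attribute.
    return f'<link rel="canonical" href="{escape(main_url, quote=True)}">'


# Main product page that every variant page would point back to.
MAIN_PRODUCT_URL = "http://www.actronics.eu/en/shop/product/ate-mk70"

tag = canonical_tag(MAIN_PRODUCT_URL)
print(tag)
```

Every variant page would emit the same tag, so the variants point at one main page instead of competing with one another.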
I hope that helps -- good luck!
-
I think you're already in Panda territory. The content can't get much thinner. It seems like all those sub-pages that are linked to on the page you just shared are unnecessary, no? Couldn't you just have the one page, build it out with the cars it works in, maybe a diagram or instruction on how to put it in, and make a really valuable page?
What's the thought process of creating a bunch of new pages, even though it's the same product, just referred to differently by different companies? Just for the unique URLs and titles?
Consolidating all of that would eliminate thin content and likely strengthen your landing page exponentially.
-
Thank you for your answer William and taking the time to respond,
I understand what you are saying, but I am a little skeptical about that being a logical/achievable solution.
Let's say we did write some content for each product, the content would be "thin" to say the least.
As an example, we have over 700 products (per language), this being one of them - http://www.actronics.eu/en/shop/product/ate-mk70
This product alone works in over 43 different vehicle marques, illustrated in the list on the page.
The only thing different about them is the part number, i.e. what the manufacturer refers to this part as (Audi A3 refers to it as 10097003153, Peugeot 206 refers to it as 9659136980). There really is nothing more to say about the product without creating more dupe content and getting into Panda territory, so I don't see this being a viable solution.
We have the pages in place because mechanics/garages search by manufacturer's number, not product type.
Any more thoughts/ideas?
-
This issue isn't duplicate content. Moz is just flagging it as that because of the severe lack of content, which makes the footer, sidebar, etc. the majority of the content on the page. This is not good, and the best way to remedy it would be to build out more content.
I realize with roughly 14k pages, this isn't realistic to do for every single page, but you could prioritize. What are your most popular products? Start with those and build out content to make sure they rank and perform as well as possible, and then continue to go down the list as you have time to do so, manually optimizing and building out the most profitable/popular pages first.
When it comes to unique content, there is no automated solution. Either you write stuff, hire someone else to write stuff, or do what a lot of places do: implement a review system for customers to use and crowd-source the unique content that way.