What constitutes duplicate content on a page?
-
I am working on SEO for a Shopify store. Their products are very similar, hence the pages are so similar that Moz shows them as duplicate content. The only difference in the product pages is the title and model number. I am going to "go for the gold" and try re-writing all the product descriptions. It's incredibly difficult due to the products being nearly identical with just a minor variation. I know I could go down the road of just creating variants --- but the customer is not down for that.
Here's my question: what constitutes duplicate content? 80% of the content, 90%????
If I can going to re-write the descriptions, what should I aim for?
Thank you!
-
If you're not trying to rank, then you may not need to prioritize fixing this.
Duplicate content isn't a penalty; it's a risk. The risk is that more than one page on your site will seem appropriate in response to a particular search query and Google might 1) rank the wrong one, or 2) "decide" it's not clear and rank neither. If these aren't pages you'd expect or want to show up in search results anyway, then you can feel free simply to put together a page that'll provide the best experience for the user.
-
I am not trying to rank both pages - I am trying to create unique pages for each product. The complicated part is that they are tile. So a tile is made of the same material, same process for making them, used in the same application, and have the same size. So there are 1000 tiles that are very similar. The only slight variance is their color, part number and potentially if they have a pattern on them - such as a flower. Inside the set, there may be 100 tiles that have flowers. so it gets a bit difficult to write a description when so many things are the same about each tile.
-
Hi Steve! I'm not positive what Google considers duplicate, but I can tell you that our tools flag pages as duplicate when ≥90% of the source code (including content) matches. It's likely that we're more sensitive to it than Google is, which is intentional.
Out of curiosity, are you trying to rank both of these pages?
-
Hi-
I wouldn't being to guess the % needed, but I can tell you one way that we try to get around the issue with similar products. We added a "short description" section with bullet points to highlight all the questions people might ask about the product and in there we are very specific about color, shape, flavor etc. When we have similar products (say different flavored gummi bears) we list both basic facts about the product that are all the same (i.e. size, how many per bag) as well as listing the other attributes that are unique to that product (i.e. banana flavored, yellow colored) and that seems to be enough to keep us from a duplicate content penalty. It's also nice because it cuts down customer questions.
Just a thought
Ken
-
Hey Steve,
First question: How similar are these products? Are they the same but with color/trim/size differences? Have you had the conversation about canonicalization or are they not that similar?
Regarding duplication: I wouldn't look at it from a percentage standpoint, and if I did, I'd aim for 0-20% duplication with the assumption that 20% dupe was due to sentence beginnings and common intro phrases such as "If you're looking for....", which even as I look at that, I'd want to fix (because they're ubiquitous). Focus on the five Ws (who, what, where, when, why and how) and answer each one of those the best that you can, with the product's uses and why each model is different in mind.
Is there a way you can discuss the different use cases for each model number? Highlight benefits and applications? When people look for your product, what else are they searching for or concerned about?
Beau
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Will I have duplicate content on my own website?
Hello Moz community, We are an agency providing services to various industries, and among them the hair salon industry. On our website, we have our different service pages in the main menu, as usual. These service pages are general information and apply to any industry.We also have a page on the website that is only intended for the hair salon industry. On this page, we would like to link new service pages: they will be the same services as our “general” services, but specialized for hair salons. My questions relate to duplicate content: Do we have to make the new individual service pages for hair salons with completely different text, even though it’s the same service, in order to avoid having duplicate content? Can we just change a few words from the “general service” page to specifically target hair salons, and somehow avoid Google seeing it as duplicate content? Reminder that these pages will be internal links inside of the hair salon industry page. Thank you in advance for your answers, Gaël
On-Page Optimization | | Gael_Regnault0 -
How best to deal with internal duplicate content
hi having an issue with a client site and internal duplicate content. The client has a custom cms and when they post new content it can appear, in full, at two different urls on the site. Short of getting the client to move cms, which they won't do, I am trying to find an easy fix that they could do themselves. ideally they would add a canonical on one of the versions but the cms does allow them to view posts in html view, also would be a lot if messing about wth posting the page and then going back to the cms and adding the tag. the cms is unable to auto generate this either. The content editors are copywriters not programmers. Would there be a solution using wmt for this? They have the skill level to be able to add a url in wmt so im thinking that a stop gap solution could be to noindex one of the versions using the option in webmaster tools. Ongoing we will consult developers about modifying the cms but budgets are limited so looking for a cheap and quick solution to help until the new year. anyone know of a way other than wmt to block Google from seeing duplicate content. We can block Google from folders because only a small percentage of the content in the folder would be internally duplicate. would be very grateful for any suggestions anyone could offer. thanks.
On-Page Optimization | | daedriccarl0 -
Tool To Search For Duplicate Content
Hi Is there such a tool that can be use to search a website for duplicate content? Thanks
On-Page Optimization | | Bossandy0 -
I have a question about having to much content on a single page. Please help :)
I am working on a music related site. We are building a feature in our system to allow people to write information about songs on their playlist. So when a song is currently being played a user can read some cool facts or information about the song. http://imgur.com/5jFumPW ( screenshot). Some playlists have over 100 songs and could be completely random in genre and artist. I am wondering if some of these playlists have over 5,000 words of content if that is going to hurt us? We will be very strict about making sure its non spammy and good content. Also for the titles of the content is it bad to have over 100 h3 tags on one page? Just want to make sure we are on the right track. Any advice is greatly appreciated.
On-Page Optimization | | mikecrib10 -
Can you have more than 1 site on the first page if site look and content is completely different but keywords are the sam.
I have a client that wants to build another completely different site than his main site and optimize it to have 2 websites on the first page for his keywords. The content and look and feel of the website would be completely different. One of his competitors is doing it and getting away with it. What is your advice.
On-Page Optimization | | Roots70 -
Duplicate Content- Best Practise Usage of the canonical url
Canonical urls stop self competition - from duplicate content. So instead of a 2 pages with a rank of 5 out of 10, it is one page with a rank of 7 out of 10.
On-Page Optimization | | WMA
However what disadvantages come from using canonical urls. For example am I excluding some products like green widet, blue widget. I have a customer with 2 e-commerce websites(selling different manufacturers of a type jewellery). Both websites have massive duplicate content issues.
It is a hosted CMS system with very little SEO functionality, no plugins etc. The crawling report- comes back with 1000 of pages that are duplicates. It seems that almost every page on the website has a duplicate partner or more. The problem starts in that they have 2 categorys for each product type, instead of one category for each product type.
A wholesale category and a small pack category. So I have considered using a canonical url or de-optimizing the small pack category as I believe it receives less traffic than the whole category. On the original website I tried de- optimizing one of the pages that gets less traffic. I did this by changing the order of the meta title(keyword at the back, not front- by using small to start of with). I also removed content from the page. This helped a bit. Or I was thinking about just using a canonical url on the page that gets less traffic.
However what are the implications of this? What happens if some one searches for "small packs" of the product- will this no longer be indexed as a page. The next problem I have is the other 1000s of pages that are showing as duplicates. These are all the different products within the categories. The CMS does not have a front office that allows for canonical urls to be inserted. Instead it would have to be done going into the html of the pages. This would take ages. Another issue is that these product pages are not actually duplicate, but I think it is because they have such little content- that the rodger(seo moz crawler, and probably googles one too) cant tell the difference.
Also even if I did use the canonical url - what happened if people searched for the product by attributes(the variations of each product type)- like blue widget, black widget, brown widget. Would these all be excluded from Googles index.
On the one hand I want to get rid of the duplicate content, but I also want to have these pages included in the search. Perhaps I am taking too idealistic approach- trying to optimize a website for too many keywords. Should I just focus on the category keywords, and forget about product variations. Perhaps I look into Google Analytics, to determine the top landing pages, and which ones should be applied with a canonical. Also this website(hosted CMS) seems to have more duplicate content issues than I have seen with other e-commerce sites that I have applied SEO MOZ to On final related question. The first website has 2 landing pages- I think this is a techical issue. For example www.test.com and www.test.com/index. I realise I should use a canonical url on the page that gets less traffic. How do I determine this? (or should I just use the SEO MOZ Page rank tool?)0 -
How do I avoid duplicate content and page title errors when using a single CMS for a website
I am currently hosting a client site on a CMS with both a Canadian and USA version of the website. We have the .com as the primary domain and the .ca is re-directed from the registrar to the Canadian home page. The problem I am having is that my campaign produces errors for duplicate page content and duplicate page titles. Is there a way to setup the two versions on the CMS so that these errors do not get produced? My concern is getting penalized from search engines. Appreciate any help. Mark Palmer
On-Page Optimization | | kpreneur0 -
Will duplicate content supplied from a hotel provider damage my website, or simply just the pages that it appears on?
Hi, I currently have a lot of hotel listings pages with little or no content, as I'm scared that if I place duplicate hotel descriptions on the pages then Google will stop ranking the page. I've found that having descriptions of some kind do help conversion significantly, so I'm considering generating unique hotel descriptions on each main page (page 1 in each set of listings) - these are the pages that Google indexes. On subsequent pages (page 2, page 3 etc.) I'm thinking about resorting to displaying the duplicate affiliate content hotel descriptions - these pages can be crawled but are set to noindex. My question is, do you think this is likely to have an effect on my website in the rankings, and as a result push my primary pages (that contain 100% unique content) down in SERPs. Thanks Mike
On-Page Optimization | | mjk260