Duplicate content - how to diagnose duplicate content from another domain before publishing pages?
-
Hi,
My company has a new distributor contract, and we are starting to sell products through our own webshop.
The industry is biotechnology, and there are over 1,000 products. Writing product descriptions from scratch would take many hours, so the plan is to rewrite the existing ones.
With permission from our contractors, we will import their product descriptions into our webshop. However, I am concerned about being penalized by Google for duplicate content.
If we rewrite them we should be fine, I guess. But how can we be sure? Is there a good tool for comparing text only (I don't want to publish the pages just to compare URLs)?
What else should we be aware of besides checking the product descriptions for duplicate content?
Duplicate content is a big issue for all of us, and I hope the answers will be helpful to many.
Keep up the hard work, and thank you very much for your answers.
Cheers,
Dusan
-
Thank you again, Monica. The reviews will definitely be implemented. Good luck!
-
I think you should stay above 90% unique.
I would strongly encourage you to add user-generated content to the pages. Even rewriting the content so that it is entirely unique will not be enough to guarantee that your pages can rank. The content will be written in different words, but it will essentially say the same thing. You will need something that is uniquely valuable to the user.
No, you won't necessarily be penalized, but it will become harder to rank in branded searches.
-
Thank you, Monica, really good answer!
Now, with copyscape.com, what percentage of uniqueness is 'all right'?
We are selling laboratory products for cell analysis, and the titles and descriptions of our products are highly scientific. The manufacturer is the only one who can write them, so rewriting is our only option.
However, I am not that scared of being penalized anymore. Thank you very much!
-
I would add that if you are going to have user generated content, make sure there's a review process so it doesn't get spammed/abused.
-
Monica said basically everything I would have, and probably a little better. The review system is extremely helpful in generating unique content, for a few reasons. First, you don't have to write it yourself (just review it). Second, customers want to hear from other customers. Third, the way a user describes a product may surface keyword opportunities you had not thought of on your own.
-
Hi Dusan,
I have a couple of suggestions for you. The first, to answer your question, is that copyscape.com will let you compare two pieces of content for free, and then it will tell you the percentage of uniqueness. This will be a good way to tell if your rewrites are adequate.
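If you would rather check rewrites offline before anything is published, a rough comparison can be scripted. The sketch below is only an illustration and is not how Copyscape or Google measures duplication: it assumes that "uniqueness" means the share of three-word sequences (shingles) in the rewrite that do not appear in the manufacturer's text. The function names `shingles` and `uniqueness_percent` and the shingle size of 3 are hypothetical choices made for this example.

```python
import re


def shingles(text, size=3):
    """Return the set of overlapping word n-grams (shingles) in text."""
    # Lowercase and keep only alphanumeric runs so punctuation is ignored.
    words = re.findall(r"[a-z0-9]+", text.lower())
    if len(words) < size:
        # Text shorter than one shingle: treat the whole text as one unit.
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + size]) for i in range(len(words) - size + 1)}


def uniqueness_percent(rewrite, original, size=3):
    """Percentage of the rewrite's shingles that do not occur in the original.

    Assumption for this sketch: this is a stand-in for a "percentage of
    uniqueness", not the metric used by any particular tool.
    """
    rewrite_shingles = shingles(rewrite, size)
    if not rewrite_shingles:
        return 0.0
    shared = rewrite_shingles & shingles(original, size)
    return 100.0 * (1 - len(shared) / len(rewrite_shingles))


if __name__ == "__main__":
    manufacturer_text = "the cell analysis kit is reliable"
    rewritten_text = "the cell analysis kit works well"
    print(uniqueness_percent(rewritten_text, manufacturer_text))
```

Run over a folder of description pairs, something like this could flag the rewrites that still share long runs of wording with the source, which you can then check more carefully in a dedicated tool.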
My second suggestion is to implement a review system that will stream user-generated content onto your pages. The duplicate content "penalty" you are referring to is really not a penalty in the empirical sense. When it comes to ranking, Google will look at two sites with the same content and pick one to display. Usually the branded site would win in a branded search. There are many other factors, like PageRank and domain authority, that can influence which page is displayed, and the user query can influence that as well.
Having uniquely valuable content on your site, like user-generated comments and reviews, can offset your duplicate content issue. Rewriting the manufacturer's descriptions isn't really going to accomplish the goal of offering the searcher something they can't find anywhere else. In my opinion, just rewriting the content isn't enough of an advantage to beat out other sites. You have to offer something valuable that can only be found on your site. User-generated content is (in my opinion) the best content you can have. Every consumer reads reviews when they are available. They want to find out what regular people have to say about a product. Is it the right size, is the color consistent, how long did it last, is this a fair price? These are all questions that can be answered in a review system.
I added reviews to my Ecommerce site about 6 months ago and have seen great success. 80% of my content is the same as 4 other sites, except my category pages. I write extremely unique content for those pages, which helps me target long tail and branded key terms. Then my product pages have the manufacturer's descriptions, tech specs and warranty info plus the user generated content. It has been very successful whenever I have implemented it.
Related Questions
-
Duplicate Content That Isn't Duplicated
In Moz, I am receiving multiple messages saying that there is duplicate page content on my website. For example, these pages are being highlighted as duplicated: https://www.ohpopsi.com/photo-wallpaper/made-to-measure/pop-art-graffiti/farm-with-barn-and-animals-wall-mural-3824 and https://www.ohpopsi.com/photo-wallpaper/made-to-measure/animals-wildlife/little-elephants-garden-seamless-pattern-wall-mural-3614. As you can see, both pages are different products, therefore I can't apply a 301 redirect or canonical tag. What do you suggest?
Intermediate & Advanced SEO | e3creative
-
Content Publishing Volume/Timing
I am working with a company that has a bi-monthly print magazine that has several years' worth of back issues. We're working on building a digital platform, and the majority of articles from the print mag - tips, how-tos, reviews, recipes, interviews, etc - will be published online. Much of the content is not date-sensitive except for the occasional news article. Some content is semi-date-sensitive, such as articles focusing on seasonality (e.g. winter activities vs. summer activities). My concern is whether, once we prepare to go live, we should ensure that ALL historical content is published at once, and if so, whether back-dates should be applied to each content piece (even if dating isn't relevant), or whether we should have a strategy in place in terms of creating a publishing schedule and releasing content over time - albeit content that is older but isn't necessarily time-sensitive (e.g. a drink recipe). Going forward, all newly-created content will be published around the print issue release. Are there pitfalls I should avoid in terms of pushing out so much back content at once?
Intermediate & Advanced SEO | andrewkissel
-
Duplicate currency page variations?
Hi guys, I have duplicate category pages across an ecommerce site. http://s30.postimg.org/dk9avaij5/screenshot_160.jpg For the currency-based pages, I was wondering whether it would be best (or easier) to exclude them in the robots.txt or use a rel canonical? If using the robots.txt (which would be much easier to implement than rel canonical) to exclude the currency versions from being indexed, what would the correct exclusion be? Would it look something like: Disallow: */?currency/ Google is indexing the currency-based pages also: http://s4.postimg.org/hjgggq1tp/screenshot_161.jpg Cheers, Chris
Intermediate & Advanced SEO | jayoliverwright
-
What is considered duplicate content?
Hi, We are working on a product page for bespoke camper vans: http://www.broadlane.co.uk/campervans/vw-campers/bespoke-campers . At the moment there is only one page, but we are planning to add similar pages for other brands of camper vans. Each page will receive its own specifically targeted content; however, the 'Model choice' cart at the bottom (giving you the choice to select the internal structure of the van) will remain the same across all pages. Will this be considered duplicate content? And if this is the case, what would be the ideal solution to limit the penalty risk? A rel canonical tag seems wrong for this, as there is no original item as such. Would an iFrame around the 'model choice' enable us to keep that content from being indexed along with the page? Thanks, Celine
Intermediate & Advanced SEO | A_Q
-
Using Webmaster Tools to Redirect Domain to Specific Page on Another Domain
Hey Everyone, we redirected an entire domain to a specific URL on another domain (not the homepage). We used a 301 Redirect, but I'm also wondering if I should use the Google Webmaster Tools "Change of Address" section to redirect. There is no option to redirect the old domain to the specific URL on the new domain within the "Change of Address" section. Thoughts?
Intermediate & Advanced SEO | M_D_Golden_Peak
-
Guest blogging and duplicate content
I have a guest blog post prepared and several sites I can submit it to. Would it be considered duplicate content if I submitted one guest blog post to multiple blogs? If so, this content is not on my site but is linking to it. What will Google do? Let's say 5 blogs accept the same content and post it. I understand that the first blog to have it up will not be punished, but what about the rest of the blogs? Can they get punished for this duplicate content? Can I get punished for having duplicate content linking to me?
Intermediate & Advanced SEO | SEODinosaur
-
How to manage duplicate content?
I have a real estate site that contains a large amount of duplicate content. The site contains listings that appear both on my client's website and on my competitors' websites (which have better domain authority). It is critical that the content is there because buyers need to be able to find these listings to make enquiries. The result is that I have a large number of pages that contain duplicate content in some way, shape or form. My search results pages are really the most important ones because these are the ones targeting my keywords. I can differentiate these to some degree, but the actual listings themselves are duplicates. What strategies exist to ensure that I'm not suffering as a result of this content? Should I: Make the duplicate content noindex? Yes, my results pages will have some degree of duplicate content, but each result only displays a 200-character summary of the advert text, so I'm not sure if that counts. Would reducing the amount of visible duplicate content improve my rankings as a whole? Link back to the client's site to indicate that they are the original source? Any suggestions?
Intermediate & Advanced SEO | Mulith
-
Accepting RSS feeds. Does it = duplicate content?
Hi everyone, for a few years now I've allowed school clients to pipe their news RSS feed to their public accounts on my site. The result is a daily display of the most recent news happening on their campuses that my site visitors can browse. We don't republish the entire news item; just the headline and the first 150 characters of their article, along with a Read more link for folks to click if they want the full story over on the school's site. Each item has its own permanent URL on my site. I'm wondering if this is a wise practice. Does this fall into the territory of duplicate content even though we're essentially providing a teaser for the school? What do you think?
Intermediate & Advanced SEO | peterdbaron