How to check for duplicate content on other websites?
-
Hello,
I guess that my website may have content that is duplicated on other websites. Is this an important factor in SEO? And how can I check for it and fix it?
Thanks,
-
If you want to check who "copied" your content you can use - as others have said - Copyscape.
Or you can use Google itself.
Pro Tip:
- set Google search to show you 100 results per page;
- tell Google to also show you the results it may have filtered out for being "substantially identical" to the ones it is already showing you;
- use the Scraper extension for Chrome to scrape the Google results and export them to Google Docs, so you can start analyzing the sites that are scraping your content;
- if the content you write is copyrighted, you can ask Google to deindex the site scraping it, in order to defend your rights as the original author.
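The first two steps of the tip above can be sketched as a small URL builder. This is only an illustration, not an official Google API: `num` and `filter` are query parameters that Google search has historically accepted (100 results per page, and `filter=0` to include the "omitted" near-identical results), and Google may ignore or change them at any time. The function name `build_duplicate_search_url` is hypothetical.

```python
from urllib.parse import urlencode


def build_duplicate_search_url(snippet, results_per_page=100, show_filtered=True):
    """Build a Google search URL for an exact-phrase query.

    Assumption: `num` and `filter` are unofficial URL parameters that
    Google has historically honoured; they are not a documented API.
    """
    # Wrap the sentence in double quotes so Google searches the exact phrase.
    phrase = snippet.replace('"', "").strip()
    params = {"q": f'"{phrase}"', "num": results_per_page}
    if show_filtered:
        # filter=0 asks Google to include results it would otherwise omit
        # as being very similar to the ones already displayed.
        params["filter"] = 0
    return "https://www.google.com/search?" + urlencode(params)


if __name__ == "__main__":
    # Paste a distinctive sentence from your own page here.
    print(build_duplicate_search_url("a distinctive sentence from my article"))
```

Open the printed URL in a browser and check which other sites show the same sentence.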
-
"If Google thinks that your content is a copy from another site, the page copied will be penalised by the Panda algorithm."
Sorry to disagree with you: if "copied" content were a problem in itself, then sites like Techmeme would be out of the index.
The problem with publishing syndicated content is not the act of republishing it, but whether or not you add value while republishing another site's content. For instance, if you apply classic content curation practices, such as commenting inline or before or after the "copied" content, or if you publish it and open a discussion that generates UGC, then that copied content is not a problem.
To be clear, I am talking about content republished with the permission of its original author/publisher.
Scraped content, which doesn't add value, is another thing. In that case the scrapers are seriously at risk of Panda or, simply, of being filtered out of the visible index.
Similarly, duplicated content can be a risk when it comes to product descriptions on an eCommerce or classifieds site. That content - again - can seriously lead you to a Panda penalization. That's why it is always better to rewrite the standard product descriptions, or to add more unique content that adds value, such as the site's own review of the product, users' reviews, etc.
-
Hi,
I will have to disagree with Natan - duplicate content is not really as big a deal as a lot of people make it out to be.
There is no such thing as a duplicate content penalty or de-indexation of a site based on duplicate content - it was never the case and it will never be the case.
I am not saying you don't have to deal with it - you do, and you should - but only when appropriate.
As far as Panda is concerned, it is a ranking factor, or you can even call it a filter - but not a penalty - and it is only based on market and competition. Yes, with low authority and strong competition providing more or less the same information you can end up under this Panda filter, but it's way more than that - it's not 1 and 0, black and white.
To see how "unique" your content is and where on the web other sites hold the same content or parts of it, you can use Copyscape - as Natan mentioned - but for the rest, sorry Nate, the advice is just not right.
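If you already have the text of your page and the text of a suspected copy, a rough idea of how much they overlap can be sketched with word shingles and Jaccard similarity. This is only an illustrative approach under my own assumptions; it is not how Copyscape or Google actually measure duplication, and the function names and the shingle size of 5 words are hypothetical choices.

```python
import re


def shingles(text, size=5):
    """Return the set of overlapping word n-grams ("shingles") in text."""
    words = re.findall(r"\w+", text.lower())
    if len(words) < size:
        # Text shorter than one shingle: treat the whole text as one unit.
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + size]) for i in range(len(words) - size + 1)}


def jaccard_similarity(text_a, text_b, size=5):
    """Share of shingles the two texts have in common, from 0.0 to 1.0."""
    a, b = shingles(text_a, size), shingles(text_b, size)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


if __name__ == "__main__":
    mine = "Our furnace filters come in fifty sizes and three quality levels."
    theirs = "Our furnace filters come in fifty sizes and many colours."
    print(f"overlap: {jaccard_similarity(mine, theirs):.2f}")
```

A score near 1.0 means the two texts are almost identical; a score near 0.0 means they share very few word sequences.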
Cheers.
-
Hello,
Duplicate content is a key factor in SEO: if Google thinks that your content is a copy from another site, the copied page will be penalised by the Panda algorithm.
If someone copies your content and is indexed earlier than you, then your page will rank lower than the thief's.
To prevent that, you must share the content immediately on Google Plus and on other social media and social bookmarking sites.
If Google thinks that all of your content is a copy, not just one page, your entire site could suffer a penalty, or even de-indexation.
If you think that your articles are being stolen, or that you bought articles and the writer is giving you copies from somewhere, you can check that with copyscape.com.
I hope this is useful and easy to understand!