How to check duplicate content with other website?
-
Hello,
I guest that my website may be duplicate contents with other websites. Is this a important factor on SEO? and how to check and fix them?
Thanks,
-
If you want to check who "copied" your content you can use - as told by the others - Copyscape.
Or, you can use Google itself.
Pro Tip:
- set search in order to show you 100 search result per time;
- tell Google to show you also the results it may have filtered out for being "substantially identical" to the ones it is showing you already;
- use the scraper extension for Chrome and scrape the Google results and export them in Google Docs, so to start analyzing the site that are scraping your content
- if the content you write is copyrighted, you can ask Google to deindex the site scraping it in order to defend your rights as the original Author.
-
if Google thinks that your content si a copy from another site, the page copied will be penalised by the Panda algorithm.
Sorry to disagree with you: if "copied" content was a problem, then we will have sites like Techmeme out of the index.
The problem with with publishing syndicated content is not the act of republishing it, but the value you add or not while republishing the content of another site. For instance, if you add classic content curation practice, as commenting inline or before or after the "copied" content, or if you published it and open a discussion that generates UGC content, then that copied content is not a problem.
Be aware, I am talking of content republished with the permission of the original author/publisher of the content itself.
Other thing is scraped content, which don't add value. In that case the scrapers seriously are at risk of Panda or, simply, of being filtered out of the visible index.
Similarly duplicated content can be a risk when it comes to products description in an eCommerce or Classified site. That content - again - seriously can lead you to a Panda penalization. That's why it is always better to rewrite the standard products description, or add more unique content that may add value, as "the site review" of the product, users' reviews, etc etc.
-
Hi,
I will have to disagree with Natan - duplicate content is not really such a big deal as a lot of people are advertising it for.
There is no such thing as duplicate content penalty and de-indexation of a site based on duplicate content - it was never the case and it will never be the case.
I am not saying you don't have to deal with it - you do - you should - but only when appropriate.
As far as Panda is concerned, it is a ranking or you can even call it a filter - but not a penalty and it is only based on market and competition. Yes, with low authority and a strong competition providing more or less the same information you can get under this Panda filter but it's way more then that - it's not 1 and 0 - black and white with it.
To see how "unique" your content is and where on the web other sites holds the same or parts of your content you can use copyscape - as Natan mention - but for the rest, sorry Nate, the advice is just not right.
Cheers.
-
Hello,
Duplicate content is a key factor in SEO, if Google thinks that your content si a copy from another site, the page copied will be penalised by the Panda algorithm.
If someone copies your content and is indexed earlier than you, then, your page will rank lower than your thief.
To prevent that, you must share the content immediately on Google Plus, and other SocialMedia and social bookmarks.
If Google thinks that all of your content is a copy, not only a page, but your entire site could suffer a penalty, or even a un-indexation.
if you think that your articles are being stolen or that you bought articles and the redactor is giving you copies from somewhere, you can chek that with copyscape.com
I hope to be usefull and easy to understand!
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate content with tagging and categories
Hello, Moz is showing that a site has duplicate content - which appears to be because of tags and categories. It is a relatively new site, with only a few blog publications so far. This means that the same articles are displayed under a number of different tags and categories... Is this something I should worry about, or just wait until I have more content? The 'tag' and 'category' pages are not really pages I would expect or aim for anyone to find in google results anyway. Would be glad to here any advice / opinions on this Thanks!
On-Page Optimization | | wearehappymedia1 -
Internal Duplicate Content/Canonical Issue/ or nothing to worry about
Unfortunately, my developer cannot give me an answer to this so I really do hope someone can help. The homepage of my website is http://www.laddersfree.co.uk however I also have a page http://www.laddersfree.co.uk/index.php that has a page rank and essentially duplicates the home page. Does someone know what this is? Do I need to get my developer to do a 404? It is worrying that he has not come back to me. Thanks Jason
On-Page Optimization | | gymmad0 -
800 number on website
Hi, My client just sent me an 800 number that he would like to use to replace his number on the website. I know that it is best to keep a local phone number on the website and across all citations for NAP/Local SEO reasons. Is there anywhere that I could still incorporate the 800 number and not have it affect SEO? Thanks, Erin
On-Page Optimization | | HiddenPeak0 -
Static content VS Dynamic changing content what is best
We have collected a lot of reviews and we want to use them on our Categories pages. We are going to be updating the top 6 reviews per categories every 4 days. There will be another page to see all of the reviews. Is there any advantage to have the reviews static for 1 or 2 weeks vs. having unique new ones pulled from the data base every time the page is refreshed? We know there is an advantage if we keep them on the page forever with long tail; however, we have created a new page with all of the reviews they can go to.
On-Page Optimization | | DoRM0 -
Duplicate Content- Best Practise Usage of the canonical url
Canonical urls stop self competition - from duplicate content. So instead of a 2 pages with a rank of 5 out of 10, it is one page with a rank of 7 out of 10.
On-Page Optimization | | WMA
However what disadvantages come from using canonical urls. For example am I excluding some products like green widet, blue widget. I have a customer with 2 e-commerce websites(selling different manufacturers of a type jewellery). Both websites have massive duplicate content issues.
It is a hosted CMS system with very little SEO functionality, no plugins etc. The crawling report- comes back with 1000 of pages that are duplicates. It seems that almost every page on the website has a duplicate partner or more. The problem starts in that they have 2 categorys for each product type, instead of one category for each product type.
A wholesale category and a small pack category. So I have considered using a canonical url or de-optimizing the small pack category as I believe it receives less traffic than the whole category. On the original website I tried de- optimizing one of the pages that gets less traffic. I did this by changing the order of the meta title(keyword at the back, not front- by using small to start of with). I also removed content from the page. This helped a bit. Or I was thinking about just using a canonical url on the page that gets less traffic.
However what are the implications of this? What happens if some one searches for "small packs" of the product- will this no longer be indexed as a page. The next problem I have is the other 1000s of pages that are showing as duplicates. These are all the different products within the categories. The CMS does not have a front office that allows for canonical urls to be inserted. Instead it would have to be done going into the html of the pages. This would take ages. Another issue is that these product pages are not actually duplicate, but I think it is because they have such little content- that the rodger(seo moz crawler, and probably googles one too) cant tell the difference.
Also even if I did use the canonical url - what happened if people searched for the product by attributes(the variations of each product type)- like blue widget, black widget, brown widget. Would these all be excluded from Googles index.
On the one hand I want to get rid of the duplicate content, but I also want to have these pages included in the search. Perhaps I am taking too idealistic approach- trying to optimize a website for too many keywords. Should I just focus on the category keywords, and forget about product variations. Perhaps I look into Google Analytics, to determine the top landing pages, and which ones should be applied with a canonical. Also this website(hosted CMS) seems to have more duplicate content issues than I have seen with other e-commerce sites that I have applied SEO MOZ to On final related question. The first website has 2 landing pages- I think this is a techical issue. For example www.test.com and www.test.com/index. I realise I should use a canonical url on the page that gets less traffic. How do I determine this? (or should I just use the SEO MOZ Page rank tool?)0 -
Content ideas for different sections of a news website?
A news website I'm working on has pages for various sectors, much like any major news site (in this case for example - defence, energy, trade & finance etc.). I've been asked to add content of 150 words or more to each sector, containing keywords we're targeting. I can see the value of this SEO-wise but can't see how we can write anything that adds value for the user. I don't want to add some rubbish for the sake of keywords, I want the information to be useful.The best idea I can come up with is to write an overview of the challenges and topics making the news in each sector, perhaps a bit of historical detail - but I don’t think this will add much value from a user-perspective, and it's not something where there will be the resources available to update often (or to provide some 'best on the web' type info). Any other ideas? Or do you think my idea is a great one? ;-)The pages in question are like the majority of news pages; each item with a synopsis and the usual extra things like a poll and 'most read' box. I've looked at other news sites and can't see one that has any extra content in the way we require.Thanks.
On-Page Optimization | | Alex-Harford0 -
Do product pages need unique content or does having duplcate content hurt on those pages?
We are adding product rapidly to our website but this requires allowing duplicate to exist on our product pages of furniture-online.com. From an SEO standpoint do we need to make this content unique for each product. Since we aren't link building to specific product pages and we don't anticipate product pages being found in a search result, are we ok leaving the duplicate content in place and spending our dollars elsewhere?
On-Page Optimization | | gallreddy0 -
Avoiding "Duplicate Page Title" and "Duplicate Page Content" - Best Practices?
We have a website with a searchable database of recipes. You can search the database using an online form with dropdown options for: Course (starter, main, salad, etc)
On-Page Optimization | | smaavie
Cooking Method (fry, bake, boil, steam, etc)
Preparation Time (Under 30 min, 30min to 1 hour, Over 1 hour) Here are some examples of how URLs may look when searching for a recipe: find-a-recipe.php?course=starter
find-a-recipe.php?course=main&preperation-time=30min+to+1+hour
find-a-recipe.php?cooking-method=fry&preperation-time=over+1+hour There is also pagination of search results, so the URL could also have the variable "start", e.g. find-a-recipe.php?course=salad&start=30 There can be any combination of these variables, meaning there are hundreds of possible search results URL variations. This all works well on the site, however it gives multiple "Duplicate Page Title" and "Duplicate Page Content" errors when crawled by SEOmoz. I've seached online and found several possible solutions for this, such as: Setting canonical tag Adding these URL variables to Google Webmasters to tell Google to ignore them Change the Title tag in the head dynamically based on what URL variables are present However I am not sure which of these would be best. As far as I can tell the canonical tag should be used when you have the same page available at two seperate URLs, but this isn't the case here as the search results are always different. Adding these URL variables to Google webmasters won't fix the problem in other search engines, and will presumably continue to get these errors in our SEOmoz crawl reports. Changing the title tag each time can lead to very long title tags, and it doesn't address the problem of duplicate page content. I had hoped there would be a standard solution for problems like this, as I imagine others will have come across this before, but I cannot find the ideal solution. Any help would be much appreciated. Kind Regards5