Tool to identify duplicated content on other sites
-
Hi
does anyone know of a tool that could be used to identify if a site is using our content without permission?
Thanks
-
Like Deacyde I use the content in quotes in Google itself "to test if this is duplicate." But I also use a couple tools:
- http://www.copyscape.com/
- http://www.siteliner.com/
- http://www.webconfs.com/similar-page-checker.php
- http://www.plagspotter.com/ (undergoing a bit of renovation currently)
If you wanted to build this into your own tool, PlagTracker has an API
http://www.plagtracker.com/api.html
(As do a few of the ones above.)
-
One great way to do this is to take a sentence or two and search it in google surrounded by double quotes, like:
" A sentence I want to google search to see if other sites come up using these sentences "
This is one of the best ways to find external duplicate content
Can always use this - http://www.seoreviewtools.com/duplicate-content-checker/ to doublecheck google results, as well check within your site for duplicate content.
Hope this helps.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Developing supporting content for main ideas
I attended Mozcon this year and the session by Joe Hall "Rethinking Information Architecture for SEO and Content Marketing" has me considering some changes to our site structure and architecture. I'm currently putting together a landing page for our webinars but could things like webinars and case studies be considered supporting content to our main ideas? For example instead of my architecture being: home > webinars > webinar about an idea It could be: home > main idea 1 > webinar about an idea So my webinar landing page would link out to all of the different webinar pages on the site instead of being contained in this bucket. Just wanted to get some thoughts on this.
Content Development | | Brando160 -
Why is content getting longer?
I find it odd that with the way life is today -- the gotta-have-it-now, instant gratification, can't hold someone's attention span for longer than 3 seconds -- why Google is wanting content to be REALLY long?? I've read articles saying content should be as long as 2,000 words per page. This just seems nuts to me. No one wants to read anymore. Look at how short Twitter posts are and how videos are so prevalent now. Any thoughts?
Content Development | | SEOhughesm0 -
What if your content is getting social shares but no links?
Suppose you have a weekly blog article and sometimes your articles earn social shares (e.g. 23 +1's on Google Plus on one article but normally 3-5 social shares). One out of 10 earns an organic link from a random blog. Would you continue publishing these blog posts?
Content Development | | ProjectLabs0 -
Would adding a news page hurt my site ranking ?
Hi Mozers I was thinking about adding an industry news page where we would post articles written by others but give proper citation and linking. Would a page like this hurt my SEO ? Thank You
Content Development | | Pzabarko0 -
Integration of content on other sites
(by Google Traductor) Hello, we are able to make agreements with sites of good quality and reputation to integrate our classified ads for agropeuariosector on the websites of these companies. We fear that thesesites begin to index all this content and begin to compete with us in organic positioning. On the other hand this would generateduplicate content? That strategy will be applied in order to do so. Greetings and thanks!
Content Development | | romaro
Robert0 -
Tools to Eval Blog Content - Rate your Fav tool
Ok, so I know that is has been covered in depth and at the risk of being sent to “google it!” (Which I have done with no success) I thought that I would ask your opinions on the topic. What are the best content marketing evaluation tools? By this I am specifically referring to tools that evaluate the content of Blogs, etc and not the performance of the blog, etc. I’m eager to hear your thoughts of what works and if you care to share what tools did not. Thanks
Content Development | | Questionmana0 -
4 sites - 4 keyword sets?
Hi, I have 4 domains and 4 keyword sets i.e. penny stocks mutual funds stocks day trading My question is: what is the better approach: Have each site around one set e.g. everything about "penny stocks" Site A - penny stocks Site B - mutual funds Site C - stocks Site D - day trading Create 4 categories in each website about each set of keywords? Site A - penny stocks | mutual funds | stocks | day trading and the same.. Thank you, Alex
Content Development | | pwpaneuro0 -
Archive older, low ranked content to help new content in Panda 2.2?
After watching the white board friday re: Panda 2.2, it got me to thinking about old content. One of the sites that I work with generates 3-10 new articles/day (movie reviews, interviews, guides, event previews, etc) and has been doing so since 2005. Now, they have almost 10k articles, 7k of which are indexed. The quality of the content varies, and much of it is dated (movies, events) much of the amount of older content gets 0-5 pageviews/month, made in the days BEFORE the site was using Google News + social tools to spread the word (and backlinks). Note that those older articles also of course tend to have 100% bounce, and small/zero TOS. Is this hurting the site? With 75-100 articles/month being published, I want to make sure they get maximum exposure. I'm also concerned that crawlers get sucked into the site chasing down old BS content, and that is hurting it as well. What to do with this content? Should I unpublish unpopular, dated content and get it off the internet? Or, do I leave it on, but NOINDEX it so Google won't crawl it?
Content Development | | EricPacifico0