How to check for Duplicate Content Locally
-
Hi All,
Scenario
I have Instant Wordpress installed on my local machine.
I am in the process of redesigning my website; content, articles, etc.
I have internet access on my local machine.Question
I would like to cross check all internal links/pages against each other for duplicate content.
I would also like to check all internal pages against external www page instances for duplicate content.How can I achieve the following.
Thanks Mark
-
For external pages, copyscape should be sufficient.
For internal content, I would launch it noindexed on a test server (use meta robots and robots.txt) and run a Moz campaign crawl on it. That should be able to tell you whether any content is duplicated within the site without the content getting indexed.
-
I second Jonathan on copyscape -- it is a great tool to check for external duplicate content and the Copysentry feature is fairly good when combined with other checking methods.
To check for internal duplicate content, check out http://www.siteliner.com/ They are new, but it has been helpful each time I have tried it (and free).
-
Try copyscape.com. You can specific text by using Copyscape Premium, or protect your site (automatic checking) by using Copysentry.
http://www.copyscape.com/products.php
I personally use Copyscape Premium for checking stuff and it is pretty good. It is 5c per search, so very cheap.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Why competitors rank with no content & filters
We're investing a lot in on-page content and creating categories/sub categories to target relevant keywords. Although it seems competitors with high domain authorities are clearly dominating in the SERPs and the pages have NO content. They are literally filters that create landing pages. For example when you search "Kaydian Beds", Debenhams rank below the manufacturer with just a filtered page, this page has not been optimised with content to target the keywords, only the META title changes depending on the chosen filter. Link: http://www.debenhams.com/furniture/beds/kaydian Our link: https://kontenta.co.uk/brands/kaydian-beds.html This is very frustrating as quality content should beat pages with little or no content, we will spend some time and get some inbound links to the page to try boost the rankings but does anyone have any similar experiences with anything like this?
Competitive Research | | Jseddon921 -
Whats the Best Tool for Bulk SERPS Checking?
To get round the 'not provided' issue I've just generated a list of the 5000+ keywords that people have used to visit my site over the last 7+ years. I want to do a one off SERPS check for all these keywords and export into excel where I'll tie it up with lots of analytics and moz based data I was going to use the queries report on analytics but my data only goes back as far as May this year, so I'll use that if I have to (and probably will anyway for the impressions and CTR) but I'd prefer to cover everything so I can see any opportunities in my very long tail
Competitive Research | | Zippy-Bungle0 -
Site Ranking for keywords that they haven't targeted in content
There is a site that I am constantly battling for the #1 spot for a particular keyword and I can't see that they are doing any link building, they are not using any anchor text for the keyword "at all" just their company name (not exact match) and their content doesn't even contain the keyword. I used Open site explorer to analyze their activity, but they are doing something I can't figure out from that data. Any other tools to use? I have higher quality links than them, post content nearly 5 times per week to my blog and their blog hasn't been updated in ages, I kill them in social media, there isn't one instance that they are better than my site and I only build quality driven links, no blog comment crap and get featured on lots of industry blogs for our work. I distribute my content very effectively, I just can't figure it out. They were no where about 5 months ago now they are tearing it up for lots of keywords in the industry top spots. I can build a few links and surpass them, but I have to do it every week or so and I think they are doing something fishy. I just want to figure out what they are doing and bury them. I don't want to post their url and mine here as I don't want them to see this post in search results.
Competitive Research | | photoseo10 -
Content: How to top the top article in my niche
Hello, The top article in my niche is http://www.webmd.com/diet/default.htm I want to write a better article on the same topic, but on a low budget. The article would be a combination of tools, top 10 diet reviews, and what I already have here. I'm a pretty good authority on the topics.
Competitive Research | | BobGW1 -
How Do You Create an excel spreadsheet of all blog post content on a site?
Is there a quick and easy way to create an excel spreadsheet with a list of all blog content and it's SEO factors i.e. URL, title, description, etc.? I know I could use screaming from to get the entire site but is there a way to just get the content on the blog?
Competitive Research | | RonMedlin0 -
Duplicate content for www & non-www results
why would my campaign show duplicate content entries for www & non-www versions of my url? Here's an example I have a page called 'mydomain.com/resources/', and the campaign analysis shows it as being duplicate content, with the duplicate being 'www.mydomain.com/resources'. I don't know where I can adjust this or if it is perhaps related to some other setting, like Google Analytics or something else. /G
Competitive Research | | swdmedia0 -
How to check the Pr and Da of a thousand sites
Hi, I have a list of a thousand sites. I want to qualify the sites based on Pr and Da so that I can do my outreach. Pls advice how this can be done without manually putting each site in ose. Cheers, Vishal
Competitive Research | | vishalkhialani0 -
Competitor with over 100,000 links to duplicate copy ranks well, anyone know why?
One of our competitors has over 100,000 links on Yahoo site explorer, most of them are from pages on their site with duplicate content and they don't seem to have a focus on gaining backlinks, but the site still ranks really well. Can anyone think of a reason why this duplicate content hasn't been penalised by Google?
Competitive Research | | RobertHill0