Is there a tool to compare duplicate content for content that is not live on the web?
-
Is there a tool that can give me the percentage of duplicate content when comparing two pieces of content that are not live on the web? Something like Copyscape, but for content that may not be indexed by Copyscape or is not live on the web?
Does Word or any other program allow you to do this?
-
I'm going through some of the older questions, and wondering if you found a solution to your problem, or if you're still looking for some advice. Thanks!
-
I've never seen a percentage-similarity option in Word, but you can compare and merge two documents to see the differences. I don't think that will be enough for your case; it's more helpful for spotting the differences between two documents that are in the same order (like a draft proposal and a final proposal).
-
Hi Bozzie,
I use WinMerge (open source software) to compare individual files/folders containing text or code.
Also, a quick search for [find similar files] on Google turned up numerous programs that will let you find similar files on your hard drive.
Best regards,
Guillaume Voyer.
-
I haven't tested this, but apparently Google Docs can compare and highlight the differences between two documents - perhaps this is close enough?
-
Can't you make your own private index in Copyscape and compare content against just that?
If you're comparing a lot of pages one-to-one, though, I guess that would be tedious.
Compare and merge feature in Word? Not really going to work how I suspect you want though.
Yeah, private copyscape index if it's only a few pieces.
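-
For the original question (a percentage of duplicate content between two pieces of text that aren't online), a short script is another option. This is only a rough sketch using Python's standard difflib module; it is not how Copyscape works, and the function names `words` and `similarity_percent` are made up for illustration. It compares the two texts word by word and reports how much of the word sequence matches.

```python
import difflib
import re


def words(text: str) -> list[str]:
    """Lowercase the text and split it into word tokens (hypothetical helper)."""
    return re.findall(r"\w+", text.lower())


def similarity_percent(text_a: str, text_b: str) -> float:
    """Return a 0-100 score for how much of the two word sequences match.

    Uses difflib.SequenceMatcher.ratio(), which is 2 * matches / total words.
    """
    matcher = difflib.SequenceMatcher(
        None, words(text_a), words(text_b), autojunk=False
    )
    return round(matcher.ratio() * 100, 1)


if __name__ == "__main__":
    # Sample texts are placeholders; read your own files in their place.
    draft = "The quick brown fox jumps over the lazy dog."
    final = "The quick brown fox leaps over the lazy dog."
    print(similarity_percent(draft, final))
```

Like the compare feature in Word, this is sensitive to order: two documents with the same paragraphs in a different sequence will score lower than you might expect, so treat the number as a rough guide.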