Best Way to check for duplicate pages
-
With Google's updates we know they want to clean out duplicate content. i have been seeing the same crap spit out even word for word on different sites. Anyway how do you experienced SEO people test for dups on your own site as well as other sites. The only thing i can come up with is paying copyscape 5 cts a test. There has to be other ways. Advise/
-
Well there's the rub. Close key words...
avoid an audit
avoid a tax audit
avoid audit
all get hits from the rankings data. they bring up different pages as top ranking because different sites and pages rank differently based on the key word optimization.
Now I have the experience to know that in the past a good well developed linking structure will overcome this and you actually can rank for several variations by having a good topical silo structure. I am sometimes surprised what my main page ranks for just because a term is on the page! Content dominating the page does get hits.
BUT...competition is getting crazy in my industry. Sales companies are ranking and then selling the leads.I'm not just competing with similar businesses anymore. It seems like as long as I can vary the content, and i really can, because I have over 30 years in the business and more experience than anyone i know , so i can write varied content all day long!
As for the page, we don't use flash or anything. Had over 100 links on many pages so some changes had to be made.
Tell me more about an easier path to trend instead of doing pages for slight variations. Long term phrases don't seem to get many hits in my business. At least I don't think it's worth doing pages for till we knock off the shorter ones we have a chance of ranking for, which includes the three listed above.
-
Thanks, so then 5ct a test will add up but then it will be worth it. That's the feedback i was looking for, just wanted to make sure i wasn't missing another cheaper service that was as good.
-
Hey Joe
I think you have likely answered your own question to some extent - there are lots of manual ways to check for duplicate content by searching for it. You could even automate this against new sites by setting up Google Alerts to check for various random search strings in double quotes from all the pages across your site but why do this when there are professional solutions like copyscape?
Sure, there is a charge involved with copyscape and such tools but it is a fairly low. Alteratively, you will have to spend more time (money) coming up with a manual solution that will likely be less comprehensive.
A client of mine recently got hit quite badly by duplicate problems, some internal repetition (chunks on multiple pages) & loads of other sites had taken chunks of text from multiple pages, some entire pages had been copied with small edits. The offshoot of this was nearly 3 months without rankings whilst we identified and resolved the issue and being an internet company this was nearly three months without any new orders coming in.
If your have concerns about being caught up in a duplicate filter & this could have a series impact on your business then copyscape is a solid investment.
Hope it helps
Marcus
-
Does your situation benefit from having higher content pages that rank for your target keyword plus variables and long term additions? If so that might be an easier path to tread than something like page 1: kitten, page 2: kittens, page 3: young kittens...
If you're really looking into cranking out the content you're more along the lines of a eHow or other Demand Media property and then yes, you'd need a program to prevent duplication of both content created and keywords targeted. Still, even they get variable content by avoiding overly similar topics.
Finally, if header, footer, and side bar info is complicating things too drastically you could take steps on some pages to keep it unindexed via flash, image map, or by minimizing where applicable.
-
Thanks, I am aware of those things. I still don't want to worry about duplicate content. i have many pages that have similar key words that i can rank for, and I have to write them. I also have a lot of header and side bar info and need my content long enough to overcome that. Writing many pages that focus on similar key words can cause me to inadvertantly write similar info that i need to watch out for. that's why i need a copyscape type program.
-
You can search for long quote, supposedly unique phrases in Google. If you find multiple pages from other domains you'll have located duplicate content. Duplicate content isn't a death knell though as Google understands that there are many sites out there which scrape and farm content. With enough authority you should outrank those sorts of sites and also be able to highlight your original content status in Google Webmaster tools.
If you have copyright status to your content you can always pursue DMCA removal requests.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate links from forum what to do?
After a crawl it found over 5k errors and over 5k warnings. Those are: Duplicate page content; Duplicate page title; Overly-Dynamic URLs; Missing Meta descr; Title Element too long. All those come from domain.com/forum/ I don't need SEO on forum so what should I do? What could be an easy solution to this? No index? No follow? Please help
On-Page Optimization | | OVJ0 -
Page rank check
Hello everyone, How long should I wait to see if page rank for optimized pages have improved? cheers
On-Page Optimization | | PremioOscar0 -
Duplicate Content - Deleting Pages
The Penguin update in April 2012 caused my website to lose about 70% of its traffic overnight and as a consequence, the same in volume of sales. Almost a year later I am stil trying to figure out what the problem is with my site. As with many ecommerce sites a large number of the product pages are quite similar. My first crawl with SEOMOZ identified a large number of pages that are very similar - the majority of these are in a category that doesn't sell well anyway and so to help with the problem I am thinking of removing one of my categories (about 1000 products). My question is - would removing all these links boost the overall SEO of the site since I am removing a large chunk of near-duplicate links? Also - if I do remove all these links would I have to put in place a 301 redirect for every single page and if so, what's the quickest way of doing this. My site is www.modern-canvas-art.com Robin
On-Page Optimization | | robbowebbo0 -
How much SEO value does a fashion site get from bolting text onto the bottom of home page? Does the value compensate for cluttering up a page focused on an iconic image?
Getting ready to launch a completely redesigned site for a fashion designer. Since it is a fashion site, visitors do not need text to describe what the site is about., We are weighing three options: 1) clean design with no text (just images and navigational links), 2) bolting on a couple of sentences of text at the bottom of the page to signal keyword terms to the search engines, 3) following the lead of the top ranking site in the category and adding lots of text to the bottom of the page. Do the SEO benefits justify cluttering up the design by bolting text onto the bottom of the home page, and if so, how many characters of text seem to be the minimum to be effective?
On-Page Optimization | | RandyP0 -
Best practice for introducing new landing page to my site?
I have a client, and want to know the best way to add new, keyword specific landing pages to their site and link to it in a logical way that isn't spammy. Example: My homepage targets “Adelaide Cars” I also want to target “Melbourne Cars” which I would do via a targeted landing page. How then would I logically link to this landing page? As Google gets better at spotting un-natural content, I’d like to know how to introduce this new page to get the best traction. If I was to just create the page, it would not make sense to have it in the main navigation. Same goes from various industry type terms. Eg. pest control and exterminator. How do you target both and still have a logical sitemap and page structure that Google will like and make sense to users.
On-Page Optimization | | letgo3450 -
Duplicate Page Content Issues
How can I fix Duplicate Page Content Issues on my site : www.ifocalmedia.com. This is a WP site and the diagnostics shows I have 115 errors? I know this is damaging to my SEO campaign how do I clear these? Any help is very welcome.
On-Page Optimization | | shami0 -
How to fix duplicate page content and page titles?
Apologies in advance if this has already been answered (it probably has) - I'm just not seeing it. Is there a guide on here for how to fix the issues brought up by the crawler - specifically, things like duplicate page content, or duplicate page titles? A lot of these seem to have been created by wordpress.org combos that I didn't anticipate - i.e., category pages, author pages, etc. The crawler brings up the problems, but I don' t know where to start to go about fixing them. Also, any guide on best SEO practices or fixing optimization problems, specifically for wordpress.org blogs, would be greatly appreciated. Thanks!
On-Page Optimization | | prospects1 -
Why Does SEOMOZ Crawl show that i have 5,769 pages with Duplicate Content
Hello... I'm trying to do some analysis on my site (http://goo.gl/JgK1e) and SEOMOZ Crawl Diagnostics is telling me that I have 5,769 pages with duplicate content. Can someone, anyone, please help me understand: how does SEOMOZ determine if i have duplicate content Is it correct ? Are there really that many pages of duplicate content How do i fix this, if true <---- ** Most important ** Thanks in advance for any help!!
On-Page Optimization | | Prime850