How do I identify what is causing my Duplicate Page Content problem?
-
Hello,
I'm trying to put my finger on what exactly is causing my duplicate page content problem... For example, SEOmoz is picking up these four pages as having the same content:
http://www.penncare.net/ambulancedivision/braunambulances/express.aspx
http://www.penncare.net/ambulancedivision/recentdeliveries/millcreekparamedicservice.aspx
http://www.penncare.net/ambulancedivision/recentdeliveries/monongaliaems.aspx
http://www.penncare.net/softwaredivision/emschartssoftware/emschartsvideos.aspx
As you can tell, they really aren't serving the same content in the body of the page. Does anybody have an idea what might be causing these pages to show up as Duplicate Page Content? At first I thought the photo gallery module might be the cause, but it only exists on two of the pages...
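In case it helps with diagnosis, here's a rough way to measure how much raw markup these pages share (a minimal stdlib-only Python sketch; Moz's similarity threshold isn't published, so the ratios are only indicative - values near 1.0 would mean the shared template dwarfs each page's unique body text):

```python
# Fetch the four flagged pages and score their pairwise similarity.
# Stdlib only; ratios near 1.0 mean the shared template dominates.
import difflib
import urllib.request

urls = [
    "http://www.penncare.net/ambulancedivision/braunambulances/express.aspx",
    "http://www.penncare.net/ambulancedivision/recentdeliveries/millcreekparamedicservice.aspx",
    "http://www.penncare.net/ambulancedivision/recentdeliveries/monongaliaems.aspx",
    "http://www.penncare.net/softwaredivision/emschartssoftware/emschartsvideos.aspx",
]

def fetch(url):
    with urllib.request.urlopen(url, timeout=10) as resp:
        return resp.read().decode("utf-8", errors="replace")

pages = {url: fetch(url) for url in urls}

# Compare every pair of pages on their raw HTML source.
for i, a in enumerate(urls):
    for b in urls[i + 1:]:
        ratio = difflib.SequenceMatcher(None, pages[a], pages[b]).ratio()
        print(f"{ratio:.2f}  {a}")
        print(f"      vs {b}")
```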
Thanks in advance!
-
Ah right - OK then.
With regard to the data coming back from SEOmoz's crawler, I'd be tempted to ask them what it is seeing. I really should have a look at this myself, because I haven't yet.
-
I'm currently getting that information from Moz's own web crawler, which tells me the pages that have Duplicate Page Content and the other URLs on which that duplicate content exists.
With regard to the 301s - I have rewrite rules set up to 1) lowercase all requests, 2) trim off home.aspx, and 3) prepend www. to the request, etc. When processed, these should function as a single redirect/rewrite.
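For what it's worth, a quick way to confirm the single-hop behaviour is to request a few non-canonical variants and count the 301s (a rough sketch assuming the third-party `requests` library; the test URLs below are just illustrative variants, not necessarily real pages):

```python
# Count the redirect hops for a few non-canonical URL variants.
# Each one should 301 exactly once, straight to the lowercase
# www. version, rather than chaining through intermediate rules.
from urllib.parse import urljoin

import requests

def redirect_chain(url, max_hops=10):
    hops = []
    while len(hops) < max_hops:
        resp = requests.get(url, allow_redirects=False, timeout=10)
        if resp.status_code not in (301, 302, 307, 308):
            break
        # Location may be relative, so resolve it against the current URL.
        url = urljoin(url, resp.headers["Location"])
        hops.append((resp.status_code, url))
    return hops

# Illustrative variants: mixed case, missing www., trailing home.aspx.
tests = [
    "http://penncare.net/AmbulanceDivision/BraunAmbulances/Express.aspx",
    "http://www.penncare.net/home.aspx",
]

for test in tests:
    chain = redirect_chain(test)
    print(f"{len(chain)} hop(s) for {test}")
    for status, target in chain:
        print(f"   {status} -> {target}")
```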
-
Before looking at the duplicate content (what did you use to find that there is duplicate content?)... a quick question: you have a lot of 301s. Just want to check - are these a single redirect each, or a redirect of a redirect, etc.?
-
I would add some content to these pages to help differentiate them. None of them are text-heavy, so it may be hard for spiders to see a difference. Add a summary, maybe a text transcription of what is in the videos, etc.
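To put a number on "not text-heavy", something along these lines will strip the markup and count the visible words on each page (a stdlib-only sketch; it's only an approximation of what a crawler extracts, and it won't see any text injected by JavaScript):

```python
# Strip tags and count the visible words per page; a low count
# relative to the shared template is what trips duplicate filters.
import urllib.request
from html.parser import HTMLParser

class TextExtractor(HTMLParser):
    """Collects text nodes, skipping <script> and <style> contents."""
    def __init__(self):
        super().__init__()
        self.skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self.skip_depth:
            self.skip_depth -= 1

    def handle_data(self, data):
        if not self.skip_depth and data.strip():
            self.chunks.append(data.strip())

def visible_words(url):
    with urllib.request.urlopen(url, timeout=10) as resp:
        html = resp.read().decode("utf-8", errors="replace")
    extractor = TextExtractor()
    extractor.feed(html)
    return len(" ".join(extractor.chunks).split())

for url in [
    "http://www.penncare.net/ambulancedivision/braunambulances/express.aspx",
    "http://www.penncare.net/softwaredivision/emschartssoftware/emschartsvideos.aspx",
]:
    print(f"{visible_words(url):5d} visible words  {url}")
```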
-
Thanks for your reply... I guess more specifically I was wondering what it is about these particular page elements that makes search engines unable to tell the pages apart.
-
- Search engines don't know which version(s) to include/exclude from their indices
- Search engines don't know whether to direct the link metrics (trust, authority, anchor text, link juice, etc.) to one page, or keep them separated across multiple versions
- Search engines don't know which version(s) to rank for query results
When duplicate content is present, site owners suffer rankings and traffic losses, and search engines provide less relevant results.
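If adding unique copy isn't practical everywhere, the usual fallback is to declare a preferred URL with a rel="canonical" link element. Here's a quick way to check what each flagged page currently declares (a deliberately naive stdlib-only sketch; the regex assumes rel appears before href inside the tag, which isn't guaranteed in general):

```python
# Report the rel="canonical" target (if any) each flagged page declares.
# Naive: the regex assumes rel=... comes before href=... in the <link>.
import re
import urllib.request

urls = [
    "http://www.penncare.net/ambulancedivision/braunambulances/express.aspx",
    "http://www.penncare.net/ambulancedivision/recentdeliveries/millcreekparamedicservice.aspx",
    "http://www.penncare.net/ambulancedivision/recentdeliveries/monongaliaems.aspx",
    "http://www.penncare.net/softwaredivision/emschartssoftware/emschartsvideos.aspx",
]

canonical_re = re.compile(
    r'<link[^>]+rel=["\']canonical["\'][^>]*href=["\']([^"\']+)["\']',
    re.IGNORECASE,
)

for url in urls:
    with urllib.request.urlopen(url, timeout=10) as resp:
        html = resp.read().decode("utf-8", errors="replace")
    match = canonical_re.search(html)
    target = match.group(1) if match else "(none declared)"
    print(f"{url}\n   canonical: {target}")
```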
Hope this helps!
Resources: http://www.seomoz.org/learn-seo/duplicate-content