Why is my Crawl Report Showing Thousands of Pages that Do Not Exist?

JennaCMag

Hi,

I just downloaded a Crawl Summary Report for a client's website. I am seeing THOUSANDS of duplicate page content errors. The overwhelming majority of them look something like this:

ERROR: http://www.earlyinterventionsupport.com/resources/parentingtips/development/parentingtips/development/development/development/development/development/development/parentingtips/specialneeds/default.aspx

This page doesn't exist and results in a 404 page. Why are these pages showing up? How do I get rid of them? Are they endangering the health of my site as a whole?

Thank you,

Jenna

StreamlineMetrics

Hi Jenna,

It's not so much the 404 pages themselves that are the problem for SEO, but rather that your site makes it hard for the search engines to crawl it correctly and efficiently, since they are getting caught in an endless loop. Crawlers caught in a loop like this may just give up on your site and leave, which means the search engines may never reach the rest of the pages on your site, and that may have a negative impact on your rankings as a whole. One of the most important parts of SEO is to make your website as "friendly" to the search engines as possible, so if they get caught in endless loops, that is definitely not ideal. Hope that helps!

Patrick

JennaCMag

Hi Streamline -

Thanks for your help thus far. Could you elaborate on some of the SEO challenges this presents? After a bit of research, I'm seeing people say that having hundreds or thousands of 404s is okay, if they are in fact non-existent pages. I'm not that well educated on this, so I'm just looking for a bit of clarification.

I will look into the relative URL issue. I just recently took over the work on this site, and I'm still digging into what the original web developer created.

Jenna

StreamlineMetrics

It looks like the crawler is being caught in an endless loop, most likely a result of using relative URLs somewhere on your site. Yes, this is a problem for the site as a whole so I highly recommend implementing absolute URLs throughout the entire site.

Edit - I just looked at your site and this is exactly what it is. The links in your navigation are relative, such as href="../development/default.aspx", so just change them to absolute URLs such as http://www.yoursite.com/development/default.aspx and it should fix the problem.
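
To see why a relative link can send a crawler into a loop, here is a minimal Python sketch using the standard library's urljoin. It is only an illustration: the example.com start URL and the href "development/default.aspx" are hypothetical stand-ins, not the exact links on your site, and it assumes the same navigation link appears on every page the crawler reaches.

```python
from urllib.parse import urljoin


def follow_link(start_url, href, hops):
    """Resolve the same href again and again, the way a crawler does
    when every page it lands on repeats that navigation link."""
    urls = []
    current = start_url
    for _ in range(hops):
        # A relative href is resolved against the page it was found on.
        current = urljoin(current, href)
        urls.append(current)
    return urls


# Hypothetical starting page and links (assumptions for illustration only).
START = "http://www.example.com/resources/default.aspx"

# Relative href: each hop nests one directory deeper.
relative = follow_link(START, "development/default.aspx", 3)

# Absolute href: every hop resolves to the same URL.
absolute = follow_link(
    START, "http://www.example.com/resources/development/default.aspx", 3
)

for url in relative:
    print("relative:", url)
for url in absolute:
    print("absolute:", url)
```

With the relative href the path picks up another /development/ segment on every hop, which is the same pattern as the URLs in your crawl report. With the absolute href the crawler keeps arriving at one URL and moves on.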
