Drupal infinite URL depth? SEOMOZ treating as duplicate content

dresults

I'm monitoring a subdirectory of my site on SEOMOZ, but with catastrophic results. It's finding infinite duplicate content, e.g. www.example.co.uk/product/samples/product/product/product and so on...

The website is running on Drupal. Do you have any ideas on how I can solve this?

ann.stringer

I'm having the same issue with a new Drupal site. Does anyone know the underlying cause and how to fix it?

Would any relative path cause this?

Thanks.

Atlanta-SMO

Can you list the modules you're running? What e-Commerce module are you running?

Dr-Pete

I'm not a Drupal expert, but it sounds like you may have some kind of relative path that's getting perpetuated. Robots.txt could help as a patch, but I'd definitely want to solve the crawl problem, as this could spin out into other problems.

Have you tried a desktop crawler, like Xenu or Screaming Frog? Sorry, it's tough to diagnose without seeing the actual site, but it's almost got to be a relative path that's causing "/product" to keep being added to links.
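To illustrate how a relative path can keep growing, here is a minimal sketch using Python's standard urljoin. The starting page and the href value are assumptions chosen to match the example URL in the question; the real site's markup may differ. It assumes the page URL ends in a slash and the link is written without a leading slash.

```python
from urllib.parse import urljoin


def follow_relative_link(start_url, href, hops):
    """Resolve the same href against each resulting page, as a crawler would."""
    urls = [start_url]
    for _ in range(hops):
        # Each new page contains the same link, so resolve it against the latest URL.
        urls.append(urljoin(urls[-1], href))
    return urls


# Hypothetical starting page, based on the URL pattern in the question.
START = "http://www.example.co.uk/product/samples/"

# Assumed broken link: no leading slash, so it is resolved relative to the current page.
crawled = follow_relative_link(START, "product/", 3)

# Root-relative version of the same link: it always resolves to the same URL.
fixed = follow_relative_link(START, "/product/", 3)

for url in crawled:
    print(url)
```

With the relative href, every hop appends another "product/" segment, which is the pattern the crawler is reporting. With the root-relative href, every hop lands on the same URL, so the crawl stops growing.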

OrionGroup

Yes, anything deeper would also be blocked.

dresults

Thanks Scott, this is really helpful.

Out of interest, would disallowing '/product/samples/product' automatically stop the bots from indexing all the pages underneath it too, such as '/product/samples/product/product/product/'?

OrionGroup

Try adding something like this to your robots.txt file:

User-agent: rogerbot
Disallow: /product/samples/product/
Disallow: /product/samples2/product1/
Disallow: /product/samples3/product4/

etc...
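If you want to check which URLs a rule like the first one above covers, a rough sketch with Python's standard urllib.robotparser is below. The host name and the is_blocked helper are only for illustration, and Python's parser is a stand-in here: rogerbot's own matching may differ in edge cases.

```python
from urllib.robotparser import RobotFileParser

# The first rule from the suggested robots.txt.
RULES = """\
User-agent: rogerbot
Disallow: /product/samples/product/
"""

parser = RobotFileParser()
parser.parse(RULES.splitlines())


def is_blocked(path, agent="rogerbot"):
    """Hypothetical helper: True if the rules disallow this path for the agent."""
    return not parser.can_fetch(agent, "http://www.example.co.uk" + path)


for path in (
    "/product/samples/",
    "/product/samples/product",
    "/product/samples/product/",
    "/product/samples/product/product/product/",
):
    print(path, "blocked" if is_blocked(path) else "allowed")
```

Disallow rules match by path prefix, so everything that starts with /product/samples/product/ is blocked, however deep it goes. Note that the path without the trailing slash does not start with the rule, so it stays allowed. The rule also only applies to rogerbot, so other crawlers are unaffected by it.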
