SEOmoz crawling filtered pages
-
Hi,
I just checked an SEO campaign we started last week, so I opened SEOmoz to see the crawl diagnostics.
Lots of duplicate content and duplicate title warnings are showing up, but that's because Rogerbot is crawling all of the filtered pages as well. How do I exclude these pages from being crawled?
/product/brand-x/3969?order=brand&sortorder=ASC
/product/brand-x/3969?order=popular&sortorder=ASC
/product/brand-x/3969?order=popular&sortorder=DESC&page=10
/product/brand-x/3969?order=popular&sortorder=DESC&page=11
-
So if the site has a structure like this, and the filter on the page has several options like ?order=, ?brand=, ..., do I have to rel-canonical them all to www.xyz.com/overview?
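For reference, a canonical tag on one of the filtered variants might look like this (the target URL here is an assumption; it should point at whichever version of the page you want indexed):

```html
<!-- in the <head> of /product/brand-x/3969?order=brand&sortorder=ASC -->
<link rel="canonical" href="http://www.xyz.com/product/brand-x/3969" />
```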
-
I'd rel-canonical them if you can, as there's still nothing stopping links to them from being indexed. Blocking crawling might stop Roger/Google from crawling them, but the potential indexation issues won't go away. Otherwise, perhaps noindex them.
I'd usually go as far as adding rel-prev and rel-next for the paginated searches as well.
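For the paginated results, rel-prev/rel-next annotations would look something like this (URLs taken from the examples above; the neighbouring page numbers are illustrative):

```html
<!-- in the <head> of /product/brand-x/3969?order=popular&sortorder=DESC&page=10 -->
<link rel="prev" href="/product/brand-x/3969?order=popular&sortorder=DESC&page=9" />
<link rel="next" href="/product/brand-x/3969?order=popular&sortorder=DESC&page=11" />
```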
-
Rel-canonical would be an immense job. Would it be OK to use robots.txt like this?
Disallow: /?order=
Disallow: /?sortorder=
...
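One caveat worth checking before relying on rules like these: a bare `Disallow: /?order=` is a plain prefix rule, so it only matches URLs whose path starts with `/?order=`, i.e. parameters on the site root, not on `/product/...` pages (Google additionally supports wildcards, so something like `Disallow: /*?order=` would be needed for deeper paths). Python's standard-library `urllib.robotparser` uses the same simple prefix matching as the original robots.txt standard (no wildcards), which makes it handy for a quick local sanity check; the sketch below uses the hypothetical domain from this thread:

```python
import urllib.robotparser

# The rules proposed above, verbatim.
rules = """\
User-agent: *
Disallow: /?order=
Disallow: /?sortorder=
"""

parser = urllib.robotparser.RobotFileParser()
parser.parse(rules.splitlines())

# "/?order=" is matched as a prefix against the URL's path + query,
# so it only blocks the site root with that parameter...
blocked_root = parser.can_fetch("*", "http://www.xyz.com/?order=brand")

# ...while the filtered product pages don't start with that prefix,
# so they remain crawlable under these rules.
allowed_product = parser.can_fetch(
    "*", "http://www.xyz.com/product/brand-x/3969?order=brand&sortorder=ASC")

print(blocked_root)      # False: root with ?order= is blocked
print(allowed_product)   # True: product page is still crawlable
```

Note this only models prefix matching; Rogerbot's and Google's actual handling (including wildcard support) may differ, so test the final rules in a robots.txt testing tool as well.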
-
Last I checked, Roger obeys robots.txt and meta robots directives, as he tries to simulate what Google would crawl. If he can crawl those pages, Google probably is crawling them as well.
I think if you rel-canonical them properly, noindex them properly, etc., each should show up as a normal page.
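If you go the noindex route instead, a robots meta tag on each filtered page would look like this (using "follow" so link equity still flows through the page is a common convention, not something specified in the thread):

```html
<!-- on each filtered variant you want kept out of the index -->
<meta name="robots" content="noindex, follow" />
```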