SEOmoz crawling filtered pages

nvs.nim

Hi,

I just checked an SEO campaign we started last week, so I opened SEOmoz to see the crawl diagnostics.

Lots of duplicate content and duplicate titles are showing up, but that's because Rogerbot is crawling all of the filtered pages as well. How do I exclude these pages from being crawled?

/product/brand-x/3969?order=brand&sortorder=ASC
/product/brand-x/3969?order=popular&sortorder=ASC
/product/brand-x/3969?order=popular&sortorder=DESC&page=10
/product/brand-x/3969?order=popular&sortorder=DESC&page=11

nvs.nim

So if the site has a structure like this:

www.xyz.com/overview

and the filter on this page has several options like ?order=, ?brand=, ..., do I have to rel-canonical them all to www.xyz.com/overview?

My-Favourite-Holiday-Cottages

I'd rel-canonical if you can, as there's still nothing stopping links to them from being indexed. Excluding them from the crawl might stop Rogerbot/Google from crawling them, but the potential indexation issues won't go away. Otherwise, perhaps noindex them.

I'd usually go as far as adding rel-prev and rel-next for the paginated searches as well.
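To illustrate the rel-canonical suggestion: if the page templates can run a bit of code, the canonical URL can be derived from the requested URL rather than written by hand for every filter combination. Below is a minimal sketch in Python, assuming the filter parameters are `order`, `sortorder`, and `brand` (taken from the URLs in this thread) and that `page` is kept so paginated results can still be handled with rel-prev/rel-next. The names `FILTER_PARAMS`, `canonical_url`, and `canonical_link_tag` are hypothetical, not part of any SEOmoz tool.

```python
from html import escape
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Assumption: these are the filter/sort parameters that create duplicates.
FILTER_PARAMS = {"order", "sortorder", "brand"}


def canonical_url(url, drop=FILTER_PARAMS):
    """Return the URL with the filter/sort query parameters removed."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key not in drop
    ]
    # Rebuild the URL without a fragment and with only the kept parameters.
    return urlunsplit((parts.scheme, parts.netloc, parts.path, urlencode(kept), ""))


def canonical_link_tag(url):
    """Build the <link rel="canonical"> tag for the page head."""
    return '<link rel="canonical" href="{}" />'.format(escape(canonical_url(url), quote=True))


print(canonical_link_tag("http://www.xyz.com/overview?order=popular&sortorder=ASC"))
```

Called once in the shared page template, something like this would cover every filtered variant without editing pages individually.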

nvs.nim

Rel-canonical will be an immense job. Would it be OK to use robots.txt like this?

disallow: /?order=

disallow: /?sortorder=

...

My-Favourite-Holiday-Cottages

Last I checked, Rogerbot obeys robots.txt and meta robots directives, as he tries to simulate what Google would crawl. If he can crawl those pages, Google probably can as well.

I think if you rel-canonical them properly, or noindex them properly, etc., each one should show up as a normal page.
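One thing worth checking on the robots.txt rules proposed above: Disallow patterns are matched from the start of the URL path, so `/?order=` would only cover a query string on the site root, not `/product/brand-x/3969?order=...`. Google documents `*` (any characters) and a trailing `$` (end of URL) as wildcards; whether Rogerbot treats them the same way is an assumption to confirm in the Moz documentation. The sketch below is a hypothetical helper, `rule_matches`, that mimics that style of matching so patterns can be tried against the URLs from this thread before deploying them.

```python
import re


def rule_matches(pattern, path_and_query):
    """Check a robots.txt-style pattern against a path plus query string.

    Assumed semantics: prefix match from the start of the path,
    '*' matches any run of characters, a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape everything literally, then turn each '*' into '.*'.
    regex = ".*".join(re.escape(piece) for piece in pattern.split("*"))
    if anchored:
        regex += "$"
    return re.match(regex, path_and_query) is not None


url = "/product/brand-x/3969?order=popular&sortorder=DESC&page=10"
for pattern in ("/?order=", "/*?order=", "/*sortorder="):
    print(pattern, rule_matches(pattern, url))
```

Under these assumptions, the first pattern does not match the product URL while the two wildcard patterns do, which is why a rule like `/*?order=` is the more likely fit here.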
