Panda Updates - robots.txt or noindex?

ianmcintosh

Hi,

I have a site that I believe has been impacted by the recent Panda updates. Assuming that Google has crawled and indexed several thousand pages that are essentially the same and the site has now passed the threshold to be picked out by the Panda update, what is the best way to proceed?

Is it enough to block the pages from being crawled in the future using robots.txt, or would I need to remove the pages from the index using the meta noindex tag? Of course, if I block the URLs with robots.txt, then Googlebot won't be able to access the pages in order to see the noindex tag.

Does anyone have any previous experience of doing something similar?

Thanks very much.

dmccarthy

This is a good read: http://www.seomoz.org/blog/duplicate-content-in-a-post-panda-world I think you should be careful with robots.txt, because blocking the bot's access will not cause Google to remove the content from its index. The listing will simply carry a message to the effect that Google is not quite sure what's on the page. I would use noindex to clear the pages out of the index first, before attempting robots.txt exclusion.
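
To make the ordering issue concrete, here is a rough Python sketch (standard library only) that checks whether a crawler could actually see a noindex tag on a page. The robots.txt contents, the example.com URL, and the helper names (`has_noindex`, `crawl_allowed`, `noindex_is_visible`) are all made up for illustration, and Python's robots.txt parser is only an approximation of how Googlebot interprets the file, so treat it as a sanity check rather than a definitive test.

```python
from html.parser import HTMLParser
from urllib.robotparser import RobotFileParser


class RobotsMetaParser(HTMLParser):
    """Collects directives from <meta name="robots"> / <meta name="googlebot"> tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = {name.lower(): (value or "") for name, value in attrs}
        if attr.get("name", "").lower() in ("robots", "googlebot"):
            for directive in attr.get("content", "").split(","):
                self.directives.add(directive.strip().lower())


def has_noindex(html):
    """True if the page's robots meta tag contains a noindex directive."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return "noindex" in parser.directives


def crawl_allowed(robots_txt, url, user_agent="Googlebot"):
    """True if the given robots.txt text lets user_agent fetch url."""
    rules = RobotFileParser()
    rules.parse(robots_txt.splitlines())
    return rules.can_fetch(user_agent, url)


def noindex_is_visible(robots_txt, url, html):
    """The noindex only counts if the crawler is allowed to fetch the page."""
    return crawl_allowed(robots_txt, url) and has_noindex(html)


# Hypothetical example data, not taken from any real site.
ROBOTS_BLOCKING = "User-agent: *\nDisallow: /duplicates/\n"
ROBOTS_OPEN = "User-agent: *\nDisallow:\n"
PAGE = '<html><head><meta name="robots" content="noindex, follow"></head></html>'
URL = "http://www.example.com/duplicates/page-1.html"

print(noindex_is_visible(ROBOTS_BLOCKING, URL, PAGE))  # False: tag is never fetched
print(noindex_is_visible(ROBOTS_OPEN, URL, PAGE))      # True: tag can be read
```

With the folder disallowed, the noindex tag is on the page but the crawler is never allowed to fetch it, which is why it seems safer to leave the pages crawlable until they have dropped out of the index.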

irvingw

Yes, both, because if a page is linked to from another site, Google will spider that other site and follow the link without hitting your robots.txt, and the page could get indexed if there is no noindex on it.

JarnoNijzing

Indeed, try both.

Irving +1

irvingw

Both. Block the lowest-quality, lowest-traffic pages with noindex and block the folder in robots.txt.
