I want to block search bots in crawling all my website's pages expect for homepage. Is this rule correct?

esiow2013

User-agent: *

Disallow: /*

GPainter

some great answers you can also find a list of all the robots here & here

Depending on your site you can also for example hide the rest of your site behind a login screen or a form which bots won't fill in.

esiow2013

Thanks Matt! I will surely test this one.

esiow2013

Thanks David! Will try this one.

Kingof5

Use this:

User-agent: Googlebot
Noindex: /

User-agent: Googlebot
Disallow: /

User-agent: *
Disallow: /

This is what I use to block our dev sites from being indexed and we've had no issues.

MattAntonino

Actually, there are two regex that Robots can handle - asterisk and $.

You should test this one. I think it will work (about 95% sure - tested in WMT quickly):

User-agent: *
Disallow: /
Allow: /$

Travis_Bailey

I don't think that will work. Robots.txt doesn't handle regular expressions. You will have to explicitly list all of the folders, and files to be super sure, that nothing is indexed unless you want it to be found.

This is kind of an odd question. I haven't thought about something like this in a while. I usually want everything but a couple folders indexed. : ) I found something that may be a little more help. Try reading this.

If you're working with extensions, you can use **Disallow:/*.html$ **or php or what have you. That may get you closer to a solution.

Definitely test this with a crawler that obeys robots.txt.

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

I want to block search bots in crawling all my website's pages expect for homepage. Is this rule correct?

Got a burning SEO question?

Browse Questions

Explore more categories

Related Questions

Robots.txt - Do I block Bots from crawling the non-www version if I use www.site.com ?

Title Tag Verses H1 Tag. Is having both the same better than different if there's only one clear winner in keyword search volume

Parallax site with snippets of internal pages on the homepage

Will adding 1000's of outbound links to just a few website impact rankings?

How do I prevent 404's from hurting my site?

To index or de-index internal search results pages?

What's next?

Most Painless way of getting Duff Pages out of SE's Index