Robots.txt question

seoug_2005

Hello,

What does the following robots.txt rule mean?

User-agent: *
Allow: /

Does it mean that we are blocking all spiders? Is Allow supported in robots.txt?

Thanks

KeriMorgret

It's a good idea to have an XML sitemap and make sure the search engines know where it is. It's part of the protocol that they will look in the robots.txt file for the location of your sitemap.

seoug_2005

I was assuming that by including / after Allow we are blocking the spiders, and I also thought that Allow is not supported by search engines.

Thanks for the clarifications. A better approach would be

User-Agent: *
Allow:

right?

The best one of course is

User-agent: *
Disallow:

john4math

That's not really necessary unless there are URLs or directories you're disallowing after the Allow in your robots.txt. Allow is a directive supported by major search engines, but search engines assume they're allowed to crawl everything they find unless you specifically disallow it in your robots.txt.

The following is universally accepted by bots and essentially means the same thing as what I think you're trying to say, allowing bots to crawl everything:

User-agent: *
Disallow:
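If you want to check this yourself, here is a minimal sketch using Python's standard urllib.robotparser module. The helper name `allowed`, the agent name "ExampleBot" and the example.com URL are placeholders I made up for illustration, and this only shows how Python's parser reads the rules, which may differ in details from any particular search engine's crawler.

```python
from urllib.robotparser import RobotFileParser


def allowed(robots_txt: str, url: str, agent: str = "ExampleBot") -> bool:
    """Parse robots.txt text and report whether `agent` may fetch `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


# The three variants discussed in this thread.
ALLOW_ALL = "User-agent: *\nAllow: /"
EMPTY_DISALLOW = "User-agent: *\nDisallow:"
BLOCK_ALL = "User-agent: *\nDisallow: /"

# Hypothetical URL, used only for the check.
url = "http://www.example.com/some/page.html"

for name, rules in [
    ("Allow: /", ALLOW_ALL),
    ("Disallow: (empty)", EMPTY_DISALLOW),
    ("Disallow: /", BLOCK_ALL),
]:
    print(f"{name:20} -> crawl allowed: {allowed(rules, url)}")
```

With this parser, "Allow: /" and an empty "Disallow:" both let the bot in, and only "Disallow: /" blocks everything.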

There's a sample use of the Allow directive on the Wikipedia robots.txt page.
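To illustrate where Allow actually earns its keep, here is a rough sketch of carving one file out of a disallowed directory, again checked with Python's urllib.robotparser. The paths (/private/ and /private/public.html), the agent name and the example.com domain are hypothetical and not taken from any real site.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules: block a directory but allow one file inside it.
RULES = """\
User-agent: *
Allow: /private/public.html
Disallow: /private/
"""

parser = RobotFileParser()
parser.parse(RULES.splitlines())


def check(path: str) -> bool:
    """Return True if a made-up bot may fetch `path` on example.com."""
    return parser.can_fetch("ExampleBot", "http://www.example.com" + path)


for path in ["/private/public.html", "/private/secret.html", "/index.html"]:
    print(f"{path:25} -> crawl allowed: {check(path)}")
```

The Allow line is listed first on purpose: Python's parser applies the first rule that matches, while Google documents using the most specific matching rule, so as far as I know putting the Allow before the broader Disallow gives the same result under both readings.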

KeriMorgret

There's more information about robots.txt from SEOmoz at http://www.seomoz.org/learn-seo/robotstxt

SEOmoz and the robots.txt site suggest the following for allowing robots to see everything and listing your sitemap:

User-agent: *
Disallow:

Sitemap: http://www.example.com/none-standard-location/sitemap.xml
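As a quick sanity check, the sketch below feeds that same file to Python's urllib.robotparser and reads back the Sitemap line. It assumes Python 3.8 or later (where `site_maps()` was added), and the agent name "ExampleBot" and the page URL are placeholders.

```python
from urllib.robotparser import RobotFileParser

# The robots.txt content suggested above.
ROBOTS_TXT = """\
User-agent: *
Disallow:

Sitemap: http://www.example.com/none-standard-location/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() returns a list of Sitemap URLs, or None if there are none.
sitemaps = parser.site_maps()
print("Sitemaps found:", sitemaps)

# The empty Disallow still leaves every URL crawlable.
crawlable = parser.can_fetch("ExampleBot", "http://www.example.com/any/page.html")
print("Crawl allowed:", crawlable)
```

The Sitemap line is independent of the User-agent groups, so it is picked up regardless of which bots the rules above it apply to.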

seoug_2005

Any particular reason for doing so?

JamesNorquay

That robots.txt should be fine.

But you should also add your XML sitemap to the robots.txt file, for example:

User-Agent: *
Allow: /

Sitemap: http://www.website.com/sitemap.xml
