What can I do if Google Webmaster Tools doesn't recognize the robots.txt file?

DotCar

I'm working on a recently hacked site for a client and and in trying to identify how exactly the hack is running I need to use the fetch as Google bot feature in GWT.

I'd love to use this but it thinks the robots.txt is blocking it's acces but the only thing in the robots.txt file is a link to the sitemap.

Unde the Blocked URLs section of the GWT it shows that the robots.txt was last downloaded yesterday but it's incorrect information. Is there a way to force Google to look again?

wrttnwrd

No, but they might write to it, modify it, or do all sorts of other nasty stuff I've seen hackers do when they get a hold of any writeable file on a system.

cbielich

lol it's a robots text file. what are they going to do. Steal it? I should have clarified do a 777 to make sure that is not your problem, then yes change the permission to be tighter

wrttnwrd

Eesh I don't recommend 777. 644 or, if you're going to change it right back, 755 at most.

cbielich

File permission maybe? Change it to 777 and try it again

loopyal

If you have shell access on Linux you can use wget or GET or run lynx.

If google is getting the wrong robots file then your web server must be sending out something other than what you think is the robots file.

What happens if you do this in your browser:

http://yourdomain.com/robots.txt

wrttnwrd

Looking in my log files, Google hits robots.txt just about every time it crawls our site.

What are you trying to accomplish using fetch as Googlebot? Any chance CURL could do the job for you, or another tool that ignores robots.txt?

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

What can I do if Google Webmaster Tools doesn't recognize the robots.txt file?

Got a burning SEO question?

Browse Questions

Explore more categories

Related Questions

Can I Block https URLs using Host directive in robots.txt?

Robots file set up

Dealing with 410 Errors in Google Webmaster Tools

How to Remove a website from your Bing Webmaster Tools account

I am cleaning up a clients link profile and am coming across a lot of directories (no surprise) My question is if an obvious fre for all generic directory doesn't look to have been hit by any updates is it a wise move recommending tit for removal?

Does Bing ignore robots txt files?

Why isn't Google pushing my Schema data to the search results page

Does Google pass link juice a page receives if the URL parameter specifies content and has the Crawl setting in Webmaster Tools set to NO?