What can I do if Google Webmaster Tools doesn't recognize the robots.txt file?

DotCar

I'm working on a recently hacked site for a client and and in trying to identify how exactly the hack is running I need to use the fetch as Google bot feature in GWT.

I'd love to use this but it thinks the robots.txt is blocking it's acces but the only thing in the robots.txt file is a link to the sitemap.

Unde the Blocked URLs section of the GWT it shows that the robots.txt was last downloaded yesterday but it's incorrect information. Is there a way to force Google to look again?

wrttnwrd

No, but they might write to it, modify it, or do all sorts of other nasty stuff I've seen hackers do when they get a hold of any writeable file on a system.

cbielich

lol it's a robots text file. what are they going to do. Steal it? I should have clarified do a 777 to make sure that is not your problem, then yes change the permission to be tighter

wrttnwrd

Eesh I don't recommend 777. 644 or, if you're going to change it right back, 755 at most.

cbielich

File permission maybe? Change it to 777 and try it again

loopyal

If you have shell access on Linux you can use wget or GET or run lynx.

If google is getting the wrong robots file then your web server must be sending out something other than what you think is the robots file.

What happens if you do this in your browser:

http://yourdomain.com/robots.txt

wrttnwrd

Looking in my log files, Google hits robots.txt just about every time it crawls our site.

What are you trying to accomplish using fetch as Googlebot? Any chance CURL could do the job for you, or another tool that ignores robots.txt?

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

What can I do if Google Webmaster Tools doesn't recognize the robots.txt file?

Got a burning SEO question?

Browse Questions

Explore more categories

Related Questions

Can I use a 301 redirect to pass 'back link' juice to a different domain?

Is there a limit to how many URLs you can put in a robots.txt file?

Google Crawling Issues! How Can I Get Google to Crawl My Website Regularly?

John Mueller says don't use Schema as its not working yet but I get markup conflicts using Google Mark-up

The use of robots.txt

'External nofollow' in a robots meta tag? (advertorial links)

Is having no robots.txt file the same as having one and allowing all agents?

I add microdata but why Google don't show it in SERP?