Best way to block a sub-domain from being indexed

JohnW-UK

Hello,

The search engines have indexed two sub-domains I did not want indexed: old.domain.com and dev.domain.com. I was going to password-protect them, but is there a best-practice way to block them?

My main domain's default robots.txt says:

Sitemap: http://www.domain.com/sitemap.xml

# global

User-agent: *
Disallow: /cgi-bin/
Disallow: /wp-admin/
Disallow: /wp-includes/
Disallow: /wp-content/plugins/
Disallow: /wp-content/cache/
Disallow: /wp-content/themes/
Disallow: /trackback/
Disallow: /feed/
Disallow: /comments/
Disallow: /category//
Disallow: */trackback/
Disallow: */feed/
Disallow: /comments/
Disallow: /?

KristinaKledzik

Hi,

CleverPhD has some interesting ideas with robots.txt and Google Webmaster Tools, but simply password protecting all dev pages should keep pages out of Google's index. There's no best practice here, since a password wall will keep Googlebot out on its own.

To be doubly safe, you can also include a meta noindex tag on dev pages.
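For reference, a meta noindex tag is `<meta name="robots" content="noindex">` placed in the page's head. Below is a minimal Python sketch, using only the standard library, for checking whether a dev page carries it. The class `RobotsMetaFinder`, the helper `has_noindex`, and the two sample pages are made up for illustration and are not part of any tool mentioned in this thread.

```python
from html.parser import HTMLParser


class RobotsMetaFinder(HTMLParser):
    """Collects the directives of every <meta name="robots"> tag."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        # Attribute values can be None (e.g. <meta name>), so default to "".
        attr = {key.lower(): (value or "") for key, value in attrs}
        if attr.get("name", "").lower() == "robots":
            # content looks like "noindex, nofollow"; split it into directives.
            for part in attr.get("content", "").split(","):
                self.directives.add(part.strip().lower())


def has_noindex(html: str) -> bool:
    """Return True if the page has a robots meta tag containing noindex."""
    finder = RobotsMetaFinder()
    finder.feed(html)
    return "noindex" in finder.directives


# Hypothetical sample pages, for illustration only.
DEV_PAGE = (
    '<html><head><meta name="robots" content="noindex, nofollow"></head>'
    "<body>dev</body></html>"
)
LIVE_PAGE = "<html><head><title>Home</title></head><body>live</body></html>"

if __name__ == "__main__":
    print(has_noindex(DEV_PAGE))
    print(has_noindex(LIVE_PAGE))
```

A check like this could also be run against production pages before a release, to confirm that a dev-only noindex tag has not slipped onto the live site.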

Keep in mind that once a page is in Google's index, it's going to take a while for it to leave (unless you use CleverPhD's method). But having a blank page in Google's index really isn't all that bad. It's there, but it won't rank for much.

Hope this helps,

Kristina

KristinaKledzik

I've never tried a method like this - FreshFireOne, did you?

CleverPhD

First and foremost, when you finish all this, password protect your dev instances. A URL will leak out eventually and then this happens. I know it is a PIA, but it is worth it.

To remove the subdomains, go into GWT (Google Webmaster Tools) and register each subdomain as a separate website. Create a robots.txt for each subdomain (not the one you mention; you need a robots.txt that is specific to that subdomain and disallows all files). If you can't do that, have your subdomains include a noindex meta tag on all pages. You have to be careful with this, as you do not want to push your dev robots.txt or the noindex meta tags out to your production server, but it can be done. Talk to your devs. Then go into GWT and use the URL removal tool. Just leave it blank and it will remove the whole site.
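As a rough illustration of the robots.txt step: crawlers read robots.txt per hostname, so the file at www.domain.com does not cover dev.domain.com or old.domain.com. A disallow-all file is just `User-agent: *` followed by `Disallow: /`. The Python sketch below uses the standard library's `urllib.robotparser` to confirm that such a file blocks every path. The helper names `robots_url` and `is_blocked` are assumptions for this example, and the URLs are the placeholder domains from the question.

```python
from urllib.parse import urlsplit
from urllib.robotparser import RobotFileParser

# A robots.txt that asks every crawler to stay out of the whole host.
DISALLOW_ALL = """\
User-agent: *
Disallow: /
"""


def robots_url(page_url: str) -> str:
    """Return the robots.txt location that governs page_url (same scheme and host)."""
    parts = urlsplit(page_url)
    return f"{parts.scheme}://{parts.netloc}/robots.txt"


def is_blocked(robots_txt: str, page_url: str, agent: str = "Googlebot") -> bool:
    """Return True if robots_txt disallows agent from fetching page_url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch(agent, page_url)


if __name__ == "__main__":
    # Each subdomain needs its own file; the main domain's file is not consulted.
    print(robots_url("http://dev.domain.com/some/page"))
    print(is_blocked(DISALLOW_ALL, "http://dev.domain.com/some/page"))
```

Running the same check against the production robots.txt before a deploy is one way to catch the mistake warned about above, where the dev file gets pushed to the live server.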

Poof. Gone. You can then watch the GWT accounts. They will show errors for the dev site like "Severe health issues are found on your site - Some important page has been removed by request." This is a good error as it confirms that that subdomain is removed.

We actually used this not on a dev site but on our www1 server, which had been indexed. We use a load balancer with multiple copies of the site, and www1 was competing with www. The method above did the trick.
