How do I disallow crawl on a directory when it's a prefix to my site's URL?

Simon-Plan

I am trying to disallow our media repository (hosted elsewhere, but appears as a directory on our site) from being crawled by robots but it is not a subdirectory of the site, it's a prefix.

So I need to disallow: mediabank.mywebsite.org

Not: mysite.org/mediabank

What would I need to put in my robots.txt and/or the other host's robots.txt to make this happen?

Thanks!

tawnycase

Hey there! Tawny from Moz's Help Team here.

You'll want to add a robots.txt file for that subdomain, and then add a Disallow command to that robots.txt file. So, using your example, you'd want a file like mediabank.mywebsite.org/robots.txt that had a Disallow command for any robots you don't want crawling that subdomain.

For all user-agents, that would look something like this:

User-agent: *
Disallow: /

That would stop any user-agents from crawling any pages on that subdomain.

I hope this helps! If you've still got questions, feel free to send us a note at help@moz.com and we'll do our best to sort things out for you.

Alick300

Hi,

Please check this old thread on the same topic @ https://moz.com/community/q/block-an-entire-subdomain-with-robots-txt

Thanks

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

How do I disallow crawl on a directory when it's a prefix to my site's URL?

Got a burning SEO question?

Browse Questions

Explore more categories

Related Questions

Is there a way to export all your crawl errors for multiple Moz campaigns at once?

Sorry, but that URL is inaccessible?

Crawl test csv has lost its formatting??

I'm getting a Crawl error 605 Page Banned by robots.txt, X-Robots-Tag HTTP Header, or Meta Robots Tag

Why'd Moz stop showing the list of users?

Moz ranking report shows massive loss in Bing SERP's. But it is not the case?

Moz crawl sees meta description but there are none

Moz Dupe content crawl anomaly