Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

Moz Q&A is closed.

After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.

Robots.txt: Can you put a /* wildcard in the middle of a URL?

Intermediate & Advanced SEO

IHSwebsite last edited by

We have noticed that Google is indexing the language/country directory versions of directories we have disallowed in our robots.txt.

For example:

Disallow: /images/ is blocked just fine

However, once you add our /en/uk/ directory in front of it, there are dozens of pages indexed.

The question is: can I put a wildcard in the middle of the string, e.g. /en/*/images/, or do I need to list out every single country for every language in the robots file? Does anyone know of any workarounds?
irvingw last edited by

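Yes, wildcards work in the middle of a path, thank god. A rule such as Disallow: /en/*/images/ matches /en/uk/images/ and every other country directory under /en/.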
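
If you want to sanity-check a pattern before deploying it, here is a minimal Python sketch of how the wildcard matching is generally described: `*` matches any run of characters, a trailing `$` anchors the end of the URL, and everything else is a literal prefix match from the start of the path. The helper names `rule_to_regex` and `is_blocked` are made up for this example, and the sketch ignores Allow rules and per-crawler precedence, so treat it as an approximation rather than what any search engine actually runs.

```python
import re


def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex (hypothetical helper).

    '*' becomes '.*'; a trailing '$' anchors the end of the path;
    every other character is treated as a literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' where '*' appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_blocked(path: str, disallow_rules: list[str]) -> bool:
    """Return True if any non-empty Disallow pattern matches from the start of the path."""
    # re.match anchors at the beginning of the string, which mirrors prefix matching.
    return any(rule_to_regex(rule).match(path) for rule in disallow_rules if rule)


rules = ["/images/", "/en/*/images/"]
print(is_blocked("/images/logo.png", rules))        # plain prefix rule
print(is_blocked("/en/uk/images/logo.png", rules))  # wildcard in the middle
print(is_blocked("/fr/fr/images/logo.png", rules))  # not covered by either rule
```

Note that /en/*/images/ only covers paths under /en/. Because `*` can also match across slashes, a single rule like /*/images/ should cover every language and country directory, so there is no need to list them one by one.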

Related Questions

Block session id URLs with robots.txt

Hi, I would like to block all URLs with the parameter '?filter=' from being crawled by including them in the robots.txt. Which directive should I use:
User-agent: *
Disallow: ?filter=
or
User-agent: *
Disallow: /?filter=
In other words, is the forward slash at the beginning of the disallow directive necessary? Thanks!
Intermediate & Advanced SEO | | Mat_C

What can we do to optimize / be mobile-friendly for PDFs?

I'm getting a "Your page is not mobile-friendly." notice in the SERPs for all of our PDFs. I checked the PDF on the phone and it appears just fine.
Intermediate & Advanced SEO | | johnnybgunn

Redirect wordpress from /%post_id%/%postname%/ to /blog/%postname%/

Hi, what is the code to redirect a WordPress blog from site.com/%post_id%/%postname%/ to site.com/blog/%postname%/? We are moving the site to a new server and a new URL structure. Thanks in advance
Intermediate & Advanced SEO | | Taiger

What do you add to your robots.txt on your ecommerce sites?

We're looking at expanding our robots.txt, as we currently don't have the ability to noindex/nofollow. We're thinking about adding the following: Checkout and Basket. Then possibly: Price, Theme, Sortby and other misc filters. What do you include?
Intermediate & Advanced SEO | | ThomasHarvey

Why is /home used in this company's home URL?

Just working with a company that has chosen a home URL with /home latched on - very strange indeed - has anybody else come across this kind of homepage URL "decision" in the past? I can't see why on earth anybody would do this! Perhaps simply a logic-defying decision?
Intermediate & Advanced SEO | | McTaggart

Canonical URL & sitemap URL mismatch

Hi, we're running a Magento store which doesn't have too much stock rotation. We've implemented a plugin that will allow us to give products custom canonical URLs (basically including the category slug, which is not possible through vanilla Magento). The sitemap feature doesn't pick up on these URLs, so we're submitting URLs to Google that are available and will serve content, but actually point to a longer URL via a canonical meta tag.
The content is available at each URL and is near identical (all apart from the breadcrumbs).
All instances of the page point to the same canonical URL.
We are using the longer URL in our internal architecture/link building to show this preference.
My questions are: Will this harm our visibility? Aside from editing the sitemap, are there any other signals we could give Google? Thanks
Intermediate & Advanced SEO | | tomcraig86

Product or Shop in URL

Which do you think is better for SEO and for sales? I am using WooCommerce for a health products website: websitename.com/product/keyword OR websitename.com/shop/keyword
Intermediate & Advanced SEO | | MasonBaker

How to Disallow Tag Pages With Robot.txt

Hi, I have a site I'm dealing with that has tag pages, for instance http://www.domain.com/news/?tag=choice. How can I exclude these tag pages (about 20+ are being crawled and indexed by the search engines) with robots.txt? Also, sometimes they're created dynamically, so I want something which automatically excludes tag pages from being crawled and indexed. Any suggestions? Cheers, Mark
Intermediate & Advanced SEO | | monster99
