Robots.txt advice

eLab_London

Hey Guys,

Have you ever seen coding like this in a robots.txt, I have never seen a noindex rule in a robots.txt file before - have you?

user-agent: AhrefsBot

User-agent: trovitBot
User-agent: Nutch
User-agent: Baiduspider
Disallow: /

User-agent: *
Disallow: /WebServices/
Disallow: /*?notfound=
Disallow: /?list=
Noindex: /?*list=
Noindex: /local/
Disallow: /local/
Noindex: /handle/
Disallow: /handle/
Noindex: /Handle/
Disallow: /Handle/
Noindex: /localsites/
Disallow: /localsites/
Noindex: /search/
Disallow: /search/
Noindex: /Search/
Disallow: /Search/
Disallow: ?

I have never seen a noindex rule in a robots.txt file before - have you?
Any pointers?

Martijn_Scheijbeler

Never seen this, doubt it's any useful as this isn't part of any search engines recommended statements to use. I don't think this would have any impact on what search engine robots would look at as it's not a statement in the robots.txt documentation.

Tylerj

Best I could find was-

Unlike disallowed pages, noindexed pages don’t end up in the index and therefore won’t show in search results. Combine both in robots.txt to optimise your crawl efficiency: the noindex will stop the page showing in search results, and the disallow will stop it being crawled

From-https://www.deepcrawl.com/blog/best-practice/robots-txt-noindex-the-best-kept-secret-in-seo/

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

Robots.txt advice

Got a burning SEO question?

Browse Questions

Explore more categories

Related Questions

Keyword rank drop, any advice?

Robots.txt was set to disallow for 14 days

Scary bug in search console: All our pages reported as being blocked by robots.txt after https migration

Set Robots.txt file to crawl my website at specific times

Our parent company has included their sitemap links in our robots.txt file - will that have an impact on the way our site is crawled?

Your advice regarding thin content would be really appreciated

Blocking out specific URLs with robots.txt

1200 pages no followed and blocked by robots on my site. Is that normal?