Is it hurting my SEO ranking if robots.txt is forbidden?
-
robots.txt is forbidden - I have read up on what the robots.txt file does and how to configure it, but what happens if the file cannot be accessed at all (for example, if the server returns a 403 Forbidden)?
-
Yes, excluding certain pages can benefit your rankings if the excluded pages could be considered duplicate content, either with your marketing pages or with each other.
This is usually the case for blogs (think WordPress categories) or webshops (pagination, as well as single product pages reachable by different paths and thus having different URLs). As Ryan pointed out, control that at the page level via noindex,follow to allow PageRank to flow. Use noindex,nofollow for "internal" pages you don't want to see crawled.
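As a sketch, those page-level controls would look like this in each page's `<head>` (hypothetical snippets; which variant you use depends on whether you want PageRank to flow through the page's links):

```html
<!-- Duplicate-ish pages (categories, pagination): keep them out of the
     index, but let PageRank flow through their links -->
<meta name="robots" content="noindex,follow">

<!-- Purely internal pages you don't want followed onward at all -->
<meta name="robots" content="noindex,nofollow">
```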
I am not sure, but having 9,950 pages indexed that are considered duplicate content might hurt rankings for the other pages on that domain; Google might consider the domain spammy.
If you need a specific hint for your domain, send me a PM and I will have a look if time permits.
-
In general, I do not use robots.txt. It is a better practice to use "noindex" for the pages you do not wish to have indexed.
If I had a 10k-page site with 50 marketing pages, I would either want to index the entire site, or question why the other 99% of the site exists if it does not help market the products. There are numerous challenges your scenario presents. If you block 99% of your site with robots.txt or the noindex meta tag, you are severely disrupting the flow of PageRank throughout your site. Also, you are either blocking content which should be indexed, or you are wasting time and resources creating junk pages on your site.
If the content truly should not be indexed, it likely should be moved to another site. I would need a lot more details about the site, its purpose, and the pages involved. Whatever the proper solution is, it is not likely to involve using robots.txt to block 99% of the site.
-
So in regards to increasing ranking: is there a benefit to using the robots.txt file to index only certain "marketing" pages and exclude other content that may dilute your site? For example, let's say I have 10,000 pages but only about 50 or so are my marketing pages. Would using robots.txt to have only my main marketing pages crawled help place emphasis on that content?
-
Sebes is correct. To add a bit more: it is not necessary to provide a robots.txt file. Actually, in most cases it is preferable not to use the file, but it becomes necessary if you do not have direct control over the code used in every page of your site. For example, if you have a CMS- or ecommerce-based site, you likely do not have control over many of the pages that are automatically generated by the software. In these cases, the only ways you can control how crawlers treat your site's pages are either to pay for custom modifications to your site's code or to use a robots.txt file.
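For illustration, a hypothetical robots.txt for a CMS-based site might block only the software-generated paths while leaving everything else crawlable (the paths here are made-up examples, not taken from any real site):

```
User-agent: *
Disallow: /search/
Disallow: /tag/
Disallow: /print/

Sitemap: https://www.example.com/sitemap.xml
```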
-
If the robots.txt file cannot be read by Google or Bing, they generally assume they can crawl as much as they want: per Google's documentation, a robots.txt that returns a 4xx error (including 403 Forbidden) is treated as if the file simply did not exist, so crawling is unrestricted, while 5xx server errors can cause Google to temporarily treat the whole site as disallowed. Check Google Webmaster Tools to see whether Google can "see" and access your robots.txt file.
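As an aside, you can sketch how a crawler interprets a readable robots.txt using Python's standard library; the rules and URLs below are made-up examples, and the empty-file case mirrors the "crawl everything" fallback described above:

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content (not from the asker's site)
robots_txt = """\
User-agent: *
Disallow: /internal/
"""

parser = RobotFileParser()
parser.parse(robots_txt.splitlines())

# Paths outside the disallowed prefix are crawlable; /internal/ is not.
allowed = parser.can_fetch("*", "https://example.com/marketing-page")
blocked = parser.can_fetch("*", "https://example.com/internal/report")

# An empty rule set (what crawlers fall back to when robots.txt is
# missing or returns a 4xx error) leaves everything crawlable:
empty = RobotFileParser()
empty.parse([])
default_allowed = empty.can_fetch("*", "https://example.com/anything")
```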