When You Add a Robots.txt file to a website to block certain URLs, do they disappear from Google's index?
-
I have seen several websites recently that have have far too many webpages indexed by Google, because for each blog post they publish, Google might index the following:
- www.mywebsite.com/blog/title-of-post
- www.mywebsite.com/blog/tag/tag1
- www.mywebsite.com/blog/tag/tag2
- www.mywebsite.com/blog/category/categoryA
- etc
My question is: if you add a robots.txt file that tells Google NOT to index pages in the "tag" and "category" folder, does that mean that the previously indexed pages will eventually disappear from Google's index? Or does it just mean that newly created pages won't get added to the index? Or does it mean nothing at all? thanks for any insight!
-
Hi William
If the pages in question are
- already indexed by Google then if you block them via the robots.txt , they will show up in search result but the meta description will say something along the lines of
A description for this result is not available because of this site's robots.txt – learn more.
2) not indexed by Google for example on a new site , they don't follow it and the pages does not come up in search directly BUT if some external sites link to the pages then they can still come up in the SERP some time down the track.
Your best bet to keep the page out of the public SERP index is the meta robots tag : http://www.robotstxt.org/meta.html
-
William, If the pages in question are linked to from external resources the robots.txt file will not prevent the pages from appearing in the index. Per Moz's Robots.txt and Meta Robots best practices, "the robots.txt tells the engines not to crawl the given URL, but that they may keep the page in the index and display it in in results.
To prevent all robots from indexing a page on your site, place the following meta tag into the section of your page:
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Making Shopify URL's Simpler - Losing the words 'collection', 'product' and 'page' in a Shopify store URL. Any advice?
Hi Mozers! I have a Shopify store (of which there are many advantages) however one big SEO disadvantage, is that my URL structures contravene all Moz advice on dynamic URL structure and whats more I am reminded about this every week when I have a Moz site crawl and I have a batch of URL's that are longe than the 75 characters. A Shopify URL will run www.domain name.com/collections/collection-name/product/product-name. According to advice a it should be www.domain name.com/collection-name/product-name - Don't even get started on sub-collections! I sell portfolio books, album etc and keepsake memory boxes (so long keywords) AND, I have a long(ish) business name. So, For user experience and keyword length, do I just ignore trying to achieve a dynamic URL under 75 characters? When I have asked Shopify, the say their URL's are an integral part of the "Ruby on Rails" system, so nothing can be done Or can it ??? I can't be the only Moz member with this issue can I ??
On-Page Optimization | | nick_HandCo0 -
Changing url (permalink) structure of website??
Currently I'm working on SEO of one website www.mocomi.com. I want to change url (permalink) structure of entire website which has more than 5000 pages. Currently website have structure of http://mocomi.com/tenali-raman-the-kings-condition/ Which I want to change it to http://mocomi.com/fun/stories/tenali-raman/tenali-raman-the-kings-condition/ Likewise I want to change entire website permalink url structure to make site architecture more SEO friendly. Which means I'am going to add only categories & subcategories before actual link. Kindly guide with following questions which I need to move forward with this step. How much is it worth to change URL structure? Checklist or factors I need to consider while making this decision? Is it a good practice to change URL's of entire website at once or Should I change it in Parts? How much time it takes google to rank those urls again? Which are the best practices to do so?
On-Page Optimization | | Mocomi1 -
I want to improve our client's website structure, so he gets more traffic locally. What advice do you have ?
We want to "revamp" our client's website, by improving the overall looking (content, images, structure). Our client is a small retail business but wants to have more traffic. What advice can you give me ?
On-Page Optimization | | marketingmedia.ca0 -
Google Index/Cashe questions
I have 15k+ pages. I have 4.5k pages indexed. What relation is the google cashe to indexing pages? My site gets cashed every two days. The competition in my SERP goes 2-3weeks to get cashed. What does this indicate? Is your cashe date your last google crawl? How can I get google to crawl my site? Is there a way I can get google to crawl my site starting from an internal page. This way I could set up a better linking structure that would benefit from doing activities that get that page indexed to help get my site indexed more thoroughly...
On-Page Optimization | | JML11790 -
Number ID's in URL's
If you have to place a number ID into a URL. Does it matter from an seo perspective whether it is placed in the middle or at the end? Thanks
On-Page Optimization | | simmo2350 -
How to avoid product's lists from making your site's content duplicated?
Hi there! We at Outitude, recently launched an outdoor activities marketplace and to make it easy for users to compare activities we show a list of available activities in each activity view. The problem is that though the content is different, the first half is practically identical. Example:
On-Page Optimization | | alexmc
Sailing for a full day: http://outitude.com/en/sailing/world/sailing-full-day and sailing for half a day: http://outitude.com/en/sailing/world/sailing-half-day both URL's are different, their content is different but most of it is not (first half of the page), so that the user can compare the activity it is currently seing with others. Questions: How can we show the activities list without it ruining the page rank? Do you advise the use of "", "" surrounding the duplicated content aka activities lists? Thanks in advance.0