When You Add a Robots.txt file to a website to block certain URLs, do they disappear from Google's index?
-
I have seen several websites recently that have have far too many webpages indexed by Google, because for each blog post they publish, Google might index the following:
- www.mywebsite.com/blog/title-of-post
- www.mywebsite.com/blog/tag/tag1
- www.mywebsite.com/blog/tag/tag2
- www.mywebsite.com/blog/category/categoryA
- etc
My question is: if you add a robots.txt file that tells Google NOT to index pages in the "tag" and "category" folder, does that mean that the previously indexed pages will eventually disappear from Google's index? Or does it just mean that newly created pages won't get added to the index? Or does it mean nothing at all? thanks for any insight!
-
Hi William
If the pages in question are
- already indexed by Google then if you block them via the robots.txt , they will show up in search result but the meta description will say something along the lines of
A description for this result is not available because of this site's robots.txt – learn more.
2) not indexed by Google for example on a new site , they don't follow it and the pages does not come up in search directly BUT if some external sites link to the pages then they can still come up in the SERP some time down the track.
Your best bet to keep the page out of the public SERP index is the meta robots tag : http://www.robotstxt.org/meta.html
-
William, If the pages in question are linked to from external resources the robots.txt file will not prevent the pages from appearing in the index. Per Moz's Robots.txt and Meta Robots best practices, "the robots.txt tells the engines not to crawl the given URL, but that they may keep the page in the index and display it in in results.
To prevent all robots from indexing a page on your site, place the following meta tag into the section of your page:
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Wordpress 'Hide Title' Feature, does this help shorten title length
Im wondering if anyone with some Wordpress experience can help me. I am using Yoast to create my page titles, but yet Moz tells me that my page titles including my actual page title tag which is 'dumfries wedding photography | Hemera Visuals' by clicking on the 'hide title' feature in wordpress will this in turn stop wordpress from automatically adding my page title and therfor bring my title length down drastically? And if so will I have to wait till google next crawls my page to see if this works? Kind Regards Cameron.
On-Page Optimization | | hemeravisuals120 -
What's the best Magento Community blog extension?
We are looking at FishPig's Word Press Integrations extension. has anybody used it? Possibly a dumb question, but is SEO adversely affected by the fact it's a WordPress extension on a Magento site?
On-Page Optimization | | Anne_Marie_English0 -
Inches or " Feet or ' Does Google translate the symbols?
I have a client who sells things that the size is important. In their industry some people say "15 Inch Blue Widget" and others say "15" Blue Widget" using the symbol " for inches. On the page I know we could say both to cover all the bases but I want to get the title right. In their industry there is not one more preferred than the other. Does anybody know if Google translates ' to feet and " to inches. Should I work both into the title for a product or only one?
On-Page Optimization | | JoshuaLindley0 -
Sold Products appear as duplicate pages 'Page Not Found' ???
Hi there, I'm down to just 6 duplicate page warnings but I'm not sure how to deal with this one: Information Page Not Found! http://www.vintageheirloom.com/index.php?route=information/information&information_id=6 My Ecommerce shopping site products are unique, 1 of a kind. So once one product has sold and been delivered we take the product off our website, hence the Information Page Not Found! As I understand when search engines re-index these warnings will drop off but new sold products would replace them. So redirecting seems like hard work and never ending. Is it ok to ignore these warnings? Thanks Mozzers..
On-Page Optimization | | well-its-1-louder0 -
Removing old URLs from Google
Hello, I am sure that this question has been asked many times, but I am still not sure what to do about the following: Our site's URL structure has changed a few times in the past few months. Recenty, we have changed our URLs to become more SEO friendly. However, Google has indexed the old URLs as well. To give an example: The following page in our website shows the following URLs in Google Webmaster Tools: Confúcio e Seus Ensinamentos /artigo/68_38/2/as_religioes_iv_confucio_e_seus_ensinamentos//aula/14_6132/vestibular/confucio_e_seus_ensinamentos//aula/1_14_6132/vestibular/confucio_e_seus_ensinamentos//aula/_14_6132/Vestibular/confucio_e_seus_ensinamentos//aula/ensino/confucio_e_seus_ensinamentos/ The correct URL is the last one. What should I do about the other ones? Almost all the pages in our website have this problem. We have redirected the old URLs to the new ones, but is there anything else we should do? We were asking Google to remove them, but Google has informed us that it has reached the limit. Please advise us on waht we should do. We have removed the old sitemap with the old URLs. What else must we do? Thank you very much.
On-Page Optimization | | Tev0 -
Advice on why page isn't being indexed in google top 1000 for keyword
Hi, http://www.cgcomposites.com.au/composite-material.html This page isn't listed for keywords 'composite material' It has been live for a few weeks and gets grade A report for onpage optimisation? regards Michael
On-Page Optimization | | bluelilyseo0