Removing a Page From Google's Index
-
We accidentally generated some pages on our site that ended up getting indexed by Google. We have corrected the issue on the site, and all of those pages now return a 404. Should we manually delete the extra pages from Google's index, or should we just let Google figure out that they are 404'd? What is the best practice here?
-
Thanks Ryan. I fully understand what you are saying and will be careful while making the change.
-
Hi Atul,
Generally speaking, I am uncomfortable advising others on specific changes to the .htaccess file. If you make even a slight error while working with the file, your site security can be compromised, not to mention your SEO. There are also many factors to consider, such as which mods are enabled on your particular server, along with other configuration issues. Lastly, the order in which your code is placed in the file can affect its operation, so it's not like adding a meta tag to the <head> section of an HTML document.
If you are on managed hosting, my recommendation is to ask your web host to make the change. If you are not on managed hosting, I recommend asking the developer who manages the site to make the change.
If you still insist on making the change yourself, try
Redirect gone /ABC/xyz.html
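For reference, here is a minimal sketch of how that directive could sit in an .htaccess file. It assumes Apache with mod_alias enabled and uses the hypothetical paths from the question (/ABC/xyz.html and /xyz/), so adjust it to your own setup. The URL-path has to begin with a slash (a relative path such as xyz.html is not allowed), and Redirect matches by prefix, so a directory path covers everything beneath it.

```apache
# Assumes mod_alias is available; the paths are the hypothetical ones from the question.
<IfModule mod_alias.c>
    # Return 410 Gone for a single page. The URL-path must start with "/".
    Redirect gone /ABC/xyz.html

    # Return 410 Gone for /xyz/ and every URL beneath it (prefix match).
    Redirect gone /xyz/
</IfModule>
```

No target URL is given because the gone status does not redirect anywhere; it only tells the client the resource has been removed.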
-
After reading your answer, I searched for methods of generating a 410 error.
Let's say I want to remove a page named xyz.html.
Which of the following entries in .htaccess is correct?
Redirect gone xyz.html
or
Redirect gone /xyz/
If xyz.html were in a folder named ABC, would this be correct?
Redirect gone /ABC/xyz.html
Thanks
-
Thanks everyone! We are just going to leave it as is. Google will eventually flush it out. Ryan, because of the 90 days we can't remove the URLs. I will need them back sooner than that, when we actually put products in those states. Thanks again! Helpful, as usual!
-
"Remove URL tool will just expedite the inevitable. There is no downside in doing so."
I agree with everything you shared Esko up to this point. Aside from the time spent to remove the page, there is another downside. The URL you remove will not appear in SERPs again for 90 days after being manually removed.
If your URL was mysite.com/blue-widgets, then your site will not have another /blue-widgets page listed again for 90 days. I can share that it is a headache as an SEO trying to figure out why a page is not being indexed, and later learning I did not ask all the right questions, i.e. "Prior to hiring my services, have you or anyone with access to your WMT account used the URL Removal tool within the past 90 days?". That otherwise obscure question is now asked regularly of my clients. Painful lesson.
Also, I wanted to share another helpful link I located from Google: When NOT to use the URL Removal tool.
-
Google will completely drop the page from the index after the next time they crawl it. Using the Remove URLs tool in Google Webmaster Tools will only expedite removal.
Best practice is to 404 (Not Found) or 410 (Gone) the page first of all.
Remove URL tool will just expedite the inevitable. There is no downside in doing so.
-
"Should we manually delete the extra pages from Google's index or should we just let Google figure out that they are 404'd? What is the best practice here?"
The best practice would be to generate a 410 error (GONE) for the pages and Google will remove them from their index fairly quickly.
The next best practice would be to leave the pages as 404s and Google will still remove them from their index but it will take a bit longer.
A 410 is used to inform Google and others the page is definitely gone. A 404 merely states the page is unavailable now. It could be available later.
The removal tool should only be used if it is a major concern for the search result to appear in SERPs. An example would be if confidential information was leaked.
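If the accidentally generated pages share a common URL pattern, a single pattern rule could serve the 410 for all of them. This is only a sketch: it assumes Apache with mod_alias, and the /generated-pages/ prefix is a hypothetical stand-in for wherever the unwanted pages actually live.

```apache
# Hypothetical pattern: assumes the unwanted pages all live under /generated-pages/.
<IfModule mod_alias.c>
    # Any URL-path starting with /generated-pages/ answers 410 Gone instead of 404 Not Found.
    RedirectMatch gone ^/generated-pages/
</IfModule>
```

If real content is later published at those URLs, the rule would need to be removed so the pages can return a normal 200 response again.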
-
I think it's always good to let Google know as they might remove it sooner. But there's no guarantee either way. Though if you can, you should 301 your content to a new/similar page rather than just let it 404.
-
I understand HOW to remove a page. I want to know whether it's better for me to manually remove it or let Google remove it on its own.
-
Remove a page from Google's Index
Use Google Webmaster Tools www.google.com/webmasters/tools/
http://www.google.com/support/webmasters/bin/answer.py?answer=1663419