Severe rank drop due to overwritten robots.txt
-
Hi,
Last week we made a change to drupal core for an update to our website. We accidentally overwrote our good robots.txt that blocked hundreds of pages with the default drupal robots.txt. Several hours after that happened (and we didn't catch the mistake) our rankings dropped from mostly first, second place in Google organic to bottom and mid first page.
Basically I believe we flooded the index with very low quality pages at once and threw a red flag and we got de-ranked.
We have since fixed the robots.txt and have been re-crawled but have not seen a return in rank.
Would this be a safe assumption of what happened? I haven't seen any other sites getting hit in the retail vertical yet in regards to any Panda 2.3 type of update.
Will we see a return in our results anytime soon?
Thanks,
Justin
-
Your present approach is correct. Ensure all these pages are tagged as noindex for now. Remove the block from robots.txt and let Google and Bing crawl these pages.
I would suggest waiting until you are confident all the pages were removed from Google's index, then check Yahoo and Bing. If you decide that robots.txt is the best decision for your company, then you can replace the disallows after confirming your site is no longer affected by these pages.
I would also suggest that, going forward, you ensure any new pages on your site that you do not wish to index always include the appropriate meta tag. If this issue happens again then you will have a layer of protection in place.
-
We're pretty confident thus far that we have flooded the index with about 15,000 low rank URLs all at once. This has happened once in the past a few years back but we didn't flood their index, they were newer pages at the time in which were low quality and could have been seen as spam since there was no real content but adsense so we removed them with a disallow in robots.
We are adding the meta no-index to all of these pages. You're saying we should remove the disallow in robots.txt so googlebot can crawl these pages and see the meta-noindex?
We are a very large site and we're crawled often. We're a PR7 site and MOZrank DA is 79/100. We have dropped from 82.
We're hoping these URLs will be removed quickly, I don't think there is a way of removing 15k links in GWMT without setting off flags also.
-
There is no easy answer for how long it will take.
If your theory about the ranking drop being caused by these pages being added is correct, then as these pages are removed from Google's index, your site should improve. The timeline depends on the size of your site, your site's DA, the PA and links for these particular pages, etc.
If it was my site I would mark the calendar for August 1st to review the issue. I would check all the pages which were mistakenly indexed to be certain they were removed. After, I would check the rankings.
-
Hi Ryan,
Thanks for your response. Actually you are correct. We have found some of the pages that should be no follows still indexed. We are now going to use the noindex, follow meta tags on these pages because we can't afford to have theses pages indexed as they are particularly for clients/users only and are very low quality and have been flagged before.
Now, how long until we see our rank move back? Thats the real big question.
Thanks so much for your help.
Justin
-
That's a great answer Ryan... I wonder, just out of curiosity, if it wouldn't hurt to look at the cached version of the pages if they're indexed? I'd be curious to know if the date they were cached is right near when the robots.txt was changed? I know it wouldn't alter his course of action, but might add further confirmation that this caused the problem?
-
Justin,
Based on the information you provided it's not possible to determine if the robots.txt file was part of the issue. You need to investigate the matter further. Using Google enter a query in an attempt to find some of the previously blocked content. For example, let's assume your site is about SEO but you shared a blog article about your movie review of the latest Harry Potter movie. You may have used robots.txt to block that article because it is unrelated to your site's focus. Perform a search for "Harry Potter insite:mysite.com" replacing mysite.com with your main web address. If the search returns your article, then you know the content was indexed. Try this approach for several of your previously blocked areas of the website.
If you find this content in SERPs, then you need to have it removed. The best thing to do is add the "noindex, follow" tags to all these pages, then remove the block from your robots.txt file.
The problem is that with the block in place on your robots.txt file, Google cannot see the new meta tag and does not know to remove the content from it's index.
One last item to mention. Google does have a URL removal tool but that would not be appropriate in this instance. That tool is designed to remove a page which causes direct damage by being in the index. Trade secrets or other confidential information can be removed with this tool.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Are these redirects damaging my rankings
Hi, just been going through my google webmaster tools and i have found a number of soft 404 errors and was shocked to see these redirects going to my home page. | URL | Response Code | Detected |
Technical SEO | | ClaireH-184886
| --- | --- | --- |<colgroup><col style="width: 45px;"><col style="width: 80px;"><col><col style="width: 120px;"><col style="width: 90px;"></colgroup>
| | 1 | staging1/jupgrade2/component/k2/item/475-gastric-band-hypnotherapy-expert-says-government-are-not-doing-enough-to-fight-obesity | | 7/25/13 |
| | 2 | staging1/jupgrade2/category-menu/item/1068-coronation-street-nick-proposes-to-leanne | | 7/28/13 |
| | 3 | staging1/jupgrade2/category-menu/item/1086-coronation-street-fiz-nearly-gets-tyrone-in-trouble-with-kirsty | | 7/28/13 |
| | 4 | staging1/jupgrade2/category-menu/item/1155-coronation-street-kylie-considers-telling-david-about-her-affair | | 7/28/13 |
| | 5 | staging1/jupgrade2/category-menu/item/1157-coronation-street-fiz-is-worried-when-kirsty-pays-her-a-visit | | 7/28/13 |
| | 6 | staging1/jupgrade2/component/k2/item/750-actress-hilary-duff-is-sued-over-car-accident | | 7/25/13 |
| | 7 | staging1/jupgrade2/component/k2/item/843-what-your-dogs-bottom-scooting-behaviour%E2%80%99-really-meansI would have thought that the developer who upgraded my site would have either had these blocked or directed them to the correct page. i am now guessing that there are going to be hundreds of these appearing and would like some serious advice.normally with old pages i would have them going to the relevant articles or pages but it seems the developer has redirected hundreds of these to my home page, which i guess is going to confuse google. i would like to know if this will confuse google have all these pages going to my home page.i have tried to find out where the developer in my site has redirected the pages as they are not in my htaccess file, i use joomla.can anyone let me know what i should be doing with these to solve the problemmany thanks |0 -
My Alexa ranking dropped after a 301 redirect is that bad?
I had all of my non www pages redirect to the www versions. My alexa ranking dropped and keeps dropping after I did this. I'm guessing its because its tracking the non www version. Does anyone know if this is correct and should I worry?
Technical SEO | | CandleCam0 -
Google ranking downgraded
On Sept 29th our website 'keyword' rankings who totally destroy. We were always #1 to #2 for these keywords 1. listing presentation
Technical SEO | | RandyRoussie
2. real estate presentation Our website is: http://www.agentpresentations.com We asked Google through Webmaster Tools to reconsider the site and they replied back saying nothing was wrong. We run Google Adwords and we contacted them and they looked at the keywords and sent screenshots to us showing the ranking we still there. But they are not... as proven in Webmaster tools. So Adwords told us to ask Webmaster Tools to reconsider the site again. Nothing yet... What do you suggest?0 -
Sudden Drop in Keyword Rankings
We launched http://www.manufacturedfun.com/ earlier this year and had been ranking 1 & 2 on Google SERPs for the keywords we optimized, but last week we experienced a sudden drop in rankings that pretty much took us off the radar. For instance, 'popcorn machines' went as far back as page 5 and 'popcorn poppers' dropped even further to page 9. We are currently working on fixing the numerous Duplicate Page Title and Duplicate Page Content errors identified by SEOmoz, but since we have had those for about 6 months and ranked well anyway, I wonder if there's something else that we are missing. Any insight you can offer will be sincerely appreciated. Thank you!
Technical SEO | | GRIP-SEO0 -
Does Bing ignore robots txt files?
Bonjour from "Its a miracle is not raining" Wetherby Uk 🙂 Ok here goes... Why despite a robots text file excluding indexing to site http://lewispr.netconstruct-preview.co.uk/ is the site url being indexed in Bing bit not Google? Does bing ignore robots text files or is there something missing from http://lewispr.netconstruct-preview.co.uk/robots.txt I need to add to stop bing indexing a preview site as illustrated below. http://i216.photobucket.com/albums/cc53/zymurgy_bucket/preview-bing-indexed.jpg Any insights welcome 🙂
Technical SEO | | Nightwing0 -
301 redirect dropped page rank
Hi, We have a www domain that I have changed to a non www domain. The www domain had been in place for some time and had a good page rank, PR4. After this change the page rank dropped significantly (PR0, and now recently back to PR2) despite it being a 301 redirect which I thought "should" carry over the page rank. Yes, I am aware I should have just left it be. Hind sight 20/20 .. ya ya ya 🙂 My questions Is the 301 the correct method for this? Why did the page rank drop despite the 301? Should we go back to the www domain at this point? Thanks Kris
Technical SEO | | adriot0 -
Robots.txt
Hi everyone, I just want to check something. If you have this entered into your robots.txt file: User-agent: *
Technical SEO | | PeterM22
Disallow: /fred/ This wouldn't block /fred-review/ from being crawled would it? Thanks0 -
Robots.txt File Redirects to Home Page
I've been doing some site analysis for a new SEO client and it has been brought to my attention that their robots.txt file redirects to their homepage. I was wondering: Is there a benfit to setup your robots.txt file to do this? Will this effect how their site will get indexed? Thanks for your response! Kyle Site URL: http://www.radisphere.net/
Technical SEO | | kchandler0