Homepage/Root domain de-indexed by Google
-
This morning I discovered that the homepage/root domain of our company site, http://www.collegeplus.org/, has been de-indexed by Google and Bing. Out IT dept. is claiming it's our fault because we changed the meta title on our homepage. But they will not give me access to GWT to see if there's any issues.
I believe the issue lies within our robots.txt file - http://www.collegeplus.org/robots.txt
I also don't believe we're suffering a penalty because all of our tier 2 pages are still indexed when any type of branded search is performed. We don't do things that can get a site de-indexed like this.
Any ideas on what the issue may be? Or at least something to convince our IT dept. that simply changing a meta title won't get your homepage totally de-indexed? Thanks.
-
When I was in a similar situation where I didn't have the best of relations with the development company, I used Pole Position's free Code Monitor (https://polepositionweb.com/roi/codemonitor/index.php) to check the robots.txt files of the live site and any development sites/subdomains on a daily basis. I'd get an email if anything had changed, so I could go to the dev company right away and try to mitigate any problems.
-
Hi Keri. Thank you for the info, I wasn't aware of the view only option. I'll send this post to our IT Director. Appreciate your help! Have a great weekend.
-
So sorry to hear about the battles going on. I've seen some of those, and they're no fun.
One thing that may be of help: last month Google rolled out new user access to GWT, including a way to let view without changing any settings (Barry Schwartz writes about it at http://www.seroundtable.com/google-webmaster-tools-users-14838.html). Is there a chance IT would let your team have a read-only view if you let them know it was now available?
-
Hi Dan. Greatly appreciate your response and insights. I think you've completely identified the issue(s). Basically from a technical SEO perspective our site is a trainwreck hit by a nuclear bomb. The battle between IT and my marketing department rages on, making it really difficult to get anything fixed. There's some politics at play that won't get solved here
Anyway, many thanks for your help on this. We'll try again tomorrow.
-
Hi David
First off (and I know I'm preaching to the choir here) but that's completely silly they won't let you look at WMT!! Seriously?! You're not going to BREAK anything just by looking!!
Arggg...
OK... now that we got that out. Let me give you some ideas.
- The homepage is missing from the sitemap - http://www.collegeplus.org/googlesitemap
- Also, shouldn't the sitemap end in .xml - as in /googlesitemap.xml ?
- The worst is I think what you point out from robots.txt - **Disallow: /.php$* Isn't this asking it to block all pages with the file extension .php??? IF so... your homepage does load with the php extension - http://www.collegeplus.org/index.php
- In general, Google's preferred method of keeping pages out of the index is with a meta robots noindex tag - as opposed to the robots.txt
- ALSO - look at this site search - **over 27,000 pages indexed for /**events?state - i'd say not good!\
- You're not using any canonical tags
- The homepage is NOT indexed in Bing either.
- The robots.txt file does look more messed up the more I look at it - for example they're blocking a forums subfolder, yet none exists on the site. It sits on a subdomain, and is still in the index as you can see here
So there's a lot going on here, and anything could be contributing to the deindexation of your homepage. But I'm <sarcasm>pretty sure</sarcasm> its not your title tags.
Hope that helps get you in the right direction. Either way you've got some on-site stuff to clean up.
-Dan
PS - Meant to say, on a happier note, it was nice to meet you at LinkLove Boston
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Domain not ranking in Google
https://www.buitenspeelgoed.nl/ is a domain acquired by our client. Previously this website was on http://www.buitenspeelgoed-keupink.nl. With the old domain they were ranking top 30 on 'buitenspeelgoed' in google.nl. Now with the new exact match domain they aren't ranking any more (for months). However, the website is indexed, as you can see on http://1l1.be/nz I don't know what to do anymore. Need some advise. What we allready have done the last months: made adjustments to the 301-redirects (this was originaly setup wrong by the webdesigner (de) optimized the homepage on 'buitenspeelgoed' (strange is the fact that the Moz robot can't access the site). Checked the robots.txt to see if the website was blocked for Google Checked the meta robots to see if the website was blocked for Google Disavowed some spammy (old) links which linked to the old domain Checked Search console > Fetch as Google if there isn't any Malware of some kind (and to see if Google can access the site) Checked Search consol to see if there manual spam actions (isn't the case) Checked for duplicate content by copy/paste some texts in Google and see if any other results are showing up (isn't the case for most of the texts) Please let me know what we can do.
Technical SEO | | InventusOnline0 -
Google still listing pages from old domain after 2 change requests
Good Morning I put forward the following question in December 2014 https://moz.com/community/q/google-still-listing-old-domain as pages from our old domain www.fhr-net.co.uk were still indexed in Google. We have submitted two change request in WMT, the most recent was over 6 months ago yet the old pages are still being indexed and we can't see why that would be Any advice would be appreciated
Technical SEO | | Ham19790 -
Removed Subdomain Sites Still in Google Index
Hey guys, I've got kind of a strange situation going on and I can't seem to find it addressed anywhere. I have a site that at one point had several development sites set up at subdomains. Those sites have since launched on their own domains, but the subdomain sites are still showing up in the Google index. However, if you look at the cached version of pages on these non-existent subdomains, it lists the NEW url, not the dev one in the little blurb that says "This is Google's cached version of www.correcturl.com." Clearly Google recognizes that the content resides at the new location, so how come the old pages are still in the index? Attempting to visit one of them gives a "Server Not Found" error, so they are definitely gone. This is happening to a couple of sites, one that was launched over a year ago so it doesn't appear to be a "wait and see" solution. Any suggestions would be a huge help. Thanks!!
Technical SEO | | SarahLK0 -
WordPress - How to stop both http:// and https:// pages being indexed?
Just published a static page 2 days ago on WordPress site but noticed that Google has indexed both http:// and https:// url's. Usually I only get http:// indexed though. Could anyone please explain why this may have happened and how I can fix? Thanks!
Technical SEO | | Clicksjim1 -
Is Google caching date same as crawling/indexing date?
If a site is cached on say 9 oct 2012 doesn't that also mean that Google crawled it on same date ? And indexed it on same date?
Technical SEO | | Personnel_Concept0 -
Having trouble removing homepage from google
For various reasons my client wants their homepage removed from google, no just the content of the page off but the page not to be indexed (yep strange request but we are mere service providers) today I requested in webmaster tool that default.asp was removed. Wht says done but the sites homepage is still listed. The page also has a no index tag on but 24 hours and 18k Google bot hits later it still remains. Anyone got any other suggestions to deindex just the homepage asap please
Technical SEO | | Grumpy_Carl0 -
I have a site that has both http:// and https:// versions indexed, e.g. https://www.homepage.com/ and http://www.homepage.com/. How do I de-index the https// versions without losing the link juice that is going to the https://homepage.com/ pages?
I can't 301 https// to http:// since there are some form pages that need to be https:// The site has 20,000 + pages so individually 301ing each page would be a nightmare. Any suggestions would be greatly appreciated.
Technical SEO | | fthead90 -
Google / Bing Product Feeds - Optimization - Taxonomies
Can anyone provide insightful optimization tips for GOOGLE and BING product feeds? How important is it to use the respective SE' taxonomies? Any other effective tactics apart from data stufffing?
Technical SEO | | DavidS-2820610