Getting rid of a site in Google
-
Hi,
I have two sites, let's call them Site A and Site B; both are subdomains of the same root domain. Because of a server configuration error, both got indexed by Google.
Google reports millions of inbound links from Site B to Site A.
I want to get rid of Site B, because it's duplicate content.
First I tried removing the site in Webmaster Tools and blocking all content in robots.txt for Site B. This removed all content from the search results, but the links from Site B to Site A stayed in place and kept increasing (even after two months).
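The robots.txt interaction described above can be reproduced with Python's standard `urllib.robotparser` (the hostname is a placeholder). A blanket Disallow means the crawler may not fetch any page at all, which is why it never discovers that the pages have changed or been removed:

```python
from urllib import robotparser

# The blanket block that was applied to Site B (hostname is hypothetical).
rules = [
    "User-agent: *",
    "Disallow: /",
]

rp = robotparser.RobotFileParser()
rp.parse(rules)

# No page may be fetched, so a crawler never sees the 404s or 301s behind the block.
allowed = rp.can_fetch("Googlebot", "http://siteb.example.com/any-page")
print(allowed)
```

With the block in place, `can_fetch` returns `False` for every URL, so from the crawler's point of view the old pages (and their links) simply persist.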
I also tried changing all the pages on Site B to return 404s, but this did not work either.
I then removed the blocks, cleaned up the robots.txt, and changed the server config on Site B so that everything redirects (301) to a landing page for Site B. But the links in Webmaster Tools from Site B to Site A are still increasing.
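For reference, a catch-all 301 like the one described could look like this, assuming an Apache server with mod_rewrite; the landing-page path is hypothetical:

```apacheconf
RewriteEngine On
# Send every request except the landing page itself to the landing page.
RewriteCond %{REQUEST_URI} !^/landing\.html$
RewriteRule ^ /landing.html [R=301,L]
```

The exclusion condition matters: without it, the landing page would redirect to itself in a loop.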
What do you think is the best way to delete a site from Google, and to delete all the links it had to other sites, so that there is NO history of the site? It seems that when you block it with robots.txt, the links and link equity do not disappear; only the "blocked by robots.txt" count in WMT increases.
Any suggestions?
-
The sites are massive and we are talking massive numbers:
Google reports in WMT that Site B still has 259,157,970 links to Site A, although when you filter into the report it only shows a few.
The current state is that nothing is blocked on Site B, and ALL pages redirect to the landing page of Site B.
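This blanket-redirect state can be sketched locally with Python's standard library; the handler below is a stand-in for the real server config, not the actual setup. A crawler (here, `urllib`) requesting any path ends up on the landing page with a 200:

```python
import http.server
import threading
import urllib.request

class RedirectHandler(http.server.BaseHTTPRequestHandler):
    """Every path 301-redirects to /landing, mimicking Site B's config."""
    def do_GET(self):
        if self.path == "/landing":
            self.send_response(200)
            self.send_header("Content-Type", "text/plain")
            self.end_headers()
            self.wfile.write(b"landing page")
        else:
            self.send_response(301)
            self.send_header("Location", "/landing")
            self.end_headers()

    def log_message(self, *args):
        pass  # silence per-request logging

server = http.server.HTTPServer(("127.0.0.1", 0), RedirectHandler)
threading.Thread(target=server.serve_forever, daemon=True).start()
port = server.server_address[1]

# Any old URL is followed through the 301 to the landing page.
with urllib.request.urlopen(f"http://127.0.0.1:{port}/any/old/page") as resp:
    final_url, status = resp.geturl(), resp.status
server.shutdown()
print(final_url, status)
```

Because every old URL resolves (via the 301) to a live page, the crawler can keep revisiting them, which is consistent with the crawl rate recovering after the 404 period.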
In WMT for Site B, Google still shows data for all the reports, such as search queries, keywords, and crawl errors (very old and all fixed). The reports and data do not bother me as much as the 259,157,970 links it reports to Site A.
On the 11th of April, when I started the process of getting rid of these links, there were 554,066,716; this jumped to 603,404,378 on the 28th of April. It then started dropping and was as low as 122,405,100 on the 17th of May, before growing again to the current 259,157,970.
I also noticed that while the pages were returning 404s, Google's crawl rate dropped to zero. Now that everything redirects to the landing page, the crawl rate is back up to about 1,800 pages per day, which is still very low considering the numbers we are talking about.
The crawl rate on Site A is okay, at 220,000 pages per day, but it was as high as 800,000 per day at one stage.
-
Even if you remove all history of a website from Google, it may still appear in the Wayback Machine.
If you blocked robots first, then they won't see the 301 redirects; they'll just keep the previously cached pages. Maybe remove the robots.txt and let Google crawl every page with the 301 to the landing page, then add the robots.txt back after they've been processed. Have you tried submitting a new sitemap in Webmaster Tools pointing all pages at the landing page?
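A minimal sitemap along the lines suggested above can be generated with Python's standard `xml.etree`; the URLs here are placeholders, and in practice the list would come from the site's CMS or server logs:

```python
import xml.etree.ElementTree as ET

# Hypothetical Site B URLs to resubmit for recrawling.
pages = [
    "http://siteb.example.com/",
    "http://siteb.example.com/products",
    "http://siteb.example.com/about",
]

NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
urlset = ET.Element("urlset", xmlns=NS)
for page in pages:
    url = ET.SubElement(urlset, "url")
    ET.SubElement(url, "loc").text = page

sitemap = ET.tostring(urlset, encoding="unicode", xml_declaration=True)
print(sitemap)
```

Submitting a sitemap of the old URLs does not keep them indexed; it just nudges the crawler to revisit them, so it can see the 301s sooner.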
Roughly how many pages are in your website?
-
I failed to mention that both Site A and Site B had the exact same content, database, and URL structure, with the only difference being the subdomain.