Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
My website is penalized by Google with no message in GWT.
-
On 26 October 2018 my website had around 1 million pages indexed on Google, but when I checked an hour later the site had been banned from Google and all pages had been removed. I checked my GWT and had not received any message. Can anyone tell me what the possible reasons are and how I can recover my website? My website link is https://www.whoseno.com
-
Would you be able to send me a DM with a copy of that email? I'm interested in larger automatically generated sites and am trying to figure out where the limit is (and why yours isn't allowed when others are).
-
Thank you for your responses. After 3 days I just received an email from Google with the reason. They are saying my website is generating automatic content.
-
This is a really fascinating question. It's highly irregular for Google to de-list a site with absolutely no reason given. Even if it's something really bad like serving malware to Google's users, you usually get a hacked content notification.
Your assertion that your site has been de-listed by Google, which you based on data you are seeing in various analytics packages, is backed up by Google's front-end:
- https://www.google.co.uk/search?q=site%3Awhoseno.com
- https://www.google.com/search?q=site%3Awhoseno.com
- https://www.google.fr/search?q=site%3Awhoseno.com
- https://www.google.bg/search?q=site%3Awhoseno.com
I can't find any pages from your site in Google US, UK, France or Bulgaria. Whatever has happened they seem to have gone fairly thermonuclear!
I performed a 25% crawl of your site using Screaming Frog (rendering / JS enabled), using Google's user agent (Googlebot). Some pages returned a 404 error:
- https://www.whoseno.com/number-information
- https://www.whoseno.com/whose-number-is-this
- https://www.whoseno.com/track-location
- https://www.whoseno.com/get-mobile-number-details
- https://www.whoseno.com/phone-number-details
- https://www.whoseno.com/get-complete-details-of-your-ex
- https://www.whoseno.com/track-any-mobile-number
- https://www.whoseno.com/wrong-number
- https://www.whoseno.com/whose-number-is-this-calling-me
- https://www.whoseno.com/phone-number-search
- https://www.whoseno.com/recent-lookups-on-whoseno
- https://www.whoseno.com/get-details-of-any-mobile
- https://www.whoseno.com/get-details-of-any-phone-number-for-free
- https://www.whoseno.com/trace-location-on-map
- https://www.whoseno.com/reverse-directory
- https://www.whoseno.com/track-location-by-phone-number
- https://www.whoseno.com/reverse-phone-lookup
- https://www.whoseno.com/phone-number-lookup
- https://www.whoseno.com/reverse-phone-lookup-service
Although this seems like quite a few broken pages, there were many more which were rendering properly. This just looks like the kind of stuff which Google would flag as crawl errors, rather than taking a site down in its entirety when the majority of pages return 200 (OK).
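For anyone who wants to repeat a rough version of this status check without Screaming Frog, here is a minimal sketch in Python. It is not what I ran for the crawl above. The user-agent string, the function names (`fetch_status`, `summarise_statuses`, `error_share`) and the timeout are all assumptions made for illustration, and it does not render JavaScript.

```python
from collections import Counter
from typing import Callable, Dict, Iterable
import urllib.error
import urllib.request

# Assumed Googlebot-style user-agent string; swap in whatever your crawler sends.
GOOGLEBOT_UA = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"


def fetch_status(url: str, timeout: float = 10.0) -> int:
    """Return the final HTTP status code for url, requested with GOOGLEBOT_UA.

    urlopen follows redirects, so a 301 that ends on a 200 page reports 200.
    Connection failures (URLError) are not caught and will propagate.
    """
    request = urllib.request.Request(url, headers={"User-Agent": GOOGLEBOT_UA})
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # 4xx/5xx responses are raised as exceptions; the code is what we want.
        return err.code


def summarise_statuses(
    urls: Iterable[str], fetch: Callable[[str], int] = fetch_status
) -> Dict[int, int]:
    """Count how many URLs returned each status code."""
    return dict(Counter(fetch(url) for url in urls))


def error_share(summary: Dict[int, int]) -> float:
    """Fraction of checked URLs that returned a 4xx or 5xx status."""
    total = sum(summary.values())
    if total == 0:
        return 0.0
    errors = sum(count for status, count in summary.items() if status >= 400)
    return errors / total
```

Calling `summarise_statuses(list_of_urls)` and then `error_share(...)` on the result gives a quick feel for whether the 404s are a small minority of pages or something closer to a site-wide failure.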
Google may frown upon some of the URLs, like the one about getting "complete details" of your ex. People shouldn't really be able to go on a site and get complete details for their ex-partner, as that promotes stalking (something which Google is firmly against, and which most first-world governments are moving to take more and more action on). Even if the name of the page is misleading and it doesn't (when working) really supply that functionality, that then makes it a spam page instead (as it looks to satisfy unscrupulous users looking for such information and then fails to deliver).
Out of the pages which are returning 200 (OK), most of them are individual phone number pages. An example might be this page: https://www.whoseno.com/US/2014623561 - the number has been publicly logged as spam. With the advent of GDPR legislation, if you are logging phone numbers and publicly keeping a database of them (without the permission of the phone number's owner) then you may be in breach of new European GDPR legislation (read about it here).
Google wants to continue operating in Europe, so whilst they may be an American company, GDPR does heavily impact Google. They want to comply with GDPR.
I checked the technical indexation of your pages and there don't seem to be any huge red flags.
- Robots.txt isn't blocking critical pages and resources
- Nor is the Meta no-index tag
- Canonical tags don't seem to be de-indexing real pages or pointing Google to broken ones
- Google's user-agent seems to be able to access most pages properly
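The first three checks in that list can be roughly sketched in Python using only the standard library. This is an illustration under assumptions, not the tooling I used: the names `IndexabilityParser`, `allowed_by_robots` and `check_page` are made up for the example, and it only looks at the HTML you pass in (no JavaScript rendering, no `X-Robots-Tag` HTTP header).

```python
from html.parser import HTMLParser
from typing import Dict, Optional
from urllib.robotparser import RobotFileParser


class IndexabilityParser(HTMLParser):
    """Collects the meta robots noindex flag and the canonical URL from a page."""

    def __init__(self) -> None:
        super().__init__()
        self.noindex = False
        self.canonical: Optional[str] = None

    def handle_starttag(self, tag, attrs):
        # Attribute values can be None (e.g. <meta name>), so normalise to "".
        attributes = {name: (value or "") for name, value in attrs}
        if tag == "meta" and attributes.get("name", "").lower() in ("robots", "googlebot"):
            if "noindex" in attributes.get("content", "").lower():
                self.noindex = True
        elif tag == "link" and "canonical" in attributes.get("rel", "").lower().split():
            self.canonical = attributes.get("href") or None


def allowed_by_robots(robots_txt: str, url: str, agent: str = "Googlebot") -> bool:
    """Check a URL against the text of a robots.txt file for the given agent."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


def check_page(html: str, url: str, robots_txt: str) -> Dict[str, object]:
    """Run the robots.txt, meta noindex and canonical checks for one page."""
    parser = IndexabilityParser()
    parser.feed(html)
    return {
        "robots_allowed": allowed_by_robots(robots_txt, url),
        "noindex": parser.noindex,
        "canonical": parser.canonical,
        # A missing canonical is treated as self-referencing here (an assumption).
        "canonical_is_self": parser.canonical in (None, url),
    }
```

A page looks healthy on these three points when `robots_allowed` is true, `noindex` is false, and the canonical either is absent or points back at the page itself.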
I decided to search for your site on Bing to see if they had also de-indexed you:
Bing still holds pages and records of your domain.
One of the results really interested me. There's a Twitter profile listed on those search results, the SERP snippet reads like this:
"Hussɑin Aвduℓℓɑtif (@whoseno) | Twitter
The latest Tweets from Hussɑin Aвduℓℓɑtif (@whoseno_)"
The Twitter profile has been suspended. This may or may not be your Twitter profile. If it's not yours, your digital identity may have accidentally been combined with that of this person, who may or may not have Twitter ToS or state-level action against them.
You need to go to Google's Webmaster support forum here and ask them what the deal is.
It's unlikely to be Penguin / link related and I don't think it's tech related either. It could be GDPR concerns, pollution of your digital identity (combined with a 3rd party who has state-level action against them), or it could be a basic 'Google glitch'.
-
Ok, this one may be interesting, if it's none of these options below I'd love to take a deeper look, send me a dm on twitter: https://twitter.com/thomasharvey_me
So, I see that you're on Cloudflare, are you still being crawled by Google?
Have you looked in the old search console? Have you or anyone you work with done anything in the "remove urls" section?
Have you seen any change in crawl stats recently?
Any recent changes to the site that may have caused this?