Spam posts indexed, what to do now?
-
Hi,
We had a staffing problem last week and let some spam posts through (cheap Nike jerseys, etc.), and they were indexed by Google. (We just checked and about 105 are already indexed.)
Of course we have now removed all these spam posts, but what is the best practice at this point? Are we supposed to do something else to remove them from Google's index (maybe through Google Webmaster Tools)? We have already edited robots.txt to disallow those pages as a quick remedy.
And finally, could this have done any harm? We were quite slow to notice and remove these posts; they were up for about 12 days.
thanks
-
Good to know
-
Hi,
Thanks for the comprehensive answer. We don't have any vulnerabilities. It was all my fault: I completely forgot that I had given administrative access to one of our former content managers, who had temporarily allowed anonymous users to post in one section of the site. Once he left, we forgot to update that permission and never noticed those posts until today.
-
Haha, I just saw you said "all those links had auto-nofollow on them"
NO PROBLEM MAN! rest easy! You cannot get penalized for nofollow links!
-
Thanks for the quick response. We're just requesting URL removal for all those URLs. I hope this makes it all good. No sign of a ranking drop at the moment. We're lucky those pages were automatically filtered out of our sitemap.xml and all those links had auto-nofollow on them. Time to consider buying a service like Mollom, I guess.
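For anyone who wants to double-check the "auto-nofollow" part rather than trust the CMS setting, here is a minimal sketch using only the Python standard library. It lists links in a saved copy of a page that do NOT carry rel="nofollow". The sample HTML and the spam.example domain are made up for illustration; they are not taken from the site discussed here.

```python
from html.parser import HTMLParser


class FollowedLinkFinder(HTMLParser):
    """Collects href values of <a> tags that lack rel="nofollow"."""

    def __init__(self):
        super().__init__()
        self.followed = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        attr_map = dict(attrs)
        href = attr_map.get("href")
        # rel can hold several space-separated tokens, e.g. "nofollow ugc"
        rel_tokens = (attr_map.get("rel") or "").lower().split()
        if href and "nofollow" not in rel_tokens:
            self.followed.append(href)


def followed_links(html):
    """Return the hrefs in `html` that search engines may follow."""
    finder = FollowedLinkFinder()
    finder.feed(html)
    return finder.followed


# Hypothetical page fragment: one spam link with nofollow, one normal link.
SAMPLE = (
    '<p><a href="http://spam.example/jerseys" rel="nofollow">jerseys</a> '
    '<a href="/about">About</a></p>'
)
print(followed_links(SAMPLE))
```

If a spam URL shows up in that list, the nofollow setting was not applied to it and it deserves a closer look.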
-
Do you know how the spam posts were published on your site? Just make sure the vulnerability is fixed so it doesn't happen again. Once the spam posts you found have been deleted from your site, you shouldn't have to do anything more since they will fall out of Google's index. Keep an eye on Google Webmaster Tools though to see if you notice any more spam pages pop up on Google's radar and then manually remove them.
Here is Google's official answer - http://support.google.com/webmasters/bin/answer.py?hl=en&answer=164734
When a page is updated or removed, it will automatically fall out of our search results. You don’t need to do anything to make this happen.
However, if you urgently need to remove content from Google's search results (for example, if you’ve already removed, updated, or blocked a page accidentally displaying confidential information like credit card numbers), you can request expedited removal of those URLs.
Our removal tools are intended for pages that urgently need to be removed—for example, if they contain confidential data that was accidentally exposed. Using the tools for other purposes may cause problems for your site.
Another Google resource if your site was actually hacked or compromised - http://support.google.com/webmasters/bin/answer.py?hl=en&answer=1269119
To take your site "offline" after being hacked. If your site was hacked and you want to get rid of bad URLs that got indexed, use the URL removal tool to remove any new URLs that the hacker created—for example, http://www.example.com/buy-cheap-cialis-skq3w598.html. But we don't recommend removing your entire site, or removing URLs that you'll eventually want indexed. Instead, clean up the hacking and let us recrawl your site.
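Letting deleted pages fall out of the index on their own assumes the old URLs now answer with 404 or 410. A rough way to confirm that for a list of removed URLs is sketched below, using only the Python standard library. The function names and status labels are my own, not part of any Google tool, and the network call is left for you to run against your own URLs.

```python
import urllib.error
import urllib.request


def classify_status(code):
    """Map an HTTP status code to a rough label for a removed page."""
    if code in (404, 410):
        return "gone"            # what a deleted spam post should return
    if 200 <= code < 300:
        return "still live"      # page (or a redirect target) still serves content
    return "check manually"      # 5xx, 403, etc. need a human look


def fetch_status(url, timeout=10):
    """Return the final HTTP status for `url` (redirects are followed)."""
    request = urllib.request.Request(url, method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # urlopen raises for 4xx/5xx; the code is what we want here
        return err.code
```

Calling classify_status(fetch_status(url)) for each removed URL gives a quick list of anything that is not yet "gone". Note that a "soft 404" (an error page served with status 200) would show up as "still live".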
-
So someone was posting articles on your site that linked to other sites like paid links?
If you removed the posts, there is no need to block them in robots.txt: they no longer exist, so they will not get crawled anymore. Yes, definitely request removal with the WMT URL removal tool and get those pages out of Google's index ASAP.
You're probably OK. Just keep your fingers crossed, keep an eye on rankings, and run a tight ship so that it doesn't happen again; this is definitely something you can get penalized for. Good thing you caught it quickly.
EDIT: if you meant that spam comments got posted live or approved by the admin, then all you can do is remove the spammy posts and make sure your comment settings require admin approval before anything gets posted live. No need to block in robots.txt or remove URLs in that case, but it doesn't hurt. If the links are off your site, you should be fine.
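On the robots.txt point: a Disallow rule stops crawlers from fetching the removed URLs at all, so it is worth knowing exactly which URLs the quick-fix rule covers. Here is a small sketch with Python's standard urllib.robotparser. The robots.txt content and the URLs are hypothetical, since the real file was not posted here.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules; substitute the real robots.txt content.
ROBOTS_TXT = """\
User-agent: *
Disallow: /community/spam-section/
"""


def blocked_urls(robots_txt, urls, user_agent="Googlebot"):
    """Return the URLs that `user_agent` may not fetch under `robots_txt`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [url for url in urls if not parser.can_fetch(user_agent, url)]


# Hypothetical URLs: one removed spam post, one normal page.
removed = [
    "http://www.example.com/community/spam-section/cheap-jerseys-1",
    "http://www.example.com/about",
]
print(blocked_urls(ROBOTS_TXT, removed))
```

Any removed URL that appears in the output is hidden from the crawler, so it will not be re-fetched to discover that the page is gone.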