How do you deal with comment spam: wordpress?
-
I have akismet installed on my Wordpress blog, and it does a great job of filtering the spam comments, but for some reason my site (and server) gets slammed by the amount of spam comments akismet blocks. If I check my spam folder there will be over 100 spam comments in an hour. (which in turn puts a load on my server.)
Does anyone have any thoughts on how to put a stop to this? (Or at least slow it down?) I know I could use a captcha, but I really don't want to put any barriers on people commenting and I don't even like using those captcha's myself.
Thoughts?
By the way, does anyone know how spam like this works? This has been going on for sometime now. Are spammers just using automated software to do this?
-
By far the best site availability monitoring tool I can recommend is Pingdom.
Signing up for an account is free to monitor one website. You can have it email you or send a text message/tweet when your site goes down. You can also configure how long your site must be out before you get alerted, and how often to be alerted while your site is still down.
Indispensable for understanding what's actually going on with your site.
Paul
P.S. Use the customizations when setting up the monitor so it's actually checking for the existence of a particular word on your page - that way you're testing whether your site is actually rendering, as opposed to just responding to a ping.
-
Thanks for the reply, very helpful info.
As far as server monitoring I don't think I have anything in place. Any suggestions?
-
Sorry - guess should have made that clearer, Rick. There will be a definite reduction in server resources used. The comment still gets partially processed in order to send it to Akismet, but that new setting tells your system to just discard it if it comes back marked as spam. That way, no database writes occur for that spam, which will definitely reduce server load (database reads and writes are fairly "expensive" in terms of added server processing needed).
Without that setting, spam comments that come back from Akismet get written to your database under the Spam table. That's a lot of extra processing for something you were going to throw out anyway.
This won't save as many resources as actually blocking the spam before it even starts to get processed (as the other suggested plugin would do) but you should notice lowered demand on your server resources with this setting. Not to mention a whole lot less crap to clean out every day, as you point out
Paul
P.S One side effect to that setting is you won't be quite as aware of just how much spam you're actually getting since you won't see a lot of it. This means a spam run against some older posts could start really hitting server resources hard but you might not be aware. (Remember, this setting doesn't eliminate the processing demands completely.)
So keep an eye on the stat that shows how many spams Akismet has handled. If you see a prolonged surge, and/or have further server load problems, it will be a signal that more drastic protection methods have become necessary.
Do you have a server monitoring/alerting system in place?
-
Thanks for your very helpful post! It was great!
I never thought of selecting the option to auto delete spam comments on posts older than a month old. Once I did that, it cut down on 80% of the spam I was getting! So thanks!
Quick question on that. Does enabling that option cut down on the server resources? In other words, let's say it cuts down on 200 spam comments a day because they are auto deleted, do those 200 spam comments still get entered in as comments and therefore use server resources? Does this just save me the step of having to go through and delete them / clear the spam folder? Or does this save a huge amount of server resources? Either way it's a huge win!
-
Just wondering if these responses helped answer your question, Rick?
If not, what else might you need clarified tht we may be able to help with?
Paul
-
Ahh... comment spam - the bane of every successful website with an active blog. It's actually a signal of your success that your getting that much spam
I fully agree though - captcha is NEVER the answer if you want to maintain high visitor engagement. You shouldn't be offloading your spam problem onto your visitors to solve. There are better options.
So let's dive in.
How the spam gets generated There are two types of comment spam: bot-generated and manual. The first is created by software "bots" that have been programmed to crawl the web looking for the scripts on a website that allow content submission e.g. comment forms, contact forms etc. The software then accesses the script directly and submits its crapload. WordPress (and othe CMSs) are especially vulnerable because these scripts have the same names on every single install - the bot only has to look for a few very specific filenames in a few standard places.
Because this is two pieces of software talking directly to each other, hundreds, or even thousands of submissions per hour can be generated. The bots generally have no limits on them, so eventually they'll consume so many server resources they degrade or even completely consume the server's ability to do the rest of it's job. (This is considered to be at least 65% of all spam.)
With manual spam, an actual human in a very cheap labour market is paid to go through the posts on a website and manually enter the crapload, entering whatever info into the fields is necessary to make the comment system think it's a legit human-generated comment.
Filtering vs Blocking
The problem with Akismet is that it is a spam filtering tool, not a spam blocking tool. Each comment is allowed to enter the blog system where it is then sent to Akismet's server to be assessed. Akismet then sends it back to your site flagged to go into your spam, moderation, or publication queue. This means each spam message receives the same processing as legit comments, so the system is still using processing and database resources for every single message received. (Even spam gets written to the database and stays there until you decide it should be deleted.)
All very processing intensive, and hence why having Akismet doesn't do anything to reduce the server load of a spam run - and may even increase it slightly.
Optimize Akismet's Settings
So what to do? First, there's a simple checkbox in Akismet settings that can make a huge difference. You can tell Akismet that if it recognizes as spam a comment to a post that's more than a month old, it should just automatically discard it instead of adding it to the spam queue and writing it into the database. This greatly reduces the database activity created by the spam, and also helps keep your spam queue clearer so it's easier recognize legit comments that might have been caught from more recent posts. (Spammers tend to focus on older posts for a number of reasons - mostly becasue they're easier to find) The clear disadvantage is that the (very) few comments falsely identified as spam will be irretrievably gone. I know this could be an issue for you as many of your posts continue to get comments for months after, but if you're clearing 100s of comments an hour, chances are that some legit comments are accidentally getting deleted already.
To enable the automatically discard function, simply go to the Akismet Configuration page under your Plugins (where WordPress.com API Key is entered). At the bottom of the page, check-mark the box for Automatically discard spam comments on posts older than a month. Remember to click the Update options button when done.
Stronger Protection
If you need more protection, you're going to need to install a plugin that intercepts the comments before they get into the system and automatically discards the ones that show the characteristics of bot-submission behaviour. Essentially the plugin analyzes how the comment was posted, rather than its content.
The best-known of these is Bad Behaviour, but it's a pretty heavy-handed solution that has been known to even block GoogleBot and hence cause deindexing of many pages. I'd call it a last-ditch solution.
I'd suggest you try WP Captcha-Free which is a small, very lightweight plugin that invisibly creates a "hash" when the comment is created that must also be present when submitted. Most spam-bots fail at this since they're submitting directly to the comment script and so are blocked before the comment really starts processing. Note that commenters must have Javascript enabled using this system. Since only 2-5% of web users don't have JS enabled, this is a reasonable tradeoff (and much better than pissing of 100% of your commenters by enforcing a captcha)
You will still want Akismet active behind this protection to catch the manually-submitted spam.
So to recap - in order to reduce your server load from spam, you need a system that BLOCKS the spam before it starts to get processed in the first place. Just doing more/better FILTERING won't help as the filtering process actually uses up even more server power. You want your server only processing what is likely to be real comments.
Sorry for the loooong reply but comment spam is a big/complicated issue and if it's approached incorrectly, you can make your problem much worse instead of better.
Fire away with the questions
Paul
-
Your blog will have a 'hook' where scripts can automatically insert comments to your site. Check your server logs - you'll probably see one form or another getting hit, a lot, or a script like xmlrpc.
If it's a form, add CAPTCHA, and that'll stop the scripts from auto-submitting.
If it's something else, consider changing permissions so the whole world can't hit it.
-
You may want to check your GA to see if this started all at once. Your site might be under a DDoS attack, but your server is holding up. That sounds like an awful lot of blocking in one hour. Sometimes the ISP has to step in if you host the server inhouse.
-
Is your blog self hosted or hosted by Wordpress.com ? If it's the one in your profile, then it's hosted by Wordpress.com which would then limit your abilities but then probably shouldn't have server "stress" issues. If it's self hosted, then you can try some of these plugins and see if they help (http://wordpress.org/extend/plugins/search.php?q=spam) Depending upon the software being used to spam your blog, one of these for sure will be able to reduce the issues. So yes, it's automated tools that do all this comment spam.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How to export Wordpress comments ONLY to a new domain
Hi Guys, We have a bit of situation here. We have a website (let's say it is www.oldsite.com) where we had more than 2000 posts. There arose a need whereby we had to move some 60-70 posts from this oldsite.com to another domain of ours (www.newsite.com). So, here is what we did: Move those 60-70 posts manually from oldsite.com to newsite.com Did a 301 redirect of each of those 60-70 posts from oldsite.com to newsite.com. Google has now started to rank the posts from the newsite.com for this. That's all good till now. Now, here comes the situation. We also want to move the comments from some of those posts of oldsite.com (some 10-12 posts out of those 60-70) to the respective posts of newsite.com. How do we do that? Do note that we are pretty comfortable with databases and to some extent PHP. Please help.
Web Design | | seocuppa0 -
Is it necessary to Remove 301 redirects from Wordpress after removing the 404 url from Google Webmaster?
There were many 404 urls in my site found by Google Webmaster. I've redirected these urls to the relevant urls with 301 redirect in wordpress. After that I removed these 404 urls from Google Index through Webmaster. "Should I cleanup these 301 redirects from Wordpress or not? ". Help Needed.
Web Design | | SangeetaC0 -
Question & Answers Forum pluggin for wordpress
I am looking to install on my website a Question & Answers Forum pluggin for wordpress Like MOZ Q&A What pluggin do you recommend for questions and answers Would be to replace the one on this site that i don't like due to some problems i have on configuration. http://www.propdental.es/dentista/
Web Design | | maestrosonrisas0 -
Optimizing a Wordpress Blog For SEO
Besides the standard Yoast for SEO plugin tweaks, I'm researching best practices when setting up a wordpress blog. In this instance, the "front end" of the site would be hardcoded static pages designed for an e-commerce site, while there would also be a /blog/ attached to the site - for blog posts, fresh content, etc.
Web Design | | GKLA0 -
.htaccess on wordpress multisites
Hi Moz fans, I have a site thats uses wordpress multisite. we have a number of sites set as subfolders i.e /en/ /de/ /us/ /fr/ ect. I need to force trailing slashes on each site as I currently have the pages being spidered as /about/ and /about and /iso-9001/ and /iso-9001. I have tried numerous things I have found online in the .htaccess and nothing is working. Can anyone HELP!!
Web Design | | imsmlouis0 -
Hey, So I know wordpress is built with a user capability, but what about databasing custom user data?
Hi everyone, so I am working on a project with a friend of mine, without getting into too much detail, here's the problem. We need users to sign in, then for instance, click a button, and then have that action record a value of 1 in a database on our server (preferably accessible through the wordpress admin interface, and in .csv format so that we can make it easy to work with) Any help with databasing, or if wordpress is already built with SOME database technology, etc, etc For instance, SEOmoz's "mozpoint" system.. Wordpress comes built with the ability for users to log in and have profiles with special access, etc, but does it have the ability to log points and values to a specific user as well? Where should I look? What should I google to figure out options? Who could I call / hire?
Web Design | | TylerAbernethy0 -
Random 302 Redirect (Wordpress CMS)
So this new project that i am working on is a redesigned CMS site using Wordpress (php based). Before i started on the project they made a few major updates that include: Changed from ID based URLS to SEO friendly URLS Added multiple languages in directories www.domain.com/en www.domain.com/fr www.domain.com/de etc... Due to the new languages they wanted to have the previous home page, www.domain.com, redirect to the proper language based on their IP address. Currently they are using a 301 redirect through a php header. So if i was visiting the site from my hometown of Cleveland, Ohio and i type www.domain.com into the browser. I would automatically redirect to www.domain.com/en and i would have the option to switch languages if needed. The issue: When i first added their site into SEOMoz the crawl returned a large amount of 302 redirects coming from their old homepage www.domain.com. So i took a look at the header calls using IE's webmaster tools and Firebug in Firefox. In both profilers it showed the same problem. Before the 301 header redirect there was a 302 redirect called first When viewing the response header it mentioned an x-pingback of some file that didn't even exist on the site: www.domain.com/xmlrpc.php This is obviously a huge issue because any link value from the old homepage will be lost due to the 302 not passing the value. I have tried search the almighty Google for help but it has gotten me no where. I have a hunch it is something to do with Wordpress but that is based on nothing but my gut. Any help is greatly appreciated. I got to get that 302 gone or changed to a 301 🙂 Regards - Kyle
Web Design | | kchandler0 -
Finally have a budget for a great seo ecommerce site but need help choosing wordpress, joomla, modx, magneto or? Thank you in advance for your generosity of time
We finally have a budget and want to dump our intuit/homestead site www.originalartbroker.com Our budget is 5k-10k but could do more if needed. I am slowly catching my competition with this homestead site that I built. But I do realize it is time to step back, figure out what is best, and hire a pro to get the job done. I am green in the seo and web development arena so please go easy on me and please help to point me in the right direction. Just went out on a limb a couple years ago playing with homestead.com site software and built what we have today. Didn't know anything about website development...it sort of just happened. I feel and know that homestead.com solution is hindering what we could be doing due to the bloated nature of the site and inability to perform such task as 301 redirects etc.... I have been able to slowly attain first page seo rankings on keywords based of the artists we carry using this po-dunk homestead platform to build my site after a ton of work education thanks to seomoz and a lot of you. But, have never asked for help and could really use some generosity of time in explaining a solution that would work best for our business. Do we just go with a wordpress site that is similar to our current setup and use their plugins? Do we use a cms software solution like magneto or joomla? We will only have 200-300 pieces at any given time. We are constantly selling and buying new pieces providing us content. We are need of a site that can perform well in terms of seo. I have heard of a lot of people talking about joomla, wordpress, and magneto. Would like to be able to have a product catalogue that ultimately sends whatever inventory we are uploading to our social sites and blogs so I don’t have to pump the product out to all of these sites. We offer free custom framing with our pieces and it would be nice to have a program that could wrap the photos of the pieces with the different frames for our customers. When I add a new piece I would like this software to have a predesigned product page that it plugs the information into. I would like it to create the url extension based of the artists name, medium used, and piece name to create unique and individual urls. I would like it to also create its own H tags throughout that product page according to the artist name description, and medium used. I would like to be able to sink this up to google merchant and other sites to carry our product. Bottom line is we sell art. We sell pieces by specific artists. We are constantly buying and selling. I need something powerful that keeps up with our content
Web Design | | forecastedinvestments1