Help Blocking Crawlers. Huge Spike in "Direct Visits" with 96% Bounce Rate & Low Pages/Visit.
-
Hello,
I'm hoping one of you search geniuses can help me.
We have a successful client who started seeing a HUGE spike in direct visits as reported by Google Analytics. This traffic now represents approximately 70% of all website traffic. These "direct visits" have a bounce rate of 96%+ and only 1-2 pages/visit. This is skewing our analytics in a big way and rendering them pretty much useless. I suspect this is some sort of crawler activity but we have no access to the server log files to verify this or identify the culprit. The client's site is on a GoDaddy Managed WordPress hosting account.
The way I see it, there are a couple of possibilities.
1.) Our client's competitors are scraping the site on a regular basis to stay on top of site modifications, keyword emphasis, etc. It seems like whenever we make meaningful changes to the site, one of their competitors does a knock-off a few days later. Hmmm.2.) Our client's competitors have this crawler hitting the site thousands of times a day to raise bounce rates and decrease the average time on site, which could like have an negative impact on SEO. Correct me if I'm wrong but I don't believe Google is going to reward sites with 90% bounce rates, 1-2 pages/visit and an 18 second average time on site.
The bottom line is that we need to identify these bogus "direct visits" and find a way to block them. I've seen several WordPress plugins that claim to help with this but I certainly don't want to block valid crawlers, especially Google, from accessing the site.
If someone out there could please weigh in on this and help us resolve the issue, I'd really appreciate it. Heck, I'll even name my third-born after you.
Thanks for your help.
Eric
-
Hi SirMax,
Thanks for your input. I appreciate it. We'll add Wordfence to our WordPress toolbox and see if that addresses the issue.
In response to previous posts, thanks to everyone for your input. We were able to apply some filters to remove the bogus bot traffic from the analytics and normalize the data, however, this did not actually resolve the issue and in my eyes is more of a BandAid fix. The evil crawlers are still there, we just can't see them.
Thanks again for all of your input.
Eric
-
Hostname filtering does not work any more. Unfortunately most of the spammers have adapted and are using your website as hostname.
For the WordPress I use Wordfence plugin( using paid version - not affiliated with them in any shape or form beyond paying for their services). In the advance blocking you can set limits on how fast and how many pages crawlers can request. You can also block by country or ip range. It can also show you live traffic with a lot of details ( a lot more then google analytic - more like server log ). It might not be the complete remedy but it can help.
-
I wish I had an answer for how to stop the bots from hitting your site at all - I don't think a good one exists, as any solutions that wouldn't also block real human traffic to your site are going to be easy for spam bots to get around. I think your best bet is just to do everything you can to keep your data as clean as possible.
-
Hi Ruth,
Thanks a bunch for taking the time to respond to my post. Great advice. This is reassuring on a number of levels, however, it doesn't address the underlying issue of how to stop these spam bots in the first place.
We've already started the process of filtering out some of this bogus data. We'll also be integrating some WordPress plugins to see if that helps. That said, if the spam bots are hitting Analytics directly, as opposed to the actual website, WP plugins won't do anything.
Anyway, I appreciate your input and advice. Thanks so much.
Eric
-
Hi Eric,
A few things to reassure you off the bat:
- For what it's worth, there is a huge, HUGE amount of crawler spam happening in the web today. Every site I work on is being hit hard with false referrals and direct visits. I know Google Analytics is working on a solution to better filter these visits out. So I wouldn't be too concerned that it is something a competitor is doing to your site, specifically - it's more likely that it's been caught up in the general wave of spam crawlers.
- It's important to note that when we talk about Google looking at bounce rate and dwell time as part of ranking your site, those numbers are specifically from clicks through from search - that's data that Google can get without using your private web analytics data as a ranking factor, which they've said repeatedly that they don't and won't do. So a bunch of direct visits with high bounce rates will NOT affect your rankings.
So, it's not dangerous, just annoying. On to how to get that data out of your reports:
- Make sure you're not filtering out spam referrers at a View level - this can cause those visits to incorrectly appear as direct traffic.
- You could set up an Advanced Segment in Google Analytics to filter out direct visits with visit times of, say, under 5 seconds. Some real traffic may get caught in that, but it will get the noise levels down.
- The best way to filter out spam bot traffic, in my opinion, is to set up hostname filtering. Here's a post on Megalytic on how to do that: https://megalytic.com/blog/how-to-filter-out-fake-referrals-and-other-google-analytics-spam. Make sure you've also got an "Unfiltered Data" View so you'll still have historic raw data if you need it.
Hope that helps! Good luck.
-
Check webserver log files, or log visits (ip address, user agent, __utma, __utmz, possibly browser fingerprint, etc...)
Analyzing those you can easily find out if the traffic is from scraping bot or humans.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Where to find "Keywords that Sent Organic Search Visits" in Google Analytics?
Hi everyone, Tried searching for this answer but couldn't find it. In the new analytics, I can see that there is a graph that shows the number of unique keywords that brought traffic to our site sorted by weeks. I can't seem to find this data in Google Analytics. I can only find the number of visits but not sorted by how many keywords, instead it is just # of visits from all of the keywords. Thanks in advance. SZ
Reporting & Analytics | | swzhai0 -
Showing significant visit in google analytics
Hello Everyone, I was checking GA account of my site and I have found there are few keywords have significant number of visits in GA. These keywords not ranking even in hundred(google SERPs).I am not understanding from where these visitors coming .Please help me out. I am attaching screenshot of those keyword.Last but not least when i check source of these keywords source are google. Thanks aScZ8WT.png
Reporting & Analytics | | Alick3000 -
Google Webmaster Preferred Domain Settings Help
I was recently running my SEO Moz report for my domain and saw a drastic drop in page rankings on the report. After some trouble shooting, I checked my Google Webmaster Tools and saw the "Preferred Domain for my Site has changed" the last week in which I saw the drastic drop. Under the "preferred domain" area, I have it set to www.marware.com but on most of the links to my site, it reads www.marware.com/ . We also have http://marware.com , but am not sure if this carries any relevance as the preferred domain is set for www.marware.com . Does the "/" make a difference in the "Preferred Domain" section? If so, how do I update it? Is there a way to confirm it is set up correctly? In the photo attached, what does "Linked Property" "Analytics Web Property" and "Associated Webmaster Tools Site" mean? Thanks in advance! T5tSCxm
Reporting & Analytics | | Khadaran0 -
Does Audience Overview in goole analytics contain visits from adwords.
Hi everyone I set up a new adwords account for a company and according to adwords we received 700 hits to the site but none showed up in analytics and no enquiries were made. Now the two accounts (adwords and analytics) weren't connected at the time but I should have been able to see the increased visits in analytics anyway as it was a very new site and had no real traffic yet. Someone at Google repeatedly told me that "Audience Overview" (unique visitors) doesn’t include adwords visits only organic traffic. Is this right? Maybe I'm wrong but I always thought it included everything well an estimation anyway. If anyone could help that would be great as it seems a little fishy. Many Thanks,
Reporting & Analytics | | digital.moretogether.com0 -
Figuring Out the Source of "direct traffic" by looking at landing page parameters
I have a client who runs an e-commerce website, and I noticed that 40% of his traffic and 25% of his sales are all attributable to Direct Traffic. At first, I tried to solve this problem by tagging all of the previously untagged links in his e-newsletter, which I expect to be very helpful. However, then I looked at the landing pages for his direct traffic, and I see that it is almost entirely filled with thousands of unique URLs that begin with a question mark followed by the name of his e-newsletter or shopping cart vendor. It would be the equivalent of having a url like the following: "www.willmarlow.com/?constantcontact=keya;sldkfjsdlfkjdf;sldkjf" If we have this amount of information in the link, shouldn't there be a way to add additional parameters to the URL to move this traffic out of the Direct column? Has anyone encountered this before? Thanks.
Reporting & Analytics | | williammarlow0 -
Finding and removing "Bad" Back Links
In the process of trying to figure out where all of the “Bad” backlinks are coming from I used the SEOmoz Site Explorer. I can see the links that may be questionable but am not sure how to determine if these are the issue causing the loss of rank or could it be something else. On Google webmasters they list Siteloki.com as the one with the most links. The count is now at 13,005. (see attached WMT report)
Reporting & Analytics | | rdominey
I first noticed this a month ago, 6,742 links and have tried contacting them with no reply, no results, I have even posted on the site asking to be removed from their listing and not response. Website: www.getyourphotosoncanvas.com I do not understand why this site is not showing up in the Site Explorer link analysis report (See attached)? Could this be some sort of hack or hidden links that Site Explorer does not see? How do I determine if this is real or not, if it is the reason that Google is demoting us? Google says that we are not being manually penalized? 5zAQq Iz9ct0 -
Stats show /blog/wp-cron.php at the top. What is it?
Hi, I have worked with websites for years but have no clue when it comes to Wordpress. We have our main website and then a Wordpress blog running in a subfolder that is only about a year old. The blog has only 7 posts so you can see how small it is vs main website with 200 pages. Usually our main index page of the site is at the top of the stats with the most views and this page /blog/wp-cron.php is about 30% lower. Now suddenly over the last month this page has jumped to the top and accessed almost as much as the home page of the site. We took a big hit with the latest Google Update so we are tyring to determine if there is anything technical in our site that has caused an issue. Thanks in advance Force7
Reporting & Analytics | | Force70 -
How much direct traffic is really direct?
Does anyone else think that a large chunk of traffic labelled as "Direct" in your analytics isn't direct at all. When you analyse traffic trends it seems that a large percentage could just be browsers with their referring URL hidden so it only appears direct. Here's the evidence: When we've been affected by major search algorithm changes, we've seen big changes in direct traffic as well as organic, but not in referral traffic. If direct traffic is just bookmarks, typed-in URLs, and people clicking through from emails why is direct traffic 85% new visitors? We don't do any offline advertising, so you'd expect genuine direct traffic to be returning visitors -- either our brand loyalists or subscribers to our email newsletters. If you segment direct traffic into new and returning visitors and look at a major algo update as discussed in 1), you find all the drop in direct traffic is from New Direct visitors, with no drop at all in Returning Direct visitors. Can anyone explain who these New, Direct visitors are if not simply mislabelled new, search visitors. Cookie deletion can't be the problem (ie: they can't be Returning, Direct really) because the traffic doesn't behave like returning, direct (that is, it varies too much). I'd be really interest to hear theories, and whether anyone has any figures on the extent of HTTP referrer blocking.
Reporting & Analytics | | Dennis-529610