Filters for referral bots ineffective
-
I'm not sure what I'm doing wrong here. I've been trying to block spam from my client's analytics for over a month and I just can't seem to get it right. I am beyond frustrated.
Here is a list of the filters I have:
And here is referral traffic from them regardless:
Specifically this is what the filter looks like. Am I doing something wrong here?
I have also put a filter in place to block traffic from Russia, and it doesn't seem to be working 100% either.
On top of that, it seems anytime I do manage to block something, another site just pops up in its place. Is it endless?
My clients are very small local businesses, and this is absolutely ruining their analytics. I can't even show them their traffic reports, since the reports are meaningless with all this spam. Any help would be greatly appreciated.
-
Thank you for the quick response. Made the changes you suggested and hopefully over the next few days I'll see the effect.
-
The filter verification takes a small sample of data and sometimes in that sample there won't be any change.
Another reason you may get this message is that the filter was saved before the verification; in that case you will have to delete the filter and create it again for the verification to work. Better yet, you can use the filter expression in a segment. It will give you a better idea of how the filter will work, since the segment is not limited to a few days.
If you try to exclude by the offender, don't use Referral as the filter field; use Campaign Source.
For the filter based on valid hostnames, there is no need to enter the domain both with and without "www"; clientsite.com will match both.
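To see why one entry covers both forms, here is a minimal Python sketch. It assumes the filter pattern behaves like an unanchored (partial-match) regular expression, which `re.search` only approximates; the hostnames and the helper name `is_valid_hostname` are made up for illustration.

```python
import re

# Hypothetical include pattern for the valid-hostname filter.
# Note there is no separate "www.clientsite.com" entry.
pattern = re.compile(r"translate\.googleusercontent\.com|clientsite\.com")

def is_valid_hostname(hostname: str) -> bool:
    """Return True if the pattern matches anywhere inside the hostname."""
    return pattern.search(hostname) is not None

# Made-up sample of hostnames as they might appear in a report.
hostnames = ["clientsite.com", "www.clientsite.com", "(not set)", "spammer.example"]
kept = [h for h in hostnames if is_valid_hostname(h)]
print(kept)
```

Because the match is not anchored, "clientsite.com" is found inside "www.clientsite.com", so the shorter entry is enough.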
Hope that helps,
-
I've tried the method you suggested for one of my client's sites. I created a new filter, did "Include" -> "Hostname" and added the following:
translate.googleusercontent.com|www.clientsite.com|clientsite.com
It said that it wouldn't change any of the data when I went to verify the filter, which should not be the case. Most of the traffic is from hostnames that are not set. I tried this input as well after reading a different article on the subject:
translate.googleusercontent.com|www.clientsite.com|clientsite.com
Once again to no effect.
So I tried to tackle the referral spam itself (although for this particular site, direct traffic seems to be getting a fair amount of spam as well). I created two filters ( http://imgur.com/ShhAYqv ) that included every spam referral I've ever gotten:
adviceforum.com|social-buttons.com|semalt/.com|free-share-buttons.com|buttons-for-website.com|best-seo.com|darodar.com|semalt.com
dailyrank.net|sitevaluation.org|trafficmonetize.org|4webmasters.org|100dollars-seo.com|webmonetizer.com|event-tracking.com|.*(social-buttons.com)
And, as you can see in the picture, it says there would be no effect.
Aghhhhhh! What am I doing wrong here?
-
I have recently followed Carlos' blog posts and have to say the steps worked a treat. Hopefully the Google Analytics team are working on a way to transfer filters across different analytics accounts, to make it easier to keep all sites free of referral spam.
-
Hi Kat,
If you want to exclude the spammer, you should use Campaign Source as the filter field. Instead of a predefined filter, use:
- Filter Type: Custom > Exclude
- Filter Field: Campaign Source
- Filter Pattern box: enter the spammer you want to exclude. You don't need to create a different filter for each one; you can create an expression like this:
4webmasters\.com|free\-share\-buttons\.com|trafficmonetize\.org|morespammers
The backslashes before the dots "." and the dashes "-" are needed because custom filters use regular expressions. You can keep adding spammers up to 255 characters; after that, you have to create another filter.
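As a rough illustration of the escaping and the 255-character limit described above, here is a small Python sketch. The helper `build_filter_patterns` is hypothetical and not part of any Google Analytics tooling; it only prepares strings you would paste into the Filter Pattern box by hand, and it assumes Python's `re.escape` output is acceptable to the filter.

```python
import re

MAX_PATTERN_LENGTH = 255  # per-filter limit mentioned in the post above

def build_filter_patterns(domains, max_length=MAX_PATTERN_LENGTH):
    """Escape each domain, join with '|', and start a new pattern
    whenever adding the next entry would exceed max_length.
    A single entry longer than max_length is still emitted on its own."""
    patterns, current = [], ""
    for domain in domains:
        escaped = re.escape(domain)  # adds backslashes before '.' and '-'
        candidate = escaped if not current else current + "|" + escaped
        if current and len(candidate) > max_length:
            patterns.append(current)
            current = escaped
        else:
            current = candidate
    if current:
        patterns.append(current)
    return patterns

# Spammer domains taken from the thread; extend the list as needed.
spammers = ["4webmasters.com", "free-share-buttons.com", "trafficmonetize.org"]
patterns = build_filter_patterns(spammers)
for number, pattern in enumerate(patterns, start=1):
    print(f"Filter {number}: {pattern}")
```

Each printed line would go into its own Exclude filter on Campaign Source.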
If you want a better solution, I recommend trying a filter based on your hostnames.
This solution requires a little more time to set up, but it has three huge advantages over the Campaign Source solution:
- You will stop the spam before it hits you. Adding a filter for a referral after you see it will stop it, but by the time you apply the filter you will already have hits from the spam.
- You will need only ONE filter to stop all ghost spam, instead of creating various sets of filters.
- Lately, some of the spammers (e.g. free-social-buttons) have been hitting GA accounts with fake direct visits along with the referral. The filter for the referral won't stop the direct visit; the valid hostname filter, on the other hand, will stop ghost spam in any form, whether it shows as a referral, keyword, or direct visit.
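The difference between the two approaches can be sketched in Python with a few made-up hits. This is only a simulation under stated assumptions: the hit data is invented, ghost spam is assumed to arrive with a fake or unset hostname, and `re.search` stands in for the filter's partial-match behavior.

```python
import re

# Invented hits as (hostname, campaign source) pairs.
hits = [
    ("www.clientsite.com", "google"),
    ("clientsite.com", "(direct)"),
    ("(not set)", "free-social-buttons.com"),  # ghost referral, already known
    ("(not set)", "(direct)"),                 # ghost direct visit
    ("spam.example", "newspammer.example"),    # spammer not on the list yet
]

valid_hostname = re.compile(r"clientsite\.com")
known_spammers = re.compile(r"free\-social\-buttons\.com")

# Exclude filter on Campaign Source: drops only spammers already listed.
after_source_exclude = [h for h in hits if not known_spammers.search(h[1])]

# Include filter on Hostname: keeps only hits that name a valid hostname.
after_hostname_include = [h for h in hits if valid_hostname.search(h[0])]

print(len(after_source_exclude), "hits remain after the source exclude")
print(len(after_hostname_include), "hits remain after the hostname include")
```

In this toy data the source exclude still lets the fake direct visit and the new spammer through, while the hostname include keeps only the two real visits.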
This is what I've been using on my accounts for the last months, and I haven't received a single hit of ghost spam, which accounts for the vast majority of this spam.
Here is a detailed guide with more information about all these solutions and spam in Google Analytics:
http://www.ohow.co/what-is-referrer-spam-how-stop-it-guide/
Hope it helps,
-
I've tried setting up filters in the manner you explain on your site, and yet when I go to verify it, it says that it wouldn't affect my data. I most certainly have spam data from the last seven days so I'm uncertain why it's not working.
I entered the names just as you did in your article, excluding by Campaign Source:
dailyrank|sitevaluation|trafficmonetize|4webmasters|100dollars-seo|webmonetizer|event-tracking|free-social-buttons
-
You need to set the exclusion for campaign source, and not the ISP domain.
I highly recommend setting an include-only filter for your own hostname.
https://www.marceldigital.com/blog/2015/06/stop-ghost-spam-in-google-analytics