Stripping Out Referral Spam From Past Reports
-
Hi,
I'm looking to confirm the best approach for retroactively stripping away referral spam (free buttons, SEMalt, etc.). Now to be clear, I already have filters in place to ignore them from current stats, so moving forward I'm fine. However, I'd love to go back and check untainted stats.
I've setup segments using a regex to strip the root words away and it seems to be working. I have a regex setup to strip out things like: social-buttons|seoanalyses|copyrightclaims|classifiedads|jobsense|free-share-buttons|e-buyeasy|acrobats.hol|cheap-online|amezon|search-help|qut-smoking and so forth.
I've been going through my referral data, noticing obvious spam, and adding their domains to my segment. Is this the optimal way for me to get a clear, untainted view of my past stats?
-
Sweet, glad to hear our filters will suffice. Thanks for the input, Daniel.
-
Hey, no worries and you're right that your filters should block them as well. Using .htaccess would be just an additional defense mechanism but may not be necessary.
-
Hi Daniel,
Thanks again for the response. What would be the difference in Analytics data between my filters and going straight to .htaccess? If the data is the same, is there an additional benefit to .htaccess?
For regular users, I'd suspect less bandwidth since they can't load my domain, but I don't think these bots actually load the page or visit.
-
I would use your .htaccess file to block them with the following code (this would for example block referrals from semalt.com and semalt.com subdomains):
RewriteEngine On
Options +FollowSymlinks
RewriteCond %{HTTP_REFERER} ^https?://([^.]+.)*semalt.com\ [NC,OR]
RewriteRule .* – [F]
You can also use .htaccess to block IP addresses associated with the spammy sources.
edit: just saw your edit but hope this helps nevertheless!
-
Hi Daniel,
Thanks for the additional tips. I do have the bot filtering feature enabled as another point of protection. I checked my referral exclusion list and apparently set this up about a year ago for the initial wave of referral bots I noticed. I didn't know it added them to direct.
The majority of my spam referral hosts have been added to regular filters. I think with the combination of my retroactive approach and new filters, I should have reliable data going forward.
-
Hi there,
You’re on the right track and the best way to retroactively remove spammy sources is through report filters and advanced segments.
A couple other notes:
- A good way to spot spammy referrers is to sort by bounce rate and eliminate any with 100% bounce and over 10 sessions.
- Avoid using the “referral exclusion list” since this will just count spam traffic as direct traffic instead.
- You should also enable the GA ‘bot filtering’ feature under ‘Reporting view settings’ as seen here
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Spam Direct Traffic
Hello, Lately, I have been receiving a big amount of unexpected direct traffic from Boston. After analyzing with Analytivs, this is what I get (please, check attachment). Normally I would be blocking this traffic source straight away from my Google Analytics account, and also blocking this traffic from accesing my servers, but check out the analytic metrics: this traffic represents 12% of my total traffic right now!!! av. session duration is 4:53 !! bounce rate is 72% !!!! pages/session 1.44 !! Service provider is "Microsoft Corporation" who looks like one of the typical spammy service providers. My question is, is this a bot?? what do you think ? Thanks, Luis zUlVHIi
Reporting & Analytics | | Yeeply.com1 -
Direct / (none) Spam Traffic Help
In July 2015, we experienced an over 1,000% increase in traffic and it has remained like that ever since. It's all spam traffic and I have no clue how to get rid of it. I added in your typical .htaccess blocks from known culprits with little to no effect. Read up on Ghost traffic and applied filters to no effect. The spam is completely distributed as far as I can tell both geographically as well as by network providers. Where once we had pretty decent bounce rates of around 50%, now, since all my Analytics data is meaningless - it's around 90%. I could apply a filter but beyond my GA account providing no insights, I'm also concerned about the increased use of server resources. I'd ideally like to stop the traffic completely. The only distinguishing feature of the traffic that I have been able to determine is browser size. Comparing June 2015 to July 2015 we saw the following: Browser size visits: 620 x 460 = 6,828 vs 0, 610 x 450 = 175 vs 0, 1330 x 630 = 71 vs 1, 1890 x 940 = 67 vs 0, 780 x 580 = 58 v 5. Other than that, I can find no unifying theme to the traffic beyond being traffic hitting our homepage and having no medium. Nothing special that I am aware of happened in July. We didn't do any sort of...really anything. We did have our network compromised by ransomware in the beginning of June, which we promptly ignored and restored backups - at no point did we try to contact the criminals, but I am doubtful there is any connection considering that our website is remotely hosted. If anyone has any suggestions or has seen anything like this before, please let me know. spam-traffic.jpg
Reporting & Analytics | | Nivik230 -
Google Analytics goals by source report?
Hello everybody. Is there way in Google analytics to create report on what goals have been completed per each source? Example: Lets say I have 3 goals: Subscription, Purchase, Quote. How can I get report, saying something like this: google / organic - Subscription - 5 conversions
Reporting & Analytics | | DmitriiK
Purchase - 3 conversions
Quote - 10 conversions and so on. P.S. Basically, I want the reverse of standard Google Analytics goal completions report, where you can click on goal and see which sources/mediums completions came from. I'd like to do the opposite - "click" on source/medium and see which goals have been completed. Thanks0 -
Google Analytics Landing Page Report Discrepancy
I have noticed that when I run a landing page report and use the advanced option so I can view only the landing pages that include a particular string in the URL, have noticed that I in the report, the graph at the top will say one thing, but the data below says something else. For example, the graph for one particular search shows 200 Impressions, but the info below says 700 impressions and 610 clicks. Anyone seen anything similar or have any ideas why? Thanks! Craig
Reporting & Analytics | | TheCraig0 -
Google Ad referral
I was wondering if someone could decode the jumble of a referral - this is supposedly the referal that led to a click through to my site via a product listing ad. I am trying to figure out how www.nextag.com comes in to the picture as we do not have refurbexperts even listed there? Thanks to anyone who tries/does work it out. http://www.googleadservices.com/pagead/aclk?sa=L&ai=CGXud6DmDU_qeL5THygHpuICwCaTZwMYD_Nvvv0bEwMS50wEIBhAEIOn5-gEoBVCl7P7f-v____8BYMnu8omYpPQSoAHAhIv9A8gBB8gDG6oEJ0_QwcNc5zNun_d7S5KNcMT6uPjjH_mMDkKFFgBCQ6aKICRPJVVa7MAFBYgGAaAGJoAHqPv0ApAHAeASupqdo-ypit0m&ohost=www.google.com&cid=5GhZEzUCSC6x9n2wxOdz3-mrAfSUkvHKPN3wD5yLInnlNil_&sig=AOD64_1D1z1JPYbFP0UnUglJVOfvd25RfA&adurl=http://refurbexperts.com/product/527/HP-LaserJet-P2015-Laser-Printer-RECONDITIONED%3Futm_source%3Dproductlistingads%26utm_medium%3Dadwords%26utm_campaign%3Dadwords&ctype=5&nb=0&res_url=http%3A%2F%2Fwww.nextag.com%2Fhp-p2015-laserjet%2Fproducts-html%3Fnxtg%3D116d0a1c0504-9FFEB16DE52A7E2A&rurl=http%3A%2F%2Fwww.nextag.com%2Fgoto.jsp%3Fp%3D3652%26search%3Dhp%2520p2015%2520laserjet%26t%3Dag%253D1384181795%26crid%3D48271786%26gg_aid%3D20169721025%26gg_site%3D%26gclid%3DCjgKEAjwzIucBRDzjIz9qMOB3TASJABBIwL1LHK7GcAPS6yHGpd9Kq3wsZrcPORAWD8QCWivr4W75PD_BwE&nm=11&nx=43&ny=12&is=700x181&clkt=187
Reporting & Analytics | | henya0 -
Mysterious Referral Link
This keeps coming up in some of our top Referrals on our website in Analytics. Does anyone know what it is? We have tried researching it and have had not luck. http://vizedhtmlcontent.next.ecollege.com/
Reporting & Analytics | | TracSoft0 -
A lot of traffic to one page from Google referral
We recently received a lot of traffic to one page from
Reporting & Analytics | | underthesun808
google.com referral. When I look in analytics it reports that the traffic is
coming from /url that’s not real helpful. Is there a way to get more specific
information as to what the referring url was?0 -
Best practice SEO/SEM/Analaytics/Social reports
Hi All, does anyone have a best practice excel spreadsheet of a internal report we should be using.... ie what are the main factors we should be tracking? Unqiue views? time spent on site? Where they came from? seo/sem/network/direct to site? social media tracking? amount of +1/fb likes/tweets etc thanks
Reporting & Analytics | | Tradingpost0