Stripping Out Referral Spam From Past Reports
-
Hi,
I'm looking to confirm the best approach for retroactively stripping away referral spam (free buttons, SEMalt, etc.). Now to be clear, I already have filters in place to ignore them from current stats, so moving forward I'm fine. However, I'd love to go back and check untainted stats.
I've set up segments using a regex to strip the root words away and it seems to be working. The regex strips out things like: social-buttons|seoanalyses|copyrightclaims|classifiedads|jobsense|free-share-buttons|e-buyeasy|acrobats.hol|cheap-online|amezon|search-help|qut-smoking and so forth.
I've been going through my referral data, noticing obvious spam, and adding their domains to my segment. Is this the optimal way for me to get a clear, untainted view of my past stats?
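For anyone who wants to sanity-check a pattern like this before pasting it into a segment, here is a minimal Python sketch. It is only an illustration: the helper names (`is_spam_referrer`, `split_referrers`) are hypothetical, and Python's `re` module may not behave identically to the regex engine Analytics uses. The root words are copied from the pattern above, and `re.escape` is used so that the dot in an entry like acrobats.hol matches a literal dot rather than any character.

```python
import re

# Root words copied from the segment pattern in the post above.
SPAM_ROOTS = [
    "social-buttons", "seoanalyses", "copyrightclaims", "classifiedads",
    "jobsense", "free-share-buttons", "e-buyeasy", "acrobats.hol",
    "cheap-online", "amezon", "search-help", "qut-smoking",
]

# re.escape makes "." match a literal dot; IGNORECASE mirrors a case-insensitive match.
SPAM_PATTERN = re.compile(
    "|".join(re.escape(root) for root in SPAM_ROOTS),
    re.IGNORECASE,
)


def is_spam_referrer(source: str) -> bool:
    """Return True if any spam root word appears anywhere in the referrer source."""
    return SPAM_PATTERN.search(source) is not None


def split_referrers(sources):
    """Split an iterable of referrer sources into (clean, spam) lists, preserving order."""
    clean, spam = [], []
    for source in sources:
        (spam if is_spam_referrer(source) else clean).append(source)
    return clean, spam
```

Running `split_referrers` over an exported list of referral sources gives a quick view of which hosts the pattern would exclude and which would slip through, before the segment is applied to historical reports.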
-
Sweet, glad to hear our filters will suffice. Thanks for the input, Daniel.
-
Hey, no worries and you're right that your filters should block them as well. Using .htaccess would be just an additional defense mechanism but may not be necessary.
-
Hi Daniel,
Thanks again for the response. What would be the difference in Analytics data between my filters and going straight to .htaccess? If the data is the same, is there an additional benefit to .htaccess?
For regular users, I'd suspect less bandwidth since they can't load my domain, but I don't think these bots actually load the page or visit.
-
I would use your .htaccess file to block them with the following code (this would for example block referrals from semalt.com and semalt.com subdomains):
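RewriteEngine On
Options +FollowSymLinks
# Match semalt.com and any subdomain, over http or https, case-insensitively
RewriteCond %{HTTP_REFERER} ^https?://([^.]+\.)*semalt\.com [NC]
# Return 403 Forbidden for matching requests
RewriteRule .* - [F]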
You can also use .htaccess to block IP addresses associated with the spammy sources.
edit: just saw your edit but hope this helps nevertheless!
-
Hi Daniel,
Thanks for the additional tips. I do have the bot filtering feature enabled as another point of protection. I checked my referral exclusion list and apparently set this up about a year ago for the initial wave of referral bots I noticed. I didn't know it added them to direct.
The majority of my spam referral hosts have been added to regular filters. I think with the combination of my retroactive approach and new filters, I should have reliable data going forward.
-
Hi there,
You’re on the right track and the best way to retroactively remove spammy sources is through report filters and advanced segments.
A couple other notes:
- A good way to spot spammy referrers is to sort by bounce rate and eliminate any with 100% bounce and over 10 sessions.
- Avoid using the “referral exclusion list” since this will just count spam traffic as direct traffic instead.
- You should also enable the GA ‘bot filtering’ feature under ‘Reporting view settings’.
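The bounce-rate check in the first note can be sketched in Python, assuming the referral report has been exported into rows with a source, a session count, and a bounce rate as a percentage. The `ReferrerRow` type and `suspicious_referrers` function are hypothetical names used only for illustration, and the output is a list of candidates to review by hand, not a definitive spam list.

```python
from dataclasses import dataclass


@dataclass
class ReferrerRow:
    source: str
    sessions: int
    bounce_rate: float  # percentage, 0-100


def suspicious_referrers(rows, min_sessions=10, bounce_threshold=100.0):
    """Return sources with a bounce rate at or above the threshold and more
    than min_sessions sessions, sorted by session count (highest first)."""
    flagged = [
        row for row in rows
        if row.bounce_rate >= bounce_threshold and row.sessions > min_sessions
    ]
    flagged.sort(key=lambda row: row.sessions, reverse=True)
    return [row.source for row in flagged]
```

Sources flagged this way can then be added to the segment or filter pattern once they have been confirmed as spam.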