PDF web traffic hitting our site
-
Hi there,
Over the last few months our traffic has spiked due to irrelevant pdf documents sending us crap traffic, our bounce rate is sky high as well as other metrics. I don't want to just filter out this traffic in GA rather try and stop our site from being attacked.
Any advice on a way forward would be great.
Thanks
-
Based on this I don't think you have anything to worry about. It doesn't appear to be an attack, as you described in your original post. An actual attack on your website would have much higher volume. The worst this could possibly be is spam, which is mainly just annoying.
Easy solution: you don't want to filter out this traffic from GA because it may be useful at some point. So just create another view in GA, and name it "unfiltered". This view will have no filters and you can see all traffic in its raw glory. In your main view, name it something like "master" or "the one view to view them all" or whatever you want and set filters to remove that traffic from view.
Personally it looks more to me like these are old pdfs that other websites are linking to, which is what your hosting provider has also said. Your best move here is actually to setup redirects to relevant pages to recapture some of those links that are probably ending in 404s and get some link equity to important pages.
-
HI Alick, seems to be coming from an external source, I've included a screen grab for you too.
I've also discussed this with our hosting provider who gave the following response:
Thanks for the info from Webmaster Tools. That screenshot that shows the HTTP response is just showing that a request to http://www.icmp.co.uk/lulu-the-lioness-a-heroines-story.pdf throws a 301 redirect over to https://www.icmp.ac.uk/lulu-the-lioness-a-heroines-story.pdf — this runs because of the standard HTTPS/primary domain redirect code in settings.php and unfortunately doesn’t tell us much here.
I pulled down the database again and ran a search for a few of these filenames, and those came up empty. Looks like these don’t touch Drupal at all. When we saw them in the database before, in the sessions table, that was likely just because that filter module was storing browser history in user session data for some reason.
I did a little research here, and I think that leaves a few potential causes:
Another site is linking to these files (even though they don’t exist), and this is where Google is picking up/indexing the URLs from. This should be checkable in Google Analytics if you look at Referrals to those files.
These were listed on the sitemap at some point (but not any longer: https://www.icmp.ac.uk/sitemap.xml).
These files existed at some point in the past, but have since been deleted.
There was a DNS misconfiguration at some point, and that domain name was pointing to a different server where these files did exist.
While these are a little annoying to see in Analytics, from what I’ve read, 404s don’t negatively impact the site from an SEO standpoint, and there’s no evidence that the site itself is compromised at all, so unless we see evidence otherwise, I wouldn’t worry about these.
-
Hi,
Pdf trafic from your own site or other sites?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Paid traffic or "Paid Search" is not showing in my Google Analytics
Hi, I have two campaigns running in Google Adwords or Google Ads now and I saw in Google Ads account that I had 5 clicks today (09/18/2018) but when I try to search for this clicks in my Google Analytics in ACQUISITION > All Traffic > Channels I don't find nothing about "Paid Search" or something like that. Bellow is a picture of my Google Analytics account to prove it. The accounts are linked and I can find the 2 campaigns in the Analytics. How can I interpret this picture? Where the paid traffic is showing? or not showing there? Thanks Leandro uvAtrsg
Reporting & Analytics | | lmoraes0 -
How do sites without access to a site's analytical data, determine a site's organic traffic?
I've recently used a organic traffic checker that showed you your traffic compared to each google algo update. I was interested in how they derived the organic traffic totals for each month, without having access to our site's google analytics? I've since compared the data to historical google analytics data and it's not wrong, isn't 100% match either but isn't far from fact. So if they're predicting or making a guess, it's rather spot on, site crawlers and SERPs snapshots only provide so much info, I'm just wondering where they get the rest from and how?
Reporting & Analytics | | Deacyde0 -
Referral Traffic from Google
Hello, I have a question about my company's new website. I've worked in SEO and studied Google Analytics results for a few years now but have never really come across something like this. I started in this position in January of this year and when I started breaking down the traffic sources in Google Analytics, I noticed most of the traffic was coming from Google.com as a referral source. I had never seen Google.com as a referral source before so I looked into options for what it could be. It was not a paid ad and our organic traffic was coming through in Analytics, Before I could get any further, our new website was launched (we switched CRM's to WordPress) and the referral traffic from google went from 2,966 in January of 2015 to 22 in February 2015. for more comparison, in February of 2014, the referral traffic from Google was 2,496. I expected a drop when we switched CRM's but we correctly re-directed all pages and created a new sitemap and our organic traffic is up since the switch (not enough to cover drop in referral). I thought at first this had to do with our Google sellers account being de-activated when we made the switch, but I quickly fixed this over a month ago and no change. I'm wondering if anyone has ever seen Google.com come through as a referral source in Google Analytics and if they we're able to figure out what it actually was. This would be a great help! Thank you, Alex
Reporting & Analytics | | RASEO1 -
Can you track two Google Analytics Accounts on one site?
If you have a site that had an old analytics account and then implemented a new one is it possible to run tracking code that records to both accounts without causing your site or data issues? We are doing this so we don't loose data at any point - ideally it wouldn't have been split between the two but making one redundant isn't an option. Ideally we would have merged the data from both accounts and had one - however the research we have done points to this not being a possibility - unless one of you guys knows different? It would be great if anyone has experience on any this.. Thanks
Reporting & Analytics | | ChrisAllbones0 -
Referral Traffic vs. Campaign Traffic in Google Analytics
I have two sites: a blog and an ecommerce site. The blog funnels people to the ecommerce site. In Analytics I'm seeing declines in referral traffic from the blog to the ecommerce site. During the same time I'm seeing an increase in campaign traffic to the ecommerce site, with most campaign traffic coming from the blog. I believe the increase in campaign traffic is largely a result of simply having installed more tracking links. This leads me to believe that the declines I'm seeing in referral traffic is simply a result of the increase in campaign traffic. In other words, what was once counted and reported as being referral traffic is now being counted and reported as campaign traffic. So my question is this: In Google Analytics is campaign traffic ALSO reported as referral traffic, or is campaign traffic reported separately and not duplicated in referral traffic reports? I'll provide a concrete example to make this more clear in case it isn't: Say site X sends 1000 visits each month to site Y. Say 50 of those visits come from a single link on X. If that link is changed so that campaign Z data info added (via the Google URL Builder), would you expect to then see 950 referral visits each month from site X to site Y plus 50 campaign visits to site Y via new campaign Z, or would you continue to see 1000 referral visits plus the new 50 campaign visits? Many thanks in advance to anyone that can shed some light on this.
Reporting & Analytics | | aaronprimal0 -
Amazon.com inc.increase in direct traffic
Hi All, I have seen a increase of direct traffic from hostname amazon.com inc. This only happened on one day. Any ideas what/why it is? Thanks
Reporting & Analytics | | Sayers0 -
Ecommerce tracking problem with multiple domains on 1 web property
We recently launched 13 product marketing sites based on a Wordpress Multisite setup. To track all 13 sites we decided to use one web property, so we can use the same tracking code for all domains. To track the sites seperately, we set up a profile for each product site with a hostname filter. This works well for most stats, but eCommerce tracking is where the problem starts. Per profile we obviously want to track the products purchases on that website only. What happens now is that all purchases from all product sites are tracked on all profiles. How can we change this? Should I alter the eCommerce tracking code or can I fix this with an additional filter per profile? Thanks for your help Mozzers! Regards, Pieter van Diggele
Reporting & Analytics | | RBO
Online Marketing Executive
Elsevier Opleidingen0 -
.com version and .org version of site
So i just discovered that a site I now managae has a .com version - as well as the .org version that is the one everyone knows about! I'm guessing this is not a good thing... So the whole site eg www.abc.org/example has a mirror page www.abc.com/example.... What should I do about this? Is it really bad to have 2 versions out there? Thanks!
Reporting & Analytics | | inhouseninja0