PDF web traffic hitting our site
-
Hi there,
Over the last few months our traffic has spiked due to irrelevant pdf documents sending us crap traffic, our bounce rate is sky high as well as other metrics. I don't want to just filter out this traffic in GA rather try and stop our site from being attacked.
Any advice on a way forward would be great.
Thanks
-
Based on this I don't think you have anything to worry about. It doesn't appear to be an attack, as you described in your original post. An actual attack on your website would have much higher volume. The worst this could possibly be is spam, which is mainly just annoying.
Easy solution: you don't want to filter out this traffic from GA because it may be useful at some point. So just create another view in GA, and name it "unfiltered". This view will have no filters and you can see all traffic in its raw glory. In your main view, name it something like "master" or "the one view to view them all" or whatever you want and set filters to remove that traffic from view.
Personally it looks more to me like these are old pdfs that other websites are linking to, which is what your hosting provider has also said. Your best move here is actually to setup redirects to relevant pages to recapture some of those links that are probably ending in 404s and get some link equity to important pages.
-
HI Alick, seems to be coming from an external source, I've included a screen grab for you too.
I've also discussed this with our hosting provider who gave the following response:
Thanks for the info from Webmaster Tools. That screenshot that shows the HTTP response is just showing that a request to http://www.icmp.co.uk/lulu-the-lioness-a-heroines-story.pdf throws a 301 redirect over to https://www.icmp.ac.uk/lulu-the-lioness-a-heroines-story.pdf — this runs because of the standard HTTPS/primary domain redirect code in settings.php and unfortunately doesn’t tell us much here.
I pulled down the database again and ran a search for a few of these filenames, and those came up empty. Looks like these don’t touch Drupal at all. When we saw them in the database before, in the sessions table, that was likely just because that filter module was storing browser history in user session data for some reason.
I did a little research here, and I think that leaves a few potential causes:
Another site is linking to these files (even though they don’t exist), and this is where Google is picking up/indexing the URLs from. This should be checkable in Google Analytics if you look at Referrals to those files.
These were listed on the sitemap at some point (but not any longer: https://www.icmp.ac.uk/sitemap.xml).
These files existed at some point in the past, but have since been deleted.
There was a DNS misconfiguration at some point, and that domain name was pointing to a different server where these files did exist.
While these are a little annoying to see in Analytics, from what I’ve read, 404s don’t negatively impact the site from an SEO standpoint, and there’s no evidence that the site itself is compromised at all, so unless we see evidence otherwise, I wouldn’t worry about these.
-
Hi,
Pdf trafic from your own site or other sites?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Self-Reffering Traffic After upgrading to Universal Analytics
Backstory: We have always had an issue with self-referring traffic but in waiting for the UA upgrade we put it on the backburner for getting fixed. We have now upgraded to UA and I was under the impression that GA would automatically exclude the domain associated with a property as a referral source. However this is not what I am seeing under my referral traffic source. With 10 websites having issues with this I need some help. Should I use the referral exclusion list? Also on a handful of our sites we have region specific URLs, I am also seeing these come in as self-referring traffic. I should also mention that about 85% of our sales are being attributed to the self-referring traffic. Here are two sites for example sake: ZootSports.com and K2snowboarding.com
Reporting & Analytics | | K2_Sports0 -
Google Analytics: How to Track Blog Traffic that Enter the Purchase Funnel?
I've been trying to figure this out for awhile, but I have had no luck. The current ecommerce store that I work for is trying to find out how to track how many people coming in via the blog are converting/buying. The site lives on Magento and the blog is on wordpress and they both use the same Google Analytics code. Site URL: http://website.com/ Blog URL: http://website.com/blog Is there anyway to do this so you can see which landing pages are driving conversions? If not, Is it possible to set up Google Analytics to show conversions and revenue coming from people who enter through blog directory?
Reporting & Analytics | | Erik-M0 -
Had suspicious spike in Adsense clicks, next day site ranking tanks
Yesterday, one of my sites had extreme Adsense clicks for several hours in the morning, which brought it up to CTRs of around 120%. My normal CTR is about 10-15%. It added several hundred dollars income over and above my normal amount. After that, it went back to normal. I have waited to see if Google would adjust the income down, as someone or some bot seemingly clicked the heck out of the site's ads. Nothing has been adjusted; it's been 24 hours. Question #1: what usually causes this type of insane clicking to occur (i.e. competitors messing me?) Then, today I noticed something else disturbing. I cannot find my site in the top 100 SERPs for the main keyword. I was at #1 for a couple years, then, when I changed themes from Thesis to Genesis (site otherwise exactly the same) a couple months ago, I bounced around various positions on the first page. In the last couple weeks we've been bouncing between the teens and the thirties. Two days ago we were at #15. (the site is still indexed when I use "site:" to check. It seems awfully coincidental that yesterday I had the Adsense click explosion, and today I'm not even in the top 100 for the first time in my pretty stable two-year history, and have no idea how far behind 100 I am. I went to Google Webmaster Tools and see no errors or warnings relating to this. Adsense has not sent me any messages. So... Question #2: does Google search apply some sort of penalty to site that have suspicious Adsense clicking? By the way, I don't have any funny business going on with any bad SEO practices, it's all above board, and I have thousands of real readers each day Liking and commenting on the pages. It's a very real site. Note: I have been checking the ranking each day via a Google Incognito window and searching for the term. Of course I use MOZ but I do the Incognito search for a quick real time check, which I've found to be accurate.
Reporting & Analytics | | bizzer0 -
Loss of referral traffic after change of CMS, host
I have a client that changed from MoveableType to Wordpress. He also changed from a dedicated server to WP Engine. He may have blocked search engines for a week or two, so his organic traffic is down but only by 25%. He's 301 redirecting all of the old pages. The mystery is that his referral traffic from Google is down 90%. It's a popular blog, so that's thousands. It's been going on a month now. Anyone seen this before?
Reporting & Analytics | | Hyper-Dog0 -
Our Firefox organic traffic seems to have been re-allocated to direct - anyone know why?!
In Google Analytic, the majority of organic traffic via Firefox browsers to one of our websites suddenly dropped off, and an immediate lift in direct traffic via Firefox browsers appeared. This trend has continued ever since. I've searched to find Firefox releases that might have affected it (eg. security/cookie settings), but haven't been able to find anything, or talk of this happening to other people, and the anomaly doesn't appear in our other website analytics either...anyone have any ideas?! The attached GA graphs show Firefox browser versions 21.0 and 22.0, but it's the same story across all versions. (You can also see that I'd also filtered out traffic via IOS 6 and Android 4 operating systems to remove any effect from these known issues). Thanks for any help that you can offer! kPRnrcd.png dy0Bjg7.png
Reporting & Analytics | | HubMDP1 -
Webmaster tools traffic on one keyword dropped through the floor - ideas?
Hi there, We design and sell our own product range in a narrow niche, and we are also stocked by Amazon and a lot of other big retailers in the UK. During the first two weeks of Dec 2012 the position of one of our main keywords, which was in google SERPs on page 1 (8 or 9), dropped to page 4. The keyword describes the niche we're in. The drop is shown in the webmaster tools traffic report for that keyword. But it's the only one of our keywords where this has happened, and furthermore it hasn't happened for variations of the keyword. And in Adwords our quality score for the keyword is 10 For example say we were making and selling shopping trolleys - our keyword "shopping trolleys" has dropped through the floor, but "shopping trolleys (on) wheels" is just fine. Can anyone shed any light on what's going on here? Losing this one keyword has cost us some good organic traffic. i1uxSlB.png
Reporting & Analytics | | w1ll1am0 -
Should you get a new Google Analytics account if your site has a new domain after a site redesign/new development?
We recently developed a new site for a client and they have opted to move forward with a domain change. Should we create a new Google Analytics account for the new site?
Reporting & Analytics | | TheOceanAgency0 -
Site crawler hasn't crawled my site in 6 days!
On 4.23 i requested a site crawl. My site only has about 550 pages. So how can we get faster crawls?
Reporting & Analytics | | joemas990