PDF web traffic hitting our site
-
Hi there,
Over the last few months our traffic has spiked because irrelevant PDF documents are sending us crap traffic. Our bounce rate is sky high, and other metrics are off as well. I don't want to just filter this traffic out in GA; I'd rather try to stop our site from being attacked.
Any advice on a way forward would be great.
Thanks
-
Based on this I don't think you have anything to worry about. It doesn't appear to be an attack, as you described in your original post. An actual attack on your website would have much higher volume. The worst this could possibly be is spam, which is mainly just annoying.
Easy solution: don't filter this traffic out of GA entirely, because it may be useful at some point. Instead, create a second view in GA with no filters and name it "unfiltered", so you can always see all traffic in its raw glory. Then name your main view something like "master" or "the one view to view them all" or whatever you want, and set filters on it to remove that traffic.
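To illustrate the filter on the "master" view: GA custom filters can match the Request URI field against a regular expression. The pattern below is only an assumption (it presumes the junk hits all land on URLs ending in .pdf and that no genuine PDFs need reporting), and Python's `re` engine is not identical to GA's, though this pattern uses only common syntax. A quick sketch for testing a candidate pattern against sample paths before saving the filter:

```python
import re

# Hypothetical exclude pattern for a GA "Request URI" filter:
# matches any path ending in .pdf, with an optional query string.
PDF_URI_PATTERN = re.compile(r"\.pdf(\?.*)?$", re.IGNORECASE)


def would_be_excluded(request_uri: str) -> bool:
    """Return True if the candidate filter pattern matches this request URI."""
    return PDF_URI_PATTERN.search(request_uri) is not None


# Sample paths; only the first appears in this thread, the rest are made up.
sample_uris = [
    "/lulu-the-lioness-a-heroines-story.pdf",
    "/courses/",
    "/prospectus.PDF?utm_source=x",
    "/pdf-guide/",
]

for uri in sample_uris:
    print(uri, "->", "excluded" if would_be_excluded(uri) else "kept")
```

If any real page shows up as "excluded", tighten the pattern before applying it; the "unfiltered" view still keeps the raw data either way.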
Personally, these look to me more like old PDFs that other websites are linking to, which is what your hosting provider has also said. Your best move here is to set up redirects to relevant pages, so you recapture some of those links that are probably ending in 404s and pass some link equity to important pages.
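One way to keep such redirects manageable is a single lookup table from dead PDF paths to live pages. The sketch below is hypothetical throughout: the target `/courses/` is invented, and the generated `Redirect 301` lines assume an Apache server with mod_alias, which this thread does not confirm. Only the PDF path itself comes from the thread.

```python
from typing import List, Optional

# Hypothetical mapping from dead PDF paths to relevant live pages.
# The key is a path mentioned in this thread; the target is made up.
REDIRECTS = {
    "/lulu-the-lioness-a-heroines-story.pdf": "/courses/",
}


def redirect_target(path: str) -> Optional[str]:
    """Return the 301 target for a dead PDF path, or None to let it 404."""
    # Ignore any query string and compare case-insensitively.
    normalized = path.split("?", 1)[0].lower()
    return REDIRECTS.get(normalized)


def as_apache_rules() -> List[str]:
    """Render the mapping as mod_alias 'Redirect 301' lines (assumes Apache)."""
    return [f"Redirect 301 {old} {new}" for old, new in sorted(REDIRECTS.items())]


print(redirect_target("/lulu-the-lioness-a-heroines-story.pdf"))
print(redirect_target("/unknown.pdf"))
for rule in as_apache_rules():
    print(rule)
```

Paths that have no sensible equivalent are better left returning 404 than pointed at an unrelated page, which is why unknown paths return `None` here.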
-
Hi Alick, it seems to be coming from an external source. I've included a screen grab for you too.
I've also discussed this with our hosting provider who gave the following response:
Thanks for the info from Webmaster Tools. That screenshot that shows the HTTP response is just showing that a request to http://www.icmp.co.uk/lulu-the-lioness-a-heroines-story.pdf throws a 301 redirect over to https://www.icmp.ac.uk/lulu-the-lioness-a-heroines-story.pdf — this runs because of the standard HTTPS/primary domain redirect code in settings.php and unfortunately doesn’t tell us much here.
I pulled down the database again and ran a search for a few of these filenames, and those came up empty. Looks like these don’t touch Drupal at all. When we saw them in the database before, in the sessions table, that was likely just because that filter module was storing browser history in user session data for some reason.
I did a little research here, and I think that leaves a few potential causes:
Another site is linking to these files (even though they don’t exist), and this is where Google is picking up/indexing the URLs from. This should be checkable in Google Analytics if you look at Referrals to those files.
These were listed on the sitemap at some point (but not any longer: https://www.icmp.ac.uk/sitemap.xml).
These files existed at some point in the past, but have since been deleted.
There was a DNS misconfiguration at some point, and that domain name was pointing to a different server where these files did exist.
While these are a little annoying to see in Analytics, from what I’ve read, 404s don’t negatively impact the site from an SEO standpoint, and there’s no evidence that the site itself is compromised at all, so unless we see evidence otherwise, I wouldn’t worry about these.
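The referral check mentioned above can also be done outside Analytics if the raw web server logs are available. The following is a minimal sketch, assuming logs in the common "combined" format; the sample lines, IP addresses, and the `spam.example` referrer are invented for illustration and are not data from this site.

```python
import re
from collections import Counter
from typing import Iterable

# Combined log format: ... "METHOD path HTTP/x" status size "referer" "user-agent"
LOG_LINE = re.compile(
    r'"(?:GET|HEAD|POST) (?P<path>\S+) [^"]*" (?P<status>\d{3}) \S+ "(?P<referer>[^"]*)"'
)


def pdf_referrers(lines: Iterable[str]) -> Counter:
    """Count referrers for requests whose path ends in .pdf."""
    counts: Counter = Counter()
    for line in lines:
        match = LOG_LINE.search(line)
        if not match:
            continue  # skip lines that are not in the expected format
        path = match.group("path").split("?", 1)[0]
        if path.lower().endswith(".pdf"):
            counts[match.group("referer") or "-"] += 1
    return counts


# Made-up sample log lines, for illustration only.
sample = [
    '203.0.113.5 - - [01/Jan/2020:00:00:01 +0000] "GET /lulu-the-lioness-a-heroines-story.pdf HTTP/1.1" 404 512 "http://spam.example/list.html" "Mozilla/5.0"',
    '203.0.113.6 - - [01/Jan/2020:00:00:02 +0000] "GET /courses/ HTTP/1.1" 200 1024 "-" "Mozilla/5.0"',
    '203.0.113.7 - - [01/Jan/2020:00:00:03 +0000] "GET /other.pdf HTTP/1.1" 404 512 "http://spam.example/list.html" "Mozilla/5.0"',
]

for referer, hits in pdf_referrers(sample).most_common():
    print(hits, referer)
```

A handful of referrers accounting for most of the PDF hits would point to the "another site is linking to these files" explanation.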
-
Hi,
Is the PDF traffic coming from your own site or from other sites?